Bug 212649

Summary: general protection fault, probably for non-canonical address 0x1856385d1408f284: 0000 [#1] SMP NOPTI, RIP: 0010:kmem_cache_alloc_trace+0xe9/0x2f0
Product: Memory Management Reporter: Erhard F. (erhard_f)
Component: OtherAssignee: Andrew Morton (akpm)
Status: RESOLVED OBSOLETE    
Severity: normal CC: airlied, dri-devel
Priority: P1    
Hardware: x86-64   
OS: Linux   
Kernel Version: 5.12-rc7 Subsystem:
Regression: No Bisected commit-id:
Attachments: dmesg (kernel 5.12-rc7)
kernel .config (kernel 5.12-rc7)

Description Erhard F. 2021-04-12 15:01:08 UTC
Created attachment 296333 [details]
dmesg (kernel 5.12-rc7)

Having seemingly random lockups probably related to my GeForce GT 710 I deciced to run a -rc kernel with some debugging switched on and got this trace pretty soon after booting. I am uncertain whether this is strictly nouveau or memory or locking related so I am posting here and CCing some. Apologies if this is unappropriate.

[...]
general protection fault, probably for non-canonical address 0x1856385d1408f284: 0000 [#1] SMP NOPTI
CPU: 0 PID: 582 Comm: X Not tainted 5.12.0-rc7-Excavator #2
Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./A320M-HDV R3.0, BIOS P3.10 06/26/2019
RIP: 0010:kmem_cache_alloc_trace+0xe9/0x2f0
Code: 0d 24 19 df 59 48 8b 01 48 83 79 10 00 48 89 04 24 0f 84 c8 01 00 00 48 85 c0 0f 84 bf 01 00 00 8b 4d 28 48 8b 7d 00 48 01 c1 <48> 8b 19 48 89 ce 48 33 9d b0 00 00 00 48 8d 4a 01 48 0f ce 48 31
RSP: 0018:ffffae32c9b27698 EFLAGS: 00010206
RAX: 1856385d1408f254 RBX: 0000000000000dc0 RCX: 1856385d1408f284
RDX: 00000000000077ba RSI: 0000000000000dc0 RDI: 00000000000391d0
RBP: ffff899180042600 R08: 00000000a680b1df R09: 0000000000000001
R10: 0000000000000001 R11: 0000000000000001 R12: 0000000000000dc0
R13: 0000000000000048 R14: ffffffffc0a98e03 R15: 0000000000000000
FS:  00007f297c4cd980(0000) GS:ffff899487400000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 000055d54909f790 CR3: 0000000151932000 CR4: 00000000001506f0
Call Trace:
 nvkm_mem_new_type+0xd3/0x2d0 [nouveau]
 nvkm_umem_new+0x12e/0x210 [nouveau]
 nvkm_ioctl_new+0x148/0x200 [nouveau]
 ? nvkm_umem_search+0xf0/0xf0 [nouveau]
 nvkm_ioctl+0xdc/0x180 [nouveau]
 nvif_object_ctor+0x119/0x1c0 [nouveau]
 nvif_mem_ctor_type+0xbd/0x180 [nouveau]
 ? nouveau_mem_host+0xc1/0x1c0 [nouveau]
 ? nouveau_mem_host+0xc1/0x1c0 [nouveau]
 ? rcu_read_lock_sched_held+0x3a/0x70
 ? __iommu_map+0x193/0x1f0
 nouveau_mem_host+0x100/0x1c0 [nouveau]
 nouveau_sgdma_bind+0x2b/0x80 [nouveau]
 nouveau_bo_move+0x6a/0x990 [nouveau]
 ? ttm_pool_alloc+0x4fa/0x5e0 [ttm]
 ttm_bo_handle_move_mem+0x8b/0x170 [ttm]
 ttm_bo_validate+0x193/0x1f0 [ttm]
 ? ttm_bo_init_reserved+0x373/0x3d0 [ttm]
 ttm_bo_init_reserved+0x2f2/0x3d0 [ttm]
 ttm_bo_init+0x59/0x130 [ttm]
 ? nouveau_ttm_io_mem_free+0x70/0x70 [nouveau]
 nouveau_bo_init+0xa3/0xc0 [nouveau]
 ? nouveau_ttm_io_mem_free+0x70/0x70 [nouveau]
 ? nouveau_gem_new+0xf0/0xf0 [nouveau]
 nouveau_gem_new+0x7a/0xf0 [nouveau]
 nouveau_gem_ioctl_new+0x4e/0xf0 [nouveau]
 ? nouveau_gem_new+0xf0/0xf0 [nouveau]
 drm_ioctl_kernel+0xa8/0xf0 [drm]
 drm_ioctl+0x20a/0x3a0 [drm]
 ? nouveau_gem_new+0xf0/0xf0 [nouveau]
 ? lockdep_hardirqs_on_prepare+0xd4/0x180
 ? _raw_spin_unlock_irqrestore+0x42/0x50
 nouveau_drm_ioctl+0x50/0xa0 [nouveau]
 __x64_sys_ioctl+0x81/0xa0
 do_syscall_64+0x33/0x40
 entry_SYSCALL_64_after_hwframe+0x44/0xae
RIP: 0033:0x7f297c0c6937
Code: 3c 1c 48 f7 d8 4c 39 e0 77 b9 e8 24 ff ff ff 85 c0 78 be 5b 4c 89 e0 5d 41 5c c3 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 01 45 0c 00 f7 d8 64 89 01 48
RSP: 002b:00007ffda9c21188 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
RAX: ffffffffffffffda RBX: 000000000000000e RCX: 00007f297c0c6937
RDX: 00007ffda9c211e8 RSI: 00000000c0306480 RDI: 000000000000000e
RBP: 00000000c0306480 R08: 000055ca69b350f0 R09: 00007ffda9c212c0
R10: 0000000000001000 R11: 0000000000000246 R12: 00007ffda9c211e8
R13: 00007ffda9c211e8 R14: 00007ffda9c212c0 R15: 0000000020000002
Modules linked in: rfkill dm_crypt nhpoly1305_avx2 nhpoly1305 chacha_generic chacha_x86_64 libchacha adiantum libpoly1305 algif_skcipher af_alg input_leds joydev hid_generic usbhid hid dm_mod ghash_generic gf128mul gcm xts ctr raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx nouveau ext4 crc16 mbcache jbd2 cbc md_mod crc32_generic evdev crc32_pclmul f2fs aes_generic ecb wmi aesni_intel led_class libaes crypto_simd drm_ttm_helper cryptd ttm i2c_algo_bit snd_hda_codec_generic snd_hda_codec_hdmi drm_kms_helper cfbfillrect snd_hda_intel syscopyarea cfbimgblt snd_intel_dspcfg sysfillrect snd_hda_codec sysimgblt fb_sys_fops cfbcopyarea snd_hwdep fam15h_power k10temp snd_hda_core fb snd_pcm font fbdev i2c_piix4 xhci_pci snd_timer ehci_pci snd xhci_hcd ehci_hcd soundcore usbcore usb_common video button acpi_cpufreq nfsd zram zsmalloc auth_rpcgss nct6775 hwmon_vid jc42 hwmon lockd drm grace fuse drm_panel_orientation_quirks configfs backlight sunrpc efivarfs
---[ end trace c2935bae5672ef71 ]---
RIP: 0010:kmem_cache_alloc_trace+0xe9/0x2f0
Code: 0d 24 19 df 59 48 8b 01 48 83 79 10 00 48 89 04 24 0f 84 c8 01 00 00 48 85 c0 0f 84 bf 01 00 00 8b 4d 28 48 8b 7d 00 48 01 c1 <48> 8b 19 48 89 ce 48 33 9d b0 00 00 00 48 8d 4a 01 48 0f ce 48 31
RSP: 0018:ffffae32c9b27698 EFLAGS: 00010206
RAX: 1856385d1408f254 RBX: 0000000000000dc0 RCX: 1856385d1408f284
RDX: 00000000000077ba RSI: 0000000000000dc0 RDI: 00000000000391d0
RBP: ffff899180042600 R08: 00000000a680b1df R09: 0000000000000001
R10: 0000000000000001 R11: 0000000000000001 R12: 0000000000000dc0
R13: 0000000000000048 R14: ffffffffc0a98e03 R15: 0000000000000000
FS:  00007f297c4cd980(0000) GS:ffff899487400000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 000055d54909f790 CR3: 0000000151932000 CR4: 00000000001506f0
general protection fault, probably for non-canonical address 0x1856385d1408f284: 0000 [#2] SMP NOPTI
CPU: 0 PID: 1066 Comm: firefox Tainted: G      D           5.12.0-rc7-Excavator #2
Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./A320M-HDV R3.0, BIOS P3.10 06/26/2019
RIP: 0010:kmem_cache_alloc_trace+0xe9/0x2f0
Code: 0d 24 19 df 59 48 8b 01 48 83 79 10 00 48 89 04 24 0f 84 c8 01 00 00 48 85 c0 0f 84 bf 01 00 00 8b 4d 28 48 8b 7d 00 48 01 c1 <48> 8b 19 48 89 ce 48 33 9d b0 00 00 00 48 8d 4a 01 48 0f ce 48 31
RSP: 0018:ffffae32c9effd28 EFLAGS: 00010206
RAX: 1856385d1408f254 RBX: 0000000000000dc0 RCX: 1856385d1408f284
RDX: 00000000000077ba RSI: 0000000000000dc0 RDI: 00000000000391d0
RBP: ffff899180042600 R08: 0000000000000001 R09: 0000000000000000
R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000000dc0
R13: 0000000000000058 R14: ffffffffa63e53f0 R15: 0000000000000000
FS:  00007efed9619780(0000) GS:ffff899487400000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: 00007efed8dfeeb8 CR3: 0000000116188000 CR4: 00000000001506f0
Call Trace:
 copy_semundo+0x70/0xe0
 copy_process+0x92c/0x1bb0
 kernel_clone+0x96/0x360
 __do_sys_clone+0x60/0x80
 do_syscall_64+0x33/0x40
 entry_SYSCALL_64_after_hwframe+0x44/0xae
RIP: 0033:0x7efed930f971
Code: 48 85 ff 74 3d 48 85 f6 74 38 48 83 ee 10 48 89 4e 08 48 89 3e 48 89 d7 4c 89 c2 4d 89 c8 4c 8b 54 24 08 b8 38 00 00 00 0f 05 <48> 85 c0 7c 13 74 01 c3 31 ed 58 5f ff d0 48 89 c7 b8 3c 00 00 00
RSP: 002b:00007fff37f36d68 EFLAGS: 00000202 ORIG_RAX: 0000000000000038
RAX: ffffffffffffffda RBX: 00007efed8dff640 RCX: 00007efed930f971
RDX: 00007efed8dff910 RSI: 00007efed8dfeeb0 RDI: 00000000003d0f00
RBP: 00007fff37f36e20 R08: 00007efed8dff640 R09: 00007efed8dff640
R10: 00007efed8dff910 R11: 0000000000000202 R12: 00007fff37f36e1e
R13: 00007fff37f36e1f R14: 0000000000000000 R15: 00007efed8dff640
Modules linked in: rfkill dm_crypt nhpoly1305_avx2 nhpoly1305 chacha_generic chacha_x86_64 libchacha adiantum libpoly1305 algif_skcipher af_alg input_leds joydev hid_generic usbhid hid dm_mod ghash_generic gf128mul gcm xts ctr raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx nouveau ext4 crc16 mbcache jbd2 cbc md_mod crc32_generic evdev crc32_pclmul f2fs aes_generic ecb wmi aesni_intel led_class libaes crypto_simd drm_ttm_helper cryptd ttm i2c_algo_bit snd_hda_codec_generic snd_hda_codec_hdmi drm_kms_helper cfbfillrect snd_hda_intel syscopyarea cfbimgblt snd_intel_dspcfg sysfillrect snd_hda_codec sysimgblt fb_sys_fops cfbcopyarea snd_hwdep fam15h_power k10temp snd_hda_core fb snd_pcm font fbdev i2c_piix4 xhci_pci snd_timer ehci_pci snd xhci_hcd ehci_hcd soundcore usbcore usb_common video button acpi_cpufreq nfsd zram zsmalloc auth_rpcgss nct6775 hwmon_vid jc42 hwmon lockd drm grace fuse drm_panel_orientation_quirks configfs backlight sunrpc efivarfs
---[ end trace c2935bae5672ef72 ]---
RIP: 0010:kmem_cache_alloc_trace+0xe9/0x2f0
[...]


 # lspci 
00:00.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Root Complex
00:00.2 IOMMU: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) I/O Memory Management Unit
00:02.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Host Bridge
00:02.4 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Root Port
00:02.5 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Root Port
00:03.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Host Bridge
00:03.1 PCI bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Root Port
00:08.0 Encryption controller: Advanced Micro Devices, Inc. [AMD] Carrizo Platform Security Processor
00:09.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Carrizo Audio Dummy Host Bridge
00:09.2 Audio device: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Audio Controller
00:10.0 USB controller: Advanced Micro Devices, Inc. [AMD] FCH USB XHCI Controller (rev 20)
00:11.0 SATA controller: Advanced Micro Devices, Inc. [AMD] FCH SATA Controller [AHCI mode] (rev 49)
00:12.0 USB controller: Advanced Micro Devices, Inc. [AMD] FCH USB EHCI Controller (rev 49)
00:14.0 SMBus: Advanced Micro Devices, Inc. [AMD] FCH SMBus Controller (rev 4a)
00:14.3 ISA bridge: Advanced Micro Devices, Inc. [AMD] FCH LPC Bridge (rev 11)
00:18.0 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Function 0
00:18.1 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Function 1
00:18.2 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Function 2
00:18.3 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Function 3
00:18.4 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Function 4
00:18.5 Host bridge: Advanced Micro Devices, Inc. [AMD] Family 15h (Models 60h-6fh) Processor Function 5
01:00.0 USB controller: Advanced Micro Devices, Inc. [AMD] Device 43bc (rev 02)
01:00.1 SATA controller: Advanced Micro Devices, Inc. [AMD] Device 43b8 (rev 02)
01:00.2 PCI bridge: Advanced Micro Devices, Inc. [AMD] Device 43b3 (rev 02)
02:04.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] 300 Series Chipset PCIe Port (rev 02)
02:06.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] 300 Series Chipset PCIe Port (rev 02)
02:07.0 PCI bridge: Advanced Micro Devices, Inc. [AMD] 300 Series Chipset PCIe Port (rev 02)
04:00.0 VGA compatible controller: NVIDIA Corporation GK208B [GeForce GT 710] (rev a1)
04:00.1 Audio device: NVIDIA Corporation GK208 HDMI/DP Audio Controller (rev a1)
05:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 15)
06:00.0 Non-Volatile memory controller: Sandisk Corp WD Black 2019/PC SN750 NVMe SSD
07:00.0 USB controller: ASMedia Technology Inc. ASM2142 USB 3.1 Host Controller
Comment 1 Erhard F. 2021-04-12 15:01:48 UTC
Created attachment 296335 [details]
kernel .config (kernel 5.12-rc7)
Comment 2 Erhard F. 2022-08-18 15:49:55 UTC
Sold the GeForce GT 710 some month ago, no longer able to test.