Bug 206697

Summary: #PF: supervisor read access in kernel mode, #PF: error_code(0x0000) - not-present page while building a large project
Product: Memory Management Reporter: Erhard F. (erhard_f)
Component: Page AllocatorAssignee: Andrew Morton (akpm)
Status: RESOLVED OBSOLETE    
Severity: normal CC: alexdeucher, dflogeras2, dmitry.klimanov, gabriel.sevecek, kernel-bugzilla, kernel, mpagano, taz.007, ukrovt
Priority: P1    
Hardware: IA-32   
OS: Linux   
Kernel Version: 5.5.6 Subsystem:
Regression: No Bisected commit-id:
Attachments: dmesg (kernel 5.5.6, Shuttle XPC FS51, Pentium 4)
kernel .config (kernel 5.5.6, Shuttle XPC FS51, Pentium 4)
dmesg (kernel 5.6-rc4, Shuttle XPC FS51, Pentium 4)
bisect01.log
revert

Description Erhard F. 2020-02-28 00:34:02 UTC
Created attachment 287693 [details]
dmesg (kernel 5.5.6, Shuttle XPC FS51, Pentium 4)

Happens every time on this machine when I build a large project (e.g. boost-1.72.0):

[...]
[ 1079.810216] BUG: kernel NULL pointer dereference, address: 00000000
[ 1079.810430] #PF: supervisor read access in kernel mode
[ 1079.810583] #PF: error_code(0x0000) - not-present page
[ 1079.810736] *pde = 00000000 
[ 1079.810825] Oops: 0000 [#1] SMP
[ 1079.810921] CPU: 0 PID: 53 Comm: kswapd0 Not tainted 5.5.6-gentoo-Pentium4 #6
[ 1079.811134] Hardware name:  /FS51, BIOS 6.00 PG 12/02/2003
[ 1079.811304] EIP: __cpa_process_fault+0x205/0x226
[ 1079.811443] Code: 2d 00 00 00 40 39 d0 76 1f 81 fa ff ff ff bf 76 17 c7 47 10 01 00 00 00 81 c3 00 00 00 40 c1 eb 0c 89 5f 18 31 f6 eb 19 8b 07 <ff> 30 53 68 54 fb c8 ca e8 84 7a 00 00 0f 0b 83 c4 0c be f2 ff ff
[ 1079.811979] EAX: 00000000 EBX: 00000000 ECX: 00000001 EDX: 00000000
[ 1079.812167] ESI: 00000001 EDI: f5e6fd44 EBP: f5e6fcbc ESP: f5e6fc98
[ 1079.812355] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 EFLAGS: 00010213
[ 1079.812557] CR0: 80050033 CR2: 00000000 CR3: 052af000 CR4: 000006d0
[ 1079.812740] Call Trace:
[ 1079.812820]  ? _raw_spin_lock+0x22/0x2a
[ 1079.812936]  ? lookup_address+0x1d/0x20
[ 1079.813047]  __change_page_attr_set_clr+0x85/0x551
[ 1079.813189]  ? __mutex_unlock_slowpath+0x20/0x1b6
[ 1079.813326]  ? mutex_unlock+0xb/0xd
[ 1079.813432]  ? _vm_unmap_aliases.part.0+0x11f/0x127
[ 1079.813575]  change_page_attr_set_clr+0xdc/0x1af
[ 1079.813715]  set_pages_array_wb+0x20/0x7b
[ 1079.813848]  ttm_pages_put+0x22/0x71 [ttm]
[ 1079.813975]  ttm_page_pool_free+0xf6/0x111 [ttm]
[ 1079.814116]  ttm_pool_shrink_scan+0x9c/0xd1 [ttm]
[ 1079.814261]  shrink_slab.constprop.0+0x248/0x38f
[ 1079.814398]  shrink_node+0x54a/0x70c
[ 1079.814505]  kswapd+0x4b9/0x62d
[ 1079.814601]  ? kswapd+0x4b9/0x62d
[ 1079.814705]  kthread+0xd1/0xd3
[ 1079.814797]  ? try_to_free_pages+0x3d4/0x3d4
[ 1079.814925]  ? kthread_delayed_work_timer_fn+0x6a/0x6a
[ 1079.815076]  ret_from_fork+0x2e/0x38
[ 1079.815182] Modules linked in: auth_rpcgss nfsv4 dns_resolver nfs lockd grace sunrpc ctr aes_generic libaes ccm hid_generic usbhid hid rt2500pci eeprom_93cx6 rt2x00pci rt2x00mmio rt2x00lib led_class mac80211 radeon evdev hwmon i2c_algo_bit drm_kms_helper cfbfillrect syscopyarea cfbimgblt sysfillrect sysimgblt fb_sys_fops cfbcopyarea fb font firewire_ohci fbdev ttm firewire_core 8139too crc_itu_t mii sr_mod cfg80211 cdrom drm fan thermal drm_panel_orientation_quirks ohci_pci snd_intel8x0 backlight 8250 snd_ac97_codec 8250_base serial_core ehci_pci ohci_hcd rfkill ehci_hcd ac97_bus libarc4 snd_pcm usbcore sis_agp snd_timer usb_common snd agpgart button processor soundcore i2c_sis96x
[ 1079.835024] CR2: 0000000000000000
[ 1079.844798] ---[ end trace 2acb3661952bc786 ]---
[ 1079.854557] EIP: __cpa_process_fault+0x205/0x226
[ 1079.864444] Code: 2d 00 00 00 40 39 d0 76 1f 81 fa ff ff ff bf 76 17 c7 47 10 01 00 00 00 81 c3 00 00 00 40 c1 eb 0c 89 5f 18 31 f6 eb 19 8b 07 <ff> 30 53 68 54 fb c8 ca e8 84 7a 00 00 0f 0b 83 c4 0c be f2 ff ff
[ 1079.874942] EAX: 00000000 EBX: 00000000 ECX: 00000001 EDX: 00000000
[ 1079.885370] ESI: 00000001 EDI: f5e6fd44 EBP: f5e6fcbc ESP: f5e6fc98
[ 1079.895776] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 EFLAGS: 00010213
[ 1079.906304] CR0: 80050033 CR2: 00000000 CR3: 052af000 CR4: 000006d0

Don't think it has much to do with ttm strictly speaking, as I am running this machine without X via ssh.

# inxi -b
System:    Kernel: 5.5.6-gentoo-Pentium4 i686 bits: 32 Console: tty 0 
           Distro: Gentoo Base System release 2.6 
Machine:   Type: Desktop Mobo: Shuttle model: FS51 serial: N/A BIOS: Phoenix v: 6.00 PG 
           date: 12/02/2003 
CPU:       Single Core: Intel Pentium 4 type: MCP speed: 2796 MHz 
Graphics:  Device-1: AMD RV350 [Radeon 9550/9600/X1050 Series] driver: radeon v: kernel 
           Display: server: X.org 1.20.6 driver: ati,radeon unloaded: fbdev,modesetting tty: 104x53 
           Message: Advanced graphics data unavailable in console for root. 
Network:   Device-1: Ralink RT2500 Wireless 802.11bg driver: rt2500pci 
           Device-2: Realtek RTL-8100/8101L/8139 PCI Fast Ethernet Adapter driver: 8139too 
Drives:    Local Storage: total: 76.69 GiB used: 2.77 GiB (3.6%) 
Info:      Processes: 95 Uptime: 18m Memory: 1.97 GiB used: 159.6 MiB (7.9%) Init: systemd 
           Shell: bash inxi: 3.0.36
Comment 1 Erhard F. 2020-02-28 00:34:42 UTC
Created attachment 287695 [details]
kernel .config (kernel 5.5.6, Shuttle XPC FS51, Pentium 4)
Comment 2 Erhard F. 2020-03-02 21:55:10 UTC
Created attachment 287765 [details]
dmesg (kernel 5.6-rc4, Shuttle XPC FS51, Pentium 4)

Same on kernel 5.6-rc4:

[...]
[  908.356444] BUG: kernel NULL pointer dereference, address: 00000000
[  908.356670] #PF: supervisor read access in kernel mode
[  908.356823] #PF: error_code(0x0000) - not-present page
[  908.356974] *pde = 00000000 
[  908.357064] Oops: 0000 [#1] SMP
[  908.357163] CPU: 0 PID: 53 Comm: kswapd0 Not tainted 5.6.0-rc4-Pentium4 #1
[  908.357367] Hardware name:  /FS51, BIOS 6.00 PG 12/02/2003
[  908.357535] EIP: __cpa_process_fault+0x205/0x226
[  908.357674] Code: 2d 00 00 00 40 39 d0 76 1f 81 fa ff ff ff bf 76 17 c7 47 10 01 00 00 00 81 c3 00 00 00 40 c1 eb 0c 89 5f 18 31 f6 eb 19 8b 07 <ff> 30 53 68 56 ba 89 c9 e8 f8 68 00 00 0f 0b 83 c4 0c be f2 ff ff
[  908.358228] EAX: 00000000 EBX: 00000000 ECX: 00000001 EDX: 00000000
[  908.358411] ESI: 00000001 EDI: f5e6fd4c EBP: f5e6fcc4 ESP: f5e6fca0
[  908.358598] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 EFLAGS: 00010213
[  908.358798] CR0: 80050033 CR2: 00000000 CR3: 00333000 CR4: 000006d0
[  908.358981] Call Trace:
[  908.359062]  ? _raw_spin_lock+0x22/0x2a
[  908.359176]  ? lookup_address+0x1d/0x20
[  908.359289]  __change_page_attr_set_clr+0x85/0x551
[  908.359436]  ? __mutex_unlock_slowpath+0x20/0x1b6
[  908.368244]  ? mutex_unlock+0xb/0xd
[  908.377037]  ? _vm_unmap_aliases.part.0+0x11f/0x127
[  908.385944]  change_page_attr_set_clr+0xdc/0x1af
[  908.394889]  set_pages_array_wb+0x20/0x7b
[  908.403630]  ttm_pages_put+0x22/0x71 [ttm]
[  908.412159]  ttm_page_pool_free+0xf6/0x111 [ttm]
[  908.420492]  ttm_pool_shrink_scan+0x9c/0xd1 [ttm]
[  908.428885]  shrink_slab.constprop.0+0x248/0x38f
[  908.437241]  shrink_node+0x533/0x6f2
[  908.445492]  kswapd+0x4b9/0x62d
[  908.453714]  ? kswapd+0x4b9/0x62d
[  908.461937]  kthread+0xd1/0xd3
[  908.470055]  ? try_to_free_pages+0x3d4/0x3d4
[  908.478143]  ? kthread_delayed_work_timer_fn+0x6a/0x6a
[  908.486242]  ret_from_fork+0x2e/0x38
[  908.494256] Modules linked in: auth_rpcgss nfsv4 dns_resolver nfs lockd grace sunrpc ctr aes_generic libaes ccm hid_generic usbhid hid rt2500pci eeprom_93cx6 rt2x00pci rt2x00mmio rt2x00lib led_class mac80211 radeon evdev hwmon i2c_algo_bit drm_kms_helper cfbfillrect syscopyarea cfg80211 cfbimgblt sysfillrect sysimgblt fb_sys_fops cfbcopyarea firewire_ohci firewire_core fb rfkill font crc_itu_t 8139too libarc4 mii fbdev sr_mod cdrom ttm thermal fan ohci_pci drm 8250 snd_intel8x0 8250_base serial_core snd_ac97_codec ac97_bus ehci_pci ohci_hcd snd_pcm drm_panel_orientation_quirks ehci_hcd button backlight snd_timer usbcore sis_agp agpgart snd i2c_sis96x usb_common processor soundcore zstd zram zsmalloc
[  908.522107] CR2: 0000000000000000
[  908.531646] ---[ end trace f8cc5b63e4c76d19 ]---
[  908.541190] EIP: __cpa_process_fault+0x205/0x226
[  908.550708] Code: 2d 00 00 00 40 39 d0 76 1f 81 fa ff ff ff bf 76 17 c7 47 10 01 00 00 00 81 c3 00 00 00 40 c1 eb 0c 89 5f 18 31 f6 eb 19 8b 07 <ff> 30 53 68 56 ba 89 c9 e8 f8 68 00 00 0f 0b 83 c4 0c be f2 ff ff
[  908.560958] EAX: 00000000 EBX: 00000000 ECX: 00000001 EDX: 00000000
[  908.571156] ESI: 00000001 EDI: f5e6fd4c EBP: f5e6fcc4 ESP: f5e6fca0
[  908.581412] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 EFLAGS: 00010213
[  908.591710] CR0: 80050033 CR2: 00000000 CR3: 00333000 CR4: 000006d0
Comment 3 Andrew Morton 2020-03-02 23:03:31 UTC
(switched to email.  Please respond via emailed reply-to-all, not via the
bugzilla web interface).

On Mon, 02 Mar 2020 21:55:10 +0000 bugzilla-daemon@bugzilla.kernel.org wrote:

> https://bugzilla.kernel.org/show_bug.cgi?id=206697
> 
> --- Comment #2 from Erhard F. (erhard_f@mailbox.org) ---
> Created attachment 287765 [details]
>   --> https://bugzilla.kernel.org/attachment.cgi?id=287765&action=edit
> dmesg (kernel 5.6-rc4, Shuttle XPC FS51, Pentium 4)
> 
> Same on kernel 5.6-rc4:

Thanks.  This looks like a regression in the DRM code.  I've added
suitable Ccs.


> [...]
> [  908.356444] BUG: kernel NULL pointer dereference, address: 00000000
> [  908.356670] #PF: supervisor read access in kernel mode
> [  908.356823] #PF: error_code(0x0000) - not-present page
> [  908.356974] *pde = 00000000 
> [  908.357064] Oops: 0000 [#1] SMP
> [  908.357163] CPU: 0 PID: 53 Comm: kswapd0 Not tainted 5.6.0-rc4-Pentium4 #1
> [  908.357367] Hardware name:  /FS51, BIOS 6.00 PG 12/02/2003
> [  908.357535] EIP: __cpa_process_fault+0x205/0x226
> [  908.357674] Code: 2d 00 00 00 40 39 d0 76 1f 81 fa ff ff ff bf 76 17 c7 47
> 10 01 00 00 00 81 c3 00 00 00 40 c1 eb 0c 89 5f 18 31 f6 eb 19 8b 07 <ff> 30
> 53
> 68 56 ba 89 c9 e8 f8 68 00 00 0f 0b 83 c4 0c be f2 ff ff
> [  908.358228] EAX: 00000000 EBX: 00000000 ECX: 00000001 EDX: 00000000
> [  908.358411] ESI: 00000001 EDI: f5e6fd4c EBP: f5e6fcc4 ESP: f5e6fca0
> [  908.358598] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 EFLAGS: 00010213
> [  908.358798] CR0: 80050033 CR2: 00000000 CR3: 00333000 CR4: 000006d0
> [  908.358981] Call Trace:
> [  908.359062]  ? _raw_spin_lock+0x22/0x2a
> [  908.359176]  ? lookup_address+0x1d/0x20
> [  908.359289]  __change_page_attr_set_clr+0x85/0x551
> [  908.359436]  ? __mutex_unlock_slowpath+0x20/0x1b6
> [  908.368244]  ? mutex_unlock+0xb/0xd
> [  908.377037]  ? _vm_unmap_aliases.part.0+0x11f/0x127
> [  908.385944]  change_page_attr_set_clr+0xdc/0x1af
> [  908.394889]  set_pages_array_wb+0x20/0x7b
> [  908.403630]  ttm_pages_put+0x22/0x71 [ttm]
> [  908.412159]  ttm_page_pool_free+0xf6/0x111 [ttm]
> [  908.420492]  ttm_pool_shrink_scan+0x9c/0xd1 [ttm]
> [  908.428885]  shrink_slab.constprop.0+0x248/0x38f
> [  908.437241]  shrink_node+0x533/0x6f2
> [  908.445492]  kswapd+0x4b9/0x62d
> [  908.453714]  ? kswapd+0x4b9/0x62d
> [  908.461937]  kthread+0xd1/0xd3
> [  908.470055]  ? try_to_free_pages+0x3d4/0x3d4
> [  908.478143]  ? kthread_delayed_work_timer_fn+0x6a/0x6a
> [  908.486242]  ret_from_fork+0x2e/0x38
> [  908.494256] Modules linked in: auth_rpcgss nfsv4 dns_resolver nfs lockd
> grace sunrpc ctr aes_generic libaes ccm hid_generic usbhid hid rt2500pci
> eeprom_93cx6 rt2x00pci rt2x00mmio rt2x00lib led_class mac80211 radeon evdev
> hwmon i2c_algo_bit drm_kms_helper cfbfillrect syscopyarea cfg80211 cfbimgblt
> sysfillrect sysimgblt fb_sys_fops cfbcopyarea firewire_ohci firewire_core fb
> rfkill font crc_itu_t 8139too libarc4 mii fbdev sr_mod cdrom ttm thermal fan
> ohci_pci drm 8250 snd_intel8x0 8250_base serial_core snd_ac97_codec ac97_bus
> ehci_pci ohci_hcd snd_pcm drm_panel_orientation_quirks ehci_hcd button
> backlight snd_timer usbcore sis_agp agpgart snd i2c_sis96x usb_common
> processor
> soundcore zstd zram zsmalloc
> [  908.522107] CR2: 0000000000000000
> [  908.531646] ---[ end trace f8cc5b63e4c76d19 ]---
> [  908.541190] EIP: __cpa_process_fault+0x205/0x226
> [  908.550708] Code: 2d 00 00 00 40 39 d0 76 1f 81 fa ff ff ff bf 76 17 c7 47
> 10 01 00 00 00 81 c3 00 00 00 40 c1 eb 0c 89 5f 18 31 f6 eb 19 8b 07 <ff> 30
> 53
> 68 56 ba 89 c9 e8 f8 68 00 00 0f 0b 83 c4 0c be f2 ff ff
> [  908.560958] EAX: 00000000 EBX: 00000000 ECX: 00000001 EDX: 00000000
> [  908.571156] ESI: 00000001 EDI: f5e6fd4c EBP: f5e6fcc4 ESP: f5e6fca0
> [  908.581412] DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 EFLAGS: 00010213
> [  908.591710] CR0: 80050033 CR2: 00000000 CR3: 00333000 CR4: 000006d0
Comment 4 Erhard F. 2020-03-06 00:44:11 UTC
On Mon, 02 Mar 2020 23:03:31 +0000
bugzilla-daemon@bugzilla.kernel.org wrote:

> https://bugzilla.kernel.org/show_bug.cgi?id=206697
> 
> --- Comment #3 from Andrew Morton (akpm@linux-foundation.org) ---
> (switched to email.  Please respond via emailed reply-to-all, not via the
> bugzilla web interface).
> 
> On Mon, 02 Mar 2020 21:55:10 +0000 bugzilla-daemon@bugzilla.kernel.org wrote:
> 
> > https://bugzilla.kernel.org/show_bug.cgi?id=206697
> > 
> > --- Comment #2 from Erhard F. (erhard_f@mailbox.org) ---
> > Created attachment 287765 [details]  
> >   --> https://bugzilla.kernel.org/attachment.cgi?id=287765&action=edit  
> > dmesg (kernel 5.6-rc4, Shuttle XPC FS51, Pentium 4)
> > 
> > Same on kernel 5.6-rc4:  
> 
> Thanks.  This looks like a regression in the DRM code.  I've added
> suitable Ccs.

Been running the box without loading ttm, drm, radeon modules (kernel 5.6-rc4) via ssh for two days now building stuff. Which worked flawlessly. With ttm, drm, radeon loaded I hit this bug within half an hour. So it really seems the drm code causing this bug.
Comment 5 Erhard F. 2020-04-17 07:45:23 UTC
Looks mostly the same on kernel 5.7-rc1. The line after kthread+0xd1/0xd3 is different. It was "? try_to_free_pages+0x3d4/0x3d4" on 5.5.6 and 5.6-rc4, but is "? shrink_node+0x6f2/0x6f2" on 5.7-rc1 now.

[...]
Apr 17 00:28:40 BUG: kernel NULL pointer dereference, address: 00000000
Apr 17 00:28:40 #PF: supervisor read access in kernel mode
Apr 17 00:28:40 #PF: error_code(0x0000) - not-present page
Apr 17 00:28:40 *pde = 00000000 
Apr 17 00:28:40 Oops: 0000 [#1] SMP
Apr 17 00:28:40 CPU: 0 PID: 53 Comm: kswapd0 Not tainted 5.7.0-rc1-Pentium4 #1
Apr 17 00:28:40 Hardware name:  /FS51, BIOS 6.00 PG 12/02/2003
Apr 17 00:28:40 EIP: __cpa_process_fault+0x205/0x226
Apr 17 00:28:40 Code: 2d 00 00 00 40 39 d0 76 1f 81 fa ff ff ff bf 76 17 c7 47 10 01 00 00 00 81 c3 00 00 00 40 c1 eb 0c 89 5f 18 31 f6 eb 19 8b 07 <ff> 30 53 68 d5 c7 cb d8 e8 cb 68 00 00 0f 0b>
Apr 17 00:28:40 EAX: 00000000 EBX: 00000000 ECX: 00000001 EDX: 00000000
Apr 17 00:28:40 ESI: 00000001 EDI: f5e5bd4c EBP: f5e5bcc4 ESP: f5e5bca0
Apr 17 00:28:40 DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 EFLAGS: 00010213
Apr 17 00:28:40 CR0: 80050033 CR2: 00000000 CR3: 05ab3000 CR4: 000006d0
Apr 17 00:28:40 Call Trace:
Apr 17 00:28:40  ? _raw_spin_lock+0x22/0x2a
Apr 17 00:28:40  ? lookup_address+0x1d/0x20
Apr 17 00:28:40  __change_page_attr_set_clr+0x85/0x551
Apr 17 00:28:40  ? __mutex_unlock_slowpath+0x20/0x1b6
Apr 17 00:28:40  ? mutex_unlock+0xb/0xd
Apr 17 00:28:40  ? _vm_unmap_aliases.part.0+0x11f/0x127
Apr 17 00:28:40  change_page_attr_set_clr+0xdc/0x1af
Apr 17 00:28:40  set_pages_array_wb+0x20/0x7b
Apr 17 00:28:40  ttm_pages_put+0x22/0x71 [ttm]
Apr 17 00:28:40  ttm_page_pool_free+0xa1/0x111 [ttm]
Apr 17 00:28:40  ttm_pool_shrink_scan+0x9c/0xd1 [ttm]
Apr 17 00:28:40  shrink_slab.constprop.0+0x248/0x38f
Apr 17 00:28:40  shrink_node+0x533/0x6f2
Apr 17 00:28:40  kswapd+0x4b6/0x628
Apr 17 00:28:40  ? kswapd+0x4b6/0x628
Apr 17 00:28:40  kthread+0xd1/0xd3
Apr 17 00:28:40  ? shrink_node+0x6f2/0x6f2
Apr 17 00:28:40  ? kthread_delayed_work_timer_fn+0x6a/0x6a
Apr 17 00:28:40  ret_from_fork+0x2e/0x38
Apr 17 00:28:40 Modules linked in: fuse auth_rpcgss nfsv4 dns_resolver nfs lockd grace sunrpc rt2500pci eeprom_93cx6 rt2x00pci rt2x00mmio rt2x00lib led_class mac80211 radeon hwmon i2c_algo_bit d>
Apr 17 00:28:40 CR2: 0000000000000000
Apr 17 00:28:40 ---[ end trace 49fbdfbb6e459a06 ]---
Apr 17 00:28:40 EIP: __cpa_process_fault+0x205/0x226
Apr 17 00:28:40 Code: 2d 00 00 00 40 39 d0 76 1f 81 fa ff ff ff bf 76 17 c7 47 10 01 00 00 00 81 c3 00 00 00 40 c1 eb 0c 89 5f 18 31 f6 eb 19 8b 07 <ff> 30 53 68 d5 c7 cb d8 e8 cb 68 00 00 0f 0b>
Apr 17 00:28:40 EAX: 00000000 EBX: 00000000 ECX: 00000001 EDX: 00000000
Apr 17 00:28:40 ESI: 00000001 EDI: f5e5bd4c EBP: f5e5bcc4 ESP: f5e5bca0
Apr 17 00:28:40 DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 EFLAGS: 00010213
Apr 17 00:28:40 CR0: 80050033 CR2: 00000000 CR3: 05ab3000 CR4: 000006d0
Comment 6 Andrew Morton 2020-04-17 21:58:16 UTC
On Mon, 2 Mar 2020 15:03:29 -0800 Andrew Morton <akpm@linux-foundation.org> wrote:

> 
> (switched to email.  Please respond via emailed reply-to-all, not via the
> bugzilla web interface).
> 
> On Mon, 02 Mar 2020 21:55:10 +0000 bugzilla-daemon@bugzilla.kernel.org wrote:
> 
> > https://bugzilla.kernel.org/show_bug.cgi?id=206697
> > 
> > --- Comment #2 from Erhard F. (erhard_f@mailbox.org) ---
> > Created attachment 287765 [details]
> >   --> https://bugzilla.kernel.org/attachment.cgi?id=287765&action=edit
> > dmesg (kernel 5.6-rc4, Shuttle XPC FS51, Pentium 4)
> > 
> > Same on kernel 5.6-rc4:
> 
> Thanks.  This looks like a regression in the DRM code.  I've added
> suitable Ccs.

Erhard, please let's handle this issue via email, not via the bugzilla
interface.

This does appear to be a DRM issue, and it has been reproduced in
5.7-rc1.

Latest update below:

From: bugzilla-daemon@bugzilla.kernel.org
To: akpm@linux-foundation.org
Subject: [Bug 206697] #PF: supervisor read access in kernel mode, #PF: error_code(0x0000) - not-present page while building a large project
Date: Fri, 17 Apr 2020 07:45:23 +0000

https://bugzilla.kernel.org/show_bug.cgi?id=206697

--- Comment #5 from Erhard F. (erhard_f@mailbox.org) ---
Looks mostly the same on kernel 5.7-rc1. The line after kthread+0xd1/0xd3 is
different. It was "? try_to_free_pages+0x3d4/0x3d4" on 5.5.6 and 5.6-rc4, but
is "? shrink_node+0x6f2/0x6f2" on 5.7-rc1 now.

[...]
Apr 17 00:28:40 BUG: kernel NULL pointer dereference, address: 00000000
Apr 17 00:28:40 #PF: supervisor read access in kernel mode
Apr 17 00:28:40 #PF: error_code(0x0000) - not-present page
Apr 17 00:28:40 *pde = 00000000 
Apr 17 00:28:40 Oops: 0000 [#1] SMP
Apr 17 00:28:40 CPU: 0 PID: 53 Comm: kswapd0 Not tainted 5.7.0-rc1-Pentium4 #1
Apr 17 00:28:40 Hardware name:  /FS51, BIOS 6.00 PG 12/02/2003
Apr 17 00:28:40 EIP: __cpa_process_fault+0x205/0x226
Apr 17 00:28:40 Code: 2d 00 00 00 40 39 d0 76 1f 81 fa ff ff ff bf 76 17 c7 47
10 01 00 00 00 81 c3 00 00 00 40 c1 eb 0c 89 5f 18 31 f6 eb 19 8b 07 <ff> 30 53
68 d5 c7 cb d8 e8 cb 68 00 00 0f 0b>
Apr 17 00:28:40 EAX: 00000000 EBX: 00000000 ECX: 00000001 EDX: 00000000
Apr 17 00:28:40 ESI: 00000001 EDI: f5e5bd4c EBP: f5e5bcc4 ESP: f5e5bca0
Apr 17 00:28:40 DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 EFLAGS: 00010213
Apr 17 00:28:40 CR0: 80050033 CR2: 00000000 CR3: 05ab3000 CR4: 000006d0
Apr 17 00:28:40 Call Trace:
Apr 17 00:28:40  ? _raw_spin_lock+0x22/0x2a
Apr 17 00:28:40  ? lookup_address+0x1d/0x20
Apr 17 00:28:40  __change_page_attr_set_clr+0x85/0x551
Apr 17 00:28:40  ? __mutex_unlock_slowpath+0x20/0x1b6
Apr 17 00:28:40  ? mutex_unlock+0xb/0xd
Apr 17 00:28:40  ? _vm_unmap_aliases.part.0+0x11f/0x127
Apr 17 00:28:40  change_page_attr_set_clr+0xdc/0x1af
Apr 17 00:28:40  set_pages_array_wb+0x20/0x7b
Apr 17 00:28:40  ttm_pages_put+0x22/0x71 [ttm]
Apr 17 00:28:40  ttm_page_pool_free+0xa1/0x111 [ttm]
Apr 17 00:28:40  ttm_pool_shrink_scan+0x9c/0xd1 [ttm]
Apr 17 00:28:40  shrink_slab.constprop.0+0x248/0x38f
Apr 17 00:28:40  shrink_node+0x533/0x6f2
Apr 17 00:28:40  kswapd+0x4b6/0x628
Apr 17 00:28:40  ? kswapd+0x4b6/0x628
Apr 17 00:28:40  kthread+0xd1/0xd3
Apr 17 00:28:40  ? shrink_node+0x6f2/0x6f2
Apr 17 00:28:40  ? kthread_delayed_work_timer_fn+0x6a/0x6a
Apr 17 00:28:40  ret_from_fork+0x2e/0x38
Apr 17 00:28:40 Modules linked in: fuse auth_rpcgss nfsv4 dns_resolver nfs
lockd grace sunrpc rt2500pci eeprom_93cx6 rt2x00pci rt2x00mmio rt2x00lib
led_class mac80211 radeon hwmon i2c_algo_bit d>
Apr 17 00:28:40 CR2: 0000000000000000
Apr 17 00:28:40 ---[ end trace 49fbdfbb6e459a06 ]---
Apr 17 00:28:40 EIP: __cpa_process_fault+0x205/0x226
Apr 17 00:28:40 Code: 2d 00 00 00 40 39 d0 76 1f 81 fa ff ff ff bf 76 17 c7 47
10 01 00 00 00 81 c3 00 00 00 40 c1 eb 0c 89 5f 18 31 f6 eb 19 8b 07 <ff> 30 53
68 d5 c7 cb d8 e8 cb 68 00 00 0f 0b>
Apr 17 00:28:40 EAX: 00000000 EBX: 00000000 ECX: 00000001 EDX: 00000000
Apr 17 00:28:40 ESI: 00000001 EDI: f5e5bd4c EBP: f5e5bcc4 ESP: f5e5bca0
Apr 17 00:28:40 DS: 007b ES: 007b FS: 00d8 GS: 00e0 SS: 0068 EFLAGS: 00010213
Apr 17 00:28:40 CR0: 80050033 CR2: 00000000 CR3: 05ab3000 CR4: 000006d0
Comment 7 Alex Deucher 2020-04-22 20:20:24 UTC
Can you bisect?  If you are not actually using the GPU, I it shouldn't really be allocating any memory other than what it allocates for the console framebuffer and any driver structures.  I think what may be happening is that you are hitting memory pressure and ttm is attempting to return memory and hits some failure.
Comment 8 Erhard F. 2020-04-23 00:24:57 UTC
On Wed, 22 Apr 2020 20:20:24 +0000
bugzilla-daemon@bugzilla.kernel.org wrote:

> https://bugzilla.kernel.org/show_bug.cgi?id=206697
> 
> Alex Deucher (alexdeucher@gmail.com) changed:
> 
>            What    |Removed                     |Added
> ----------------------------------------------------------------------------
>                  CC|                            |alexdeucher@gmail.com
> 
> --- Comment #7 from Alex Deucher (alexdeucher@gmail.com) ---
> Can you bisect?  If you are not actually using the GPU, I it shouldn't really
> be allocating any memory other than what it allocates for the console
> framebuffer and any driver structures.  I think what may be happening is that
> you are hitting memory pressure and ttm is attempting to return memory and
> hits
> some failure.

I will try to but it will take a considerable amount of time.

It's an old i686 box of course, but it's new to me. The oldest kernel I ran this machine on is 5.5.6, the one for which I reported this bug. Older kernels won't boot it's btrfs root partition (xxhash, zstd compressed). So I have to do another install in an ext4 partition, replicate the setup and find a suitable starting point for bisecting.
Comment 9 Erhard F. 2020-05-09 17:29:19 UTC
Created attachment 289027 [details]
bisect01.log

Finally I finished bisecting. Criterion for a 'good' bisect run were successfully building llvm 5 times in a row. At 'bad' bisect runs the bug did show up at the 1st llvm build most of the time, rarely at the 2nd llvm build at the latest.

It turned out the offending commit is also the cause of this org bug:
https://gitlab.freedesktop.org/xorg/driver/xf86-video-ati/-/issues/191#note_489802

# git bisect good | tee -a ~/bisect01.log
33b3ad3788aba846fc8b9a065fe2685a0b64f713 is the first bad commit
commit 33b3ad3788aba846fc8b9a065fe2685a0b64f713
Author: Christoph Hellwig <hch@lst.de>
Date:   Thu Aug 15 09:27:00 2019 +0200

    drm/radeon: handle PCIe root ports with addressing limitations
    
    radeon uses a need_dma32 flag to indicate to the drm core that some
    allocations need to be done using GFP_DMA32, but it only checks the
    device addressing capabilities to make that decision.  Unfortunately
    PCIe root ports that have limited addressing exist as well.  Use the
    dma_addressing_limited instead to also take those into account.
    
    Reviewed-by: Christian König <christian.koenig@amd.com>
    Reported-by: Atish Patra <Atish.Patra@wdc.com>
    Signed-off-by: Christoph Hellwig <hch@lst.de>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>

 drivers/gpu/drm/radeon/radeon.h        |  1 -
 drivers/gpu/drm/radeon/radeon_device.c | 12 +++++-------
 drivers/gpu/drm/radeon/radeon_ttm.c    |  2 +-
 3 files changed, 6 insertions(+), 9 deletions(-)
Comment 10 Alex Deucher 2020-08-17 15:13:22 UTC
Created attachment 290953 [details]
revert

does this revert fix the issue?
Comment 11 Alex Deucher 2020-09-16 20:50:21 UTC
Does it work correctly with 5.9-rc1 or newer?
Comment 12 Alex Deucher 2020-09-16 22:06:25 UTC
Another thing to try, does setting radeon.agpmode=-1 fix the issue?
Comment 13 Erhard F. 2020-09-17 16:24:03 UTC
On Wed, 16 Sep 2020 20:50:21 +0000
bugzilla-daemon@bugzilla.kernel.org wrote:

> https://bugzilla.kernel.org/show_bug.cgi?id=206697
> 
> --- Comment #11 from Alex Deucher (alexdeucher@gmail.com) ---
> Does it work correctly with 5.9-rc1 or newer?

Yes, I had the machine building with 5.9-rc2 for 2 days without issues. Though in my 5.9-rc .config I have not set AGP nor AGP_SIS 'cause it is no longer needed.

I did not try building stuff with an affected kernel and radeon.agpmode=-1 yet. But I shall try and report back as soon as I have.
Comment 14 Erhard F. 2020-09-18 16:10:47 UTC
On Wed, 16 Sep 2020 22:06:25 +0000
bugzilla-daemon@bugzilla.kernel.org wrote:

> https://bugzilla.kernel.org/show_bug.cgi?id=206697
> 
> --- Comment #12 from Alex Deucher (alexdeucher@gmail.com) ---
> Another thing to try, does setting radeon.agpmode=-1 fix the issue?

Yes, booting an affected kernel like 5.8.1 with radeon.agpmode=-1 also seems to fix the issue. Succesfully built llvm 10 times in a row which would normally crash the machine on the 2nd or 3rd build at the latest.
Comment 15 taz.007 2020-12-18 20:22:24 UTC
I'm hitting the same bug with kernel 5.9.14, same stack as the original comment. However I'm using the nouveau driver (so not radeon).

Hardware name: Acer Aspire R3610/FMCP7A-ION-LE, BIOS P01-A4 11/03/2009

Modules linked in: dm_mod rpcsec_gss_krb5 md4 cmac nls_utf8 cifs libdes dns_resolver fscache fuse hwmon_vid nouveau snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio snd_hda_intel snd_intel_dspcfg snd_hda_codec ath5k mxm_wmi snd_hda_core ttm ath mac80211 wmi_bmof drm_kms_helper snd_hwdep coretemp cec snd_pcm cfg80211 input_leds rc_core syscopyarea mousedev snd_timer sysfillrect sysimgblt snd rfkill fb_sys_fops pcspkr i2c_algo_bit libarc4 forcedeth soundcore nv_tco i2c_nforce2 wmi evdev tcp_bbr nfsd drm auth_rpcgss nfs_acl sg lockd grace sunrpc nfs_ssc agpgart ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 hid_generic usbhid hid uas usb_storage ohci_pci ehci_pci ohci_hcd ehci_hcd
Comment 16 Markus 2021-07-14 21:27:40 UTC
I'm also seeing a similar/related bug with nouveau and Kernels 5.10/5.11/5.12

X would freeze randomly (mouse and audio still working, then full crash).
Things are fully stable with Kernel 5.4.
The remainder of the system was not changed (NixOS 21.05 stable).
Three exemplary stack traces obtained with 5.10 and 5.12:

Jun 15 14:48:36 BUG: unable to handle page fault for address: 0000000101652191
Jun 15 14:48:36 #PF: supervisor read access in kernel mode
Jun 15 14:48:36 #PF: error_code(0x0000) - not-present page
Jun 15 14:48:36 PGD 109c90067 P4D 109c90067 PUD 0 
Jun 15 14:48:36 Oops: 0000 [#1] SMP NOPTI
Jun 15 14:48:36 CPU: 1 PID: 1344 Comm: X Tainted: P           O      5.10.43 #1-NixOS
Jun 15 14:48:36 Hardware name: Acer Aspire X3300/WMCP78M, BIOS P01-A3 08/10/2009
Jun 15 14:48:36 RIP: 0010:__kmalloc+0xaa/0x240
Jun 15 14:48:36 Code: 03 05 72 ea b9 63 49 8b 00 49 83 78 10 00 48 89 04 24 0f 84 7f 01 00 00 48 85 c0 0f 84 76 01 00 00 41 8b 4c 24 28 49 8b 3c 24 <48> 8b 1c 08 48 8d 4a 01 65 48 0f c7 0f 0f 94 c0 84 c0 74 b7 41 8b
Jun 15 14:48:36 RSP: 0018:ffffabcf83543908 EFLAGS: 00010202
Jun 15 14:48:36 RAX: 0000000101652161 RBX: ffff8b1f99e39170 RCX: 0000000000000030
Jun 15 14:48:36 RDX: 000000000024ffe9 RSI: 0000000000000cc0 RDI: 000000000002ed20
Jun 15 14:48:36 RBP: 0000000000000cc0 R08: ffff8b2097caed20 R09: 000000000000001a
Jun 15 14:48:36 R10: ffffabcf83543add R11: ffffabcf835439e8 R12: ffff8b1f80042a00
Jun 15 14:48:36 R13: 0000000000000052 R14: ffffffffc12f27e8 R15: ffff8b1f80042a00
Jun 15 14:48:36 FS:  00007fda7a23b980(0000) GS:ffff8b2097c80000(0000) knlGS:0000000000000000
Jun 15 14:48:36 CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 15 14:48:36 CR2: 0000000101652191 CR3: 0000000115e0c000 CR4: 00000000000006e0
Jun 15 14:48:36 Call Trace:
Jun 15 14:48:36  nvif_object_ctor+0xa8/0x1c0 [nouveau]
Jun 15 14:48:36  ? nvkm_vmm_ptes_get_map+0x2c/0x90 [nouveau]
Jun 15 14:48:36  ? nvkm_vmm_ref_sptes.isra.0+0x1b0/0x1b0 [nouveau]
Jun 15 14:48:36  nvif_mem_ctor_type+0xfb/0x190 [nouveau]
Jun 15 14:48:36  ? get_partial_node.part.0+0x1a8/0x250
Jun 15 14:48:36  nouveau_mem_vram+0xe3/0x180 [nouveau]
Jun 15 14:48:36  nouveau_vram_manager_new+0x6d/0x90 [nouveau]
Jun 15 14:48:36  ? ttm_bo_mem_placement.constprop.0+0x87/0xd0 [ttm]
Jun 15 14:48:36  ttm_bo_mem_space+0x81/0x270 [ttm]
Jun 15 14:48:36  ? drm_mm_insert_node_in_range+0x2c0/0x4a0 [drm]
Jun 15 14:48:36  ttm_bo_validate+0x125/0x170 [ttm]
Jun 15 14:48:36  ttm_bo_init_reserved+0x2a8/0x330 [ttm]
Jun 15 14:48:36  ttm_bo_init+0x6d/0xe0 [ttm]
Jun 15 14:48:36  ? nouveau_bo_move+0x5c0/0x5c0 [nouveau]
Jun 15 14:48:36  nouveau_bo_init+0xaf/0xc0 [nouveau]
Jun 15 14:48:36  ? nouveau_bo_move+0x5c0/0x5c0 [nouveau]
Jun 15 14:48:36  ? nouveau_gem_new+0xf0/0xf0 [nouveau]
Jun 15 14:48:36  nouveau_gem_new+0x74/0xf0 [nouveau]
Jun 15 14:48:36  nouveau_gem_ioctl_new+0x53/0xf0 [nouveau]
Jun 15 14:48:36  ? nouveau_gem_new+0xf0/0xf0 [nouveau]
Jun 15 14:48:36  drm_ioctl_kernel+0xaa/0xf0 [drm]
Jun 15 14:48:36  drm_ioctl+0x1ec/0x390 [drm]
Jun 15 14:48:36  ? nouveau_gem_new+0xf0/0xf0 [nouveau]
Jun 15 14:48:36  ? selinux_file_ioctl+0x135/0x230
Jun 15 14:48:36  nouveau_drm_ioctl+0x55/0xa0 [nouveau]
Jun 15 14:48:36  __x64_sys_ioctl+0x83/0xb0
Jun 15 14:48:36  do_syscall_64+0x33/0x40
Jun 15 14:48:36  entry_SYSCALL_64_after_hwframe+0x44/0xa9
Jun 15 14:48:36 RIP: 0033:0x7fda7a656b07
Jun 15 14:48:36 Code: c0 75 b5 48 8d 3c 2b e8 17 ff ff ff 85 c0 78 b6 48 89 d8 5b 5d 41 5c c3 66 2e 0f 1f 84 00 00 00 00 00 90 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 39 73 0c 00 f7 d8 64 89 01 48
Jun 15 14:48:36 RSP: 002b:00007ffd322077e8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Jun 15 14:48:36 RAX: ffffffffffffffda RBX: 00007ffd32207840 RCX: 00007fda7a656b07
Jun 15 14:48:36 RDX: 00007ffd32207840 RSI: 00000000c0306480 RDI: 0000000000000010
Jun 15 14:48:36 RBP: 00000000c0306480 R08: 0000000000000000 R09: 00000000006e9010
Jun 15 14:48:36 R10: 00007fda7a71ea00 R11: 0000000000000246 R12: 00007ffd32207840
Jun 15 14:48:36 R13: 0000000000000010 R14: 0000000000752440 R15: 0000000000000000
Jun 15 14:48:36 Modules linked in: nls_iso8859_1 nls_cp437 vfat fat rfcomm xt_nat nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter overlay wireguard curve25519_x86_64 libchacha20poly1305 libbla>
Jun 15 14:48:36  btintel macvlan veth snd_rawmidi snd_seq_device joydev bluetooth bridge snd_hda_core uas mc stp drm_kms_helper llc snd_hwdep tun snd_pcm psmouse i2c_algo_bit fb_sys_fops evdev ecdh_generic rfkill ecc mac>
Jun 15 14:48:36  libps2 ohci_pci ahci libahci pata_amd ehci_pci ohci_hcd libata ehci_hcd scsi_mod usbcore usb_common i8042 rtc_cmos serio dm_snapshot dm_bufio dm_mod
Jun 15 14:48:36 CR2: 0000000101652191
Jun 15 14:48:36 ---[ end trace 2330b22334e80ee2 ]---
Jun 15 14:48:36 RIP: 0010:__kmalloc+0xaa/0x240
Jun 15 14:48:36 Code: 03 05 72 ea b9 63 49 8b 00 49 83 78 10 00 48 89 04 24 0f 84 7f 01 00 00 48 85 c0 0f 84 76 01 00 00 41 8b 4c 24 28 49 8b 3c 24 <48> 8b 1c 08 48 8d 4a 01 65 48 0f c7 0f 0f 94 c0 84 c0 74 b7 41 8b
Jun 15 14:48:36 RSP: 0018:ffffabcf83543908 EFLAGS: 00010202
Jun 15 14:48:36 RAX: 0000000101652161 RBX: ffff8b1f99e39170 RCX: 0000000000000030
Jun 15 14:48:36 RDX: 000000000024ffe9 RSI: 0000000000000cc0 RDI: 000000000002ed20
Jun 15 14:48:36 RBP: 0000000000000cc0 R08: ffff8b2097caed20 R09: 000000000000001a
Jun 15 14:48:36 R10: ffffabcf83543add R11: ffffabcf835439e8 R12: ffff8b1f80042a00
Jun 15 14:48:36 R13: 0000000000000052 R14: ffffffffc12f27e8 R15: ffff8b1f80042a00
Jun 15 14:48:36 FS:  00007fda7a23b980(0000) GS:ffff8b2097c80000(0000) knlGS:0000000000000000
Jun 15 14:48:36 CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 15 14:48:36 CR2: 0000000101652191 CR3: 0000000115e0c000 CR4: 00000000000006e0
Jun 15 14:48:50 BUG: kernel NULL pointer dereference, address: 00000000000008b8
-- Boot 99ce3a7f32174d6996bd5af9262b8597 --
Jun 15 15:06:02 Linux version 5.10.43 (nixbld@localhost) (gcc (GCC) 10.3.0, GNU ld (GNU Binutils) 2.35.1) #1-NixOS SMP Thu Jun 10 11:39:29 UTC 2021



BUG: unable to handle page fault for address: 00000001006071da
Jun 17 11:35:14 #PF: supervisor read access in kernel mode
Jun 17 11:35:14 #PF: error_code(0x0000) - not-present page
Jun 17 11:35:14 PGD 10a19b067 P4D 10a19b067 PUD 0 
Jun 17 11:35:14 Oops: 0000 [#1] SMP NOPTI
Jun 17 11:35:14 CPU: 0 PID: 1408 Comm: X Tainted: P           O      5.10.43 #1-NixOS
Jun 17 11:35:14 Hardware name: Acer Aspire X3300/WMCP78M, BIOS P01-A3 08/10/2009
Jun 17 11:35:14 RIP: 0010:__kmalloc+0xaa/0x240
Jun 17 11:35:14 Code: 03 05 72 ea d9 4b 49 8b 00 49 83 78 10 00 48 89 04 24 0f 84 7f 01 00 00 48 85 c0 0f 84 76 01 00 00 41 8b 4c 24 28 49 8>
Jun 17 11:35:14 RSP: 0018:ffffad9003517908 EFLAGS: 00010206
Jun 17 11:35:14 RAX: 00000001006071aa RBX: ffff9cbb16efc170 RCX: 0000000000000030
Jun 17 11:35:14 RDX: 0000000000039747 RSI: 0000000000000cc0 RDI: 000000000002ed20
Jun 17 11:35:14 RBP: 0000000000000cc0 R08: ffff9cbc17c2ed20 R09: 000000000000001a
Jun 17 11:35:14 R10: ffffad9003517add R11: ffffad90035179e8 R12: ffff9cbb00042a00
Jun 17 11:35:14 R13: 0000000000000052 R14: ffffffffc10ea7e8 R15: ffff9cbb00042a00
Jun 17 11:35:14 FS:  00007f8db582a980(0000) GS:ffff9cbc17c00000(0000) knlGS:0000000000000000
Jun 17 11:35:14 CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 17 11:35:14 CR2: 00000001006071da CR3: 0000000114270000 CR4: 00000000000006f0
Jun 17 11:35:14 Call Trace:
Jun 17 11:35:14  nvif_object_ctor+0xa8/0x1c0 [nouveau]
Jun 17 11:35:14  ? nvkm_vmm_ptes_get_map+0x2c/0x90 [nouveau]
Jun 17 11:35:14  ? nvkm_vmm_ref_sptes.isra.0+0x1b0/0x1b0 [nouveau]
Jun 17 11:35:14  nvif_mem_ctor_type+0xfb/0x190 [nouveau]
Jun 17 11:35:14  ? dma_resv_list_alloc+0x1b/0x40
Jun 17 11:35:14  nouveau_mem_vram+0xe3/0x180 [nouveau]
Jun 17 11:35:14  nouveau_vram_manager_new+0x6d/0x90 [nouveau]
Jun 17 11:35:14  ? ttm_bo_mem_placement.constprop.0+0x87/0xd0 [ttm]
Jun 17 11:35:14  ttm_bo_mem_space+0x81/0x270 [ttm]
Jun 17 11:35:14  ? drm_mm_insert_node_in_range+0x2c0/0x4a0 [drm]
Jun 17 11:35:14  ttm_bo_validate+0x125/0x170 [ttm]
Jun 17 11:35:14  ttm_bo_init_reserved+0x2a8/0x330 [ttm]
Jun 17 11:35:14  ttm_bo_init+0x6d/0xe0 [ttm]
Jun 17 11:35:14  ? nouveau_bo_move+0x5c0/0x5c0 [nouveau]
Jun 17 11:35:14  nouveau_bo_init+0xaf/0xc0 [nouveau]
Jun 17 11:35:14  ? nouveau_bo_move+0x5c0/0x5c0 [nouveau]
Jun 17 11:35:14  ? nouveau_gem_new+0xf0/0xf0 [nouveau]
Jun 17 11:35:14  nouveau_gem_new+0x74/0xf0 [nouveau]
Jun 17 11:35:14  nouveau_gem_ioctl_new+0x53/0xf0 [nouveau]
Jun 17 11:35:14  ? nouveau_gem_new+0xf0/0xf0 [nouveau]
Jun 17 11:35:14  drm_ioctl_kernel+0xaa/0xf0 [drm]
Jun 17 11:35:14  drm_ioctl+0x1ec/0x390 [drm]
Jun 17 11:35:14  ? nouveau_gem_new+0xf0/0xf0 [nouveau]
Jun 17 11:35:14  ? selinux_file_ioctl+0x135/0x230
Jun 17 11:35:14  nouveau_drm_ioctl+0x55/0xa0 [nouveau]
Jun 17 11:35:14  __x64_sys_ioctl+0x83/0xb0
Jun 17 11:35:14  do_syscall_64+0x33/0x40
Jun 17 11:35:14  entry_SYSCALL_64_after_hwframe+0x44/0xa9
Jun 17 11:35:14 RIP: 0033:0x7f8db5c45b07
Jun 17 11:35:14 Code: c0 75 b5 48 8d 3c 2b e8 17 ff ff ff 85 c0 78 b6 48 89 d8 5b 5d 41 5c c3 66 2e 0f 1f 84 00 00 00 00 00 90 b8 10 00 00 0>
Jun 17 11:35:14 RSP: 002b:00007ffcd0cd41f8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Jun 17 11:35:14 RAX: ffffffffffffffda RBX: 00007ffcd0cd4250 RCX: 00007f8db5c45b07
Jun 17 11:35:14 RDX: 00007ffcd0cd4250 RSI: 00000000c0306480 RDI: 0000000000000010
Jun 17 11:35:14 RBP: 00000000c0306480 R08: 0000000000000000 R09: 0000000000990010
Jun 17 11:35:14 R10: 00007f8db5d0da00 R11: 0000000000000246 R12: 00007ffcd0cd4250
Jun 17 11:35:14 R13: 0000000000000010 R14: 00000000009f9440 R15: 0000000000000000
Jun 17 11:35:14 Modules linked in: rfcomm xt_nat nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter overlay wiregua>
Jun 17 11:35:14  drm_kms_helper snd_usbmidi_lib snd_hda_core snd_rawmidi joydev snd_seq_device mc snd_hwdep mousedev snd_pcm ecdh_generic i2>
Jun 17 11:35:14  pata_amd ehci_pci atkbd ehci_hcd libps2 libata usbcore scsi_mod usb_common i8042 rtc_cmos serio dm_snapshot dm_bufio dm_mod
Jun 17 11:35:14 CR2: 00000001006071da
Jun 17 11:35:14 ---[ end trace 763b2ff4d412ff41 ]---
Jun 17 11:35:14 RIP: 0010:__kmalloc+0xaa/0x240
Jun 17 11:35:14 Code: 03 05 72 ea d9 4b 49 8b 00 49 83 78 10 00 48 89 04 24 0f 84 7f 01 00 00 48 85 c0 0f 84 76 01 00 00 41 8b 4c 24 28 49 8>
Jun 17 11:35:14 RSP: 0018:ffffad9003517908 EFLAGS: 00010206
Jun 17 11:35:14 RAX: 00000001006071aa RBX: ffff9cbb16efc170 RCX: 0000000000000030
Jun 17 11:35:14 RDX: 0000000000039747 RSI: 0000000000000cc0 RDI: 000000000002ed20
Jun 17 11:35:14 RBP: 0000000000000cc0 R08: ffff9cbc17c2ed20 R09: 000000000000001a
Jun 17 11:35:14 R10: ffffad9003517add R11: ffffad90035179e8 R12: ffff9cbb00042a00
Jun 17 11:35:14 R13: 0000000000000052 R14: ffffffffc10ea7e8 R15: ffff9cbb00042a00
Jun 17 11:35:14 FS:  00007f8db582a980(0000) GS:ffff9cbc17c00000(0000) knlGS:0000000000000000
Jun 17 11:35:14 CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 17 11:35:14 CR2: 00000001006071da CR3: 0000000114270000 CR4: 00000000000006f0

Jun 17 17:07:56 quathlon xsession[2092]: Hiding the terminal
Jun 17 17:07:58 quathlon xsession[2092]: Showing the terminal
Jun 17 17:08:16 BUG: unable to handle page fault for address: 00000000ff340819
Jun 17 17:08:16 #PF: supervisor instruction fetch in kernel mode
Jun 17 17:08:16 #PF: error_code(0x0010) - not-present page
Jun 17 17:08:16 PGD 123e50067 P4D 123e50067 PUD 0 
Jun 17 17:08:16 Oops: 0010 [#1] SMP NOPTI
Jun 17 17:08:16 CPU: 1 PID: 1392 Comm: X Tainted: P           O      5.10.43 #1-NixOS
Jun 17 17:08:16 Hardware name: Acer Aspire X3300/WMCP78M, BIOS P01-A3 08/10/2009
Jun 17 17:08:16 RIP: 0010:0xff340819
Jun 17 17:08:16 Code: Unable to access opcode bytes at RIP 0xff3407ef.
Jun 17 17:08:16 RSP: 0018:ffffab5703537c20 EFLAGS: 00010202
Jun 17 17:08:16 RAX: 00000000ff340819 RBX: 0000000000000001 RCX: 0000000000000001
Jun 17 17:08:16 RDX: 7fffffffffffffff RSI: 0000000000000001 RDI: ffff9a9058861f88
Jun 17 17:08:16 RBP: ffff9a9058861f88 R08: 0000000000000000 R09: ffffab5703537bd4
Jun 17 17:08:16 R10: 0000000000000000 R11: 0000000000000000 R12: 7fffffffffffffff
Jun 17 17:08:16 R13: ffff9a8edd20ae28 R14: 0000000000000000 R15: ffff9a8fe5ac7500
Jun 17 17:08:16 FS:  00007f835500c980(0000) GS:ffff9a90d7c80000(0000) knlGS:0000000000000000
Jun 17 17:08:16 CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 17 17:08:16 CR2: 00000000ff340819 CR3: 000000010b018000 CR4: 00000000000006e0
Jun 17 17:08:16 Call Trace:
Jun 17 17:08:16  ? dma_fence_wait_timeout+0x3f/0xd0
Jun 17 17:08:16  ? nouveau_fence_sync+0x1bb/0x220 [nouveau]
Jun 17 17:08:16  ? nouveau_gem_ioctl_pushbuf+0x677/0x1120 [nouveau]
Jun 17 17:08:16  ? nouveau_gem_ioctl_new+0xf0/0xf0 [nouveau]
Jun 17 17:08:16  ? drm_ioctl_kernel+0xaa/0xf0 [drm]
Jun 17 17:08:16  ? drm_ioctl+0x1ec/0x390 [drm]
Jun 17 17:08:16  ? nouveau_gem_ioctl_new+0xf0/0xf0 [nouveau]
Jun 17 17:08:16  ? selinux_file_ioctl+0x135/0x230
Jun 17 17:08:16  ? nouveau_drm_ioctl+0x55/0xa0 [nouveau]
Jun 17 17:08:16  ? __x64_sys_ioctl+0x83/0xb0
Jun 17 17:08:16  ? do_syscall_64+0x33/0x40
Jun 17 17:08:16  ? entry_SYSCALL_64_after_hwframe+0x44/0xa9
Jun 17 17:08:16 Modules linked in: nls_iso8859_1 nls_cp437 vfat fat rfcomm xt_nat nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addr>
Jun 17 17:08:16  i2c_algo_bit fb_sys_fops snd_hda_core syscopyarea stp llc snd_rawmidi tun wmi_bmof snd_seq_device mc btusb snd_hwdep snd_pc>
Jun 17 17:08:16  libps2 ahci libahci ohci_pci pata_amd libata ehci_pci ohci_hcd ehci_hcd scsi_mod usbcore usb_common rtc_cmos i8042 serio dm>
Jun 17 17:08:16 CR2: 00000000ff340819
Jun 17 17:08:16 ---[ end trace 494bba1f91a619c6 ]---
Jun 17 17:08:16 RIP: 0010:0xff340819
Jun 17 17:08:16 Code: Unable to access opcode bytes at RIP 0xff3407ef.
Jun 17 17:08:16 RSP: 0018:ffffab5703537c20 EFLAGS: 00010202
Jun 17 17:08:16 RAX: 00000000ff340819 RBX: 0000000000000001 RCX: 0000000000000001
Jun 17 17:08:16 RDX: 7fffffffffffffff RSI: 0000000000000001 RDI: ffff9a9058861f88
Jun 17 17:08:16 RBP: ffff9a9058861f88 R08: 0000000000000000 R09: ffffab5703537bd4
Jun 17 17:08:16 R10: 0000000000000000 R11: 0000000000000000 R12: 7fffffffffffffff
Jun 17 17:08:16 R13: ffff9a8edd20ae28 R14: 0000000000000000 R15: ffff9a8fe5ac7500
Jun 17 17:08:16 FS:  00007f835500c980(0000) GS:ffff9a90d7c80000(0000) knlGS:0000000000000000
Jun 17 17:08:16 CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 17 17:08:16 CR2: 00000000ff340819 CR3: 000000010b018000 CR4: 00000000000006e0

Jul 13 17:31:58 ------------[ cut here ]------------
Jul 13 17:31:58 WARNING: CPU: 1 PID: 1238 at kernel/workqueue.c:1480 __queue_work+0x338/0x3d0
Jul 13 17:31:58 Modules linked in: cdc_ether usbnet mii xt_nat nf_conntrack_netlink nfnetlink xfrm_user xfrm_algo xt_addrtype br_netfilter o>
Jul 13 17:31:58  drm_ttm_helper edac_mce_amd ttm btrtl mc snd_hda_core mac_hid btbcm edac_core evdev btintel snd_hwdep snd_pcm uas video blu>
Jul 13 17:31:58  libata ohci_hcd ehci_pci ehci_hcd scsi_mod usbcore usb_common i8042 rtc_cmos serio dm_snapshot dm_bufio dm_mod
Jul 13 17:31:58 CPU: 1 PID: 1238 Comm: X Tainted: P           O      5.12.15 #1-NixOS
Jul 13 17:31:58 Hardware name: Acer Aspire X3300/WMCP78M, BIOS P01-A3 08/10/2009
Jul 13 17:31:58 RIP: 0010:__queue_work+0x338/0x3d0
Jul 13 17:31:58 Code: cd f7 7a a9 00 01 ff 00 75 0f 65 48 8b 3c 25 c0 7b 01 00 f6 47 2c 20 75 24 0f 0b 48 83 c4 10 5b 5d 41 5c 41 5d 41 5e 4>
Jul 13 17:31:58 RSP: 0018:ffffa466034c7c98 EFLAGS: 00010083
Jul 13 17:31:58 RAX: ffff88b940874f07 RBX: ffff88ba57cafa00 RCX: 0000000000000000
Jul 13 17:31:58 RDX: ff88b940874f00ff RSI: 00000000043a7800 RDI: ffff88ba57cab980
Jul 13 17:31:58 RBP: 0000000000000047 R08: ffff88ba57cab980 R09: ffffffff86251618
Jul 13 17:31:58 R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000001
Jul 13 17:31:58 R13: 0000000000029248 R14: ffff88b940052800 R15: ffff88b940874eff
Jul 13 17:31:58 FS:  00007fb0e0c4d980(0000) GS:ffff88ba57c80000(0000) knlGS:0000000000000000
Jul 13 17:31:58 CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jul 13 17:31:58 CR2: 000031f409812000 CR3: 000000010e864000 CR4: 00000000000006e0
Jul 13 17:31:58 Call Trace:
Jul 13 17:31:58  queue_work_on+0x31/0x50
Jul 13 17:31:58  nouveau_cli_work_queue+0xb4/0xe0 [nouveau]
Jul 13 17:31:58  nouveau_gem_object_close+0x12e/0x1e0 [nouveau]
Jul 13 17:31:58  drm_gem_object_release_handle+0x2b/0x80 [drm]
Jul 13 17:31:58  drm_gem_handle_delete+0x58/0x90 [drm]
Jul 13 17:31:58  ? drm_gem_handle_create+0x40/0x40 [drm]
Jul 13 17:31:58  drm_ioctl_kernel+0xaa/0xf0 [drm]
Jul 13 17:31:58  drm_ioctl+0x1ec/0x390 [drm]
Jul 13 17:31:58  ? drm_gem_handle_create+0x40/0x40 [drm]
Jul 13 17:31:58  ? selinux_file_ioctl+0x135/0x230
Jul 13 17:31:58  nouveau_drm_ioctl+0x55/0xa0 [nouveau]
Jul 13 17:31:58  __x64_sys_ioctl+0x83/0xb0
Jul 13 17:31:58  do_syscall_64+0x33/0x40
Jul 13 17:31:58  entry_SYSCALL_64_after_hwframe+0x44/0xae
Jul 13 17:31:58 RIP: 0033:0x7fb0e1068b07
Jul 13 17:31:58 Code: c0 75 b5 48 8d 3c 2b e8 17 ff ff ff 85 c0 78 b6 48 89 d8 5b 5d 41 5c c3 66 2e 0f 1f 84 00 00 00 00 00 90 b8 10 00 00 0>
Jul 13 17:31:58 RSP: 002b:00007ffc6223de28 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Jul 13 17:31:58 RAX: ffffffffffffffda RBX: 00007ffc6223de60 RCX: 00007fb0e1068b07
Jul 13 17:31:58 RDX: 00007ffc6223de60 RSI: 0000000040086409 RDI: 0000000000000010
Jul 13 17:31:58 RBP: 0000000040086409 R08: 0000000000a23440 R09: 0000000000a3e088
Jul 13 17:31:58 R10: 00007fb0d473b400 R11: 0000000000000246 R12: 0000000000a23330
Jul 13 17:31:58 R13: 0000000000000010 R14: 0000000000a2d0d8 R15: 0000000000a234f0
Jul 13 17:31:58 ---[ end trace adf787e5bf13c409 ]---
Jul 13 17:31:58 BUG: unable to handle page fault for address: 00000001003c318a
Jul 13 17:31:58 #PF: supervisor read access in kernel mode
Jul 13 17:31:58 #PF: error_code(0x0000) - not-present page
-- Boot bfc53be93cd044de8a92c5661c957c00 --
Jul 13 17:33:52 Linux version 5.12.15 (nixbld@localhost) (gcc (GCC) 10.3.0, GNU ld (GNU Binutils) 2.35.1) #1-NixOS SMP Wed Jul 7 12:26:52 U
Comment 17 gabriel.sevecek 2021-09-01 11:41:19 UTC
Experiencing similar issue (full system freeze) on Ubuntu 21.04 running 5.13.5:

Sep 01 13:26:25 VIE4-DEV-NB010 kernel: WARNING: CPU: 7 PID: 131 at arch/x86/include/asm/kfence.h:44 kfence_unprotect+0x3c/0xa0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: Modules linked in: veth rfcomm xt_nat nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype br_netfilter ccm xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp nft_compat nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nft_counter nf_tables nfnetlink bridge stp llc cmac algif_hash algif_skcipher af_alg bnep overlay snd_hda_codec_hdmi snd_ctl_led snd_hda_codec_realtek snd_hda_codec_generic snd_sof_pci_intel_tgl snd_sof_intel_hda_common snd_soc_hdac_hda soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi intel_tcc_cooling soundwire_bus dell_laptop x86_pkg_temp_thermal intel_powerclamp intel_rapl_msr mei_hdcp ledtrig_audio coretemp snd_soc_core snd_compress kvm_intel ac97_bus snd_pcm_dmaengine kvm snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi snd_hda_codec snd_hda_core snd_hwdep nls_iso8859_1 snd_pcm
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  crct10dif_pclmul iwlmvm ghash_clmulni_intel aesni_intel crypto_simd cryptd mac80211 snd_seq_midi snd_seq_midi_event snd_rawmidi intel_cstate libarc4 dell_wmi snd_seq uvcvideo dell_smbios videobuf2_vmalloc btusb btrtl dcdbas btbcm videobuf2_memops snd_seq_device btintel snd_timer videobuf2_v4l2 videobuf2_common i915 serio_raw efi_pstore snd iwlwifi bluetooth dell_wmi_sysman dell_wmi_descriptor videodev wmi_bmof hid_sensor_accel_3d hid_sensor_gyro_3d soundcore input_leds joydev mc drm_kms_helper hid_sensor_custom_intel_hinge cfg80211 hid_sensor_als mei_me hid_sensor_trigger industrialio_triggered_buffer ecdh_generic hid_multitouch apple_mfi_fastcharge kfifo_buf mei ecc hid_sensor_iio_common cec industrialio rc_core i2c_algo_bit processor_thermal_device fb_sys_fops syscopyarea cros_ec_ishtp processor_thermal_rfim processor_thermal_mbox cros_ec processor_thermal_rapl sysfillrect intel_rapl_common sysimgblt ucsi_acpi intel_soc_dts_iosf typec_ucsi typec mac_hid int3403_thermal
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  int340x_thermal_zone soc_button_array intel_hid sparse_keymap int3400_thermal acpi_pad acpi_thermal_rel acpi_tad sch_fq_codel msr nfsd parport_pc nfs_acl ppdev lockd auth_rpcgss grace drm lp parport sunrpc ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear hid_apple usbhid hid_sensor_custom hid_sensor_hub intel_ishtp_loader intel_ishtp_hid hid_generic rtsx_pci_sdmmc crc32_pclmul i2c_i801 psmouse i2c_smbus rtsx_pci nvme intel_lpss_pci thunderbolt intel_ish_ipc intel_lpss xhci_pci idma64 nvme_core intel_ishtp xhci_pci_renesas wmi i2c_hid_acpi i2c_hid hid video pinctrl_tigerlake
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: CPU: 7 PID: 131 Comm: kswapd0 Not tainted 5.13.5-051305-generic #202107251335
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: Hardware name: Dell Inc. Latitude 7420/09RW31, BIOS 1.4.1 02/22/2021
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RIP: 0010:kfence_unprotect+0x3c/0xa0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: Code: 49 89 fc 48 8d 75 ec 48 83 ec 10 65 48 8b 04 25 28 00 00 00 48 89 45 f0 31 c0 e8 ff 57 d9 ff 48 85 c0 74 06 83 7d ec 01 74 0f <0f> 0b 0f 0b c6 05 69 7c e6 01 00 31 c0 eb 35 48 89 c2 48 8b 38 48
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RSP: 0000:ffffbd6140e538e8 EFLAGS: 00010046
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RAX: 0000000000000000 RBX: 0000000000000000 RCX: ffffffffa8210000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RDX: ffffbd6140e538ec RSI: 0000000000000000 RDI: ffffffffa8210000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RBP: ffffbd6140e53900 R08: 0000000000000000 R09: 0000000000000000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: R13: ffffbd6140e53a08 R14: ffffbd6140e53a08 R15: 0000000000000000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: FS:  0000000000000000(0000) GS:ffff9d977f7c0000(0000) knlGS:0000000000000000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: CR2: 0000000000000018 CR3: 0000000120b28003 CR4: 0000000000770ee0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: PKRU: 55555554
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: Call Trace:
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  kfence_handle_page_fault+0x97/0x250
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  page_fault_oops+0x88/0x130
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  do_user_addr_fault+0x308/0x650
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  exc_page_fault+0x7d/0x170
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  asm_exc_page_fault+0x1e/0x30
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RIP: 0010:set_page_dirty+0x38/0xc0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: Code: e8 0d af 01 00 49 8b 54 24 08 48 8d 4a ff 83 e2 01 4c 0f 45 e1 48 85 c0 74 56 49 8b 4c 24 08 48 8b 40 68 48 8d 51 ff 83 e1 01 <48> 8b 40 18 49 0f 44 d4 48 8b 12 f7 c2 00 00 04 00 75 1c 48 85 c0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RSP: 0000:ffffbd6140e53ab8 EFLAGS: 00010246
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RAX: 0000000000000000 RBX: ffffbd6140e53bc0 RCX: 0000000000000000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RDX: dead0000000000ff RSI: 0000000000082020 RDI: ffffe25e919732c0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RBP: ffffbd6140e53ac0 R08: 0000000000000130 R09: 0000000000000000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: R10: 000000000000001d R11: 000000000000001d R12: ffffe25e919732c0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: R13: 0000000000160001 R14: 0000000000000001 R15: ffffe25e919732c0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  ? set_page_dirty+0x13/0xc0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  add_to_swap+0x51/0x80
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  shrink_page_list+0x995/0xb60
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  shrink_inactive_list+0x163/0x430
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  shrink_lruvec+0x218/0x300
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  ? shrink_slab+0x109/0x120
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  ? shrink_slab+0x109/0x120
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  shrink_node_memcgs+0x17b/0x1d0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  shrink_node+0x14f/0x540
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  balance_pgdat+0x270/0x590
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  kswapd+0x106/0x1a0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  kthread+0x11c/0x140
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  ? balance_pgdat+0x590/0x590
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  ? set_kthread_struct+0x50/0x50
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  ret_from_fork+0x1f/0x30
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: ---[ end trace 7c67d721c83d089c ]---
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: ------------[ cut here ]------------
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: WARNING: CPU: 7 PID: 131 at mm/kfence/core.c:135 kfence_unprotect+0x3e/0xa0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: Modules linked in: veth rfcomm xt_nat nf_conntrack_netlink xfrm_user xfrm_algo xt_addrtype br_netfilter ccm xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp nft_compat nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nft_counter nf_tables nfnetlink bridge stp llc cmac algif_hash algif_skcipher af_alg bnep overlay snd_hda_codec_hdmi snd_ctl_led snd_hda_codec_realtek snd_hda_codec_generic snd_sof_pci_intel_tgl snd_sof_intel_hda_common snd_soc_hdac_hda soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci snd_sof_xtensa_dsp snd_sof snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi intel_tcc_cooling soundwire_bus dell_laptop x86_pkg_temp_thermal intel_powerclamp intel_rapl_msr mei_hdcp ledtrig_audio coretemp snd_soc_core snd_compress kvm_intel ac97_bus snd_pcm_dmaengine kvm snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi snd_hda_codec snd_hda_core snd_hwdep nls_iso8859_1 snd_pcm
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  crct10dif_pclmul iwlmvm ghash_clmulni_intel aesni_intel crypto_simd cryptd mac80211 snd_seq_midi snd_seq_midi_event snd_rawmidi intel_cstate libarc4 dell_wmi snd_seq uvcvideo dell_smbios videobuf2_vmalloc btusb btrtl dcdbas btbcm videobuf2_memops snd_seq_device btintel snd_timer videobuf2_v4l2 videobuf2_common i915 serio_raw efi_pstore snd iwlwifi bluetooth dell_wmi_sysman dell_wmi_descriptor videodev wmi_bmof hid_sensor_accel_3d hid_sensor_gyro_3d soundcore input_leds joydev mc drm_kms_helper hid_sensor_custom_intel_hinge cfg80211 hid_sensor_als mei_me hid_sensor_trigger industrialio_triggered_buffer ecdh_generic hid_multitouch apple_mfi_fastcharge kfifo_buf mei ecc hid_sensor_iio_common cec industrialio rc_core i2c_algo_bit processor_thermal_device fb_sys_fops syscopyarea cros_ec_ishtp processor_thermal_rfim processor_thermal_mbox cros_ec processor_thermal_rapl sysfillrect intel_rapl_common sysimgblt ucsi_acpi intel_soc_dts_iosf typec_ucsi typec mac_hid int3403_thermal
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  int340x_thermal_zone soc_button_array intel_hid sparse_keymap int3400_thermal acpi_pad acpi_thermal_rel acpi_tad sch_fq_codel msr nfsd parport_pc nfs_acl ppdev lockd auth_rpcgss grace drm lp parport sunrpc ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear hid_apple usbhid hid_sensor_custom hid_sensor_hub intel_ishtp_loader intel_ishtp_hid hid_generic rtsx_pci_sdmmc crc32_pclmul i2c_i801 psmouse i2c_smbus rtsx_pci nvme intel_lpss_pci thunderbolt intel_ish_ipc intel_lpss xhci_pci idma64 nvme_core intel_ishtp xhci_pci_renesas wmi i2c_hid_acpi i2c_hid hid video pinctrl_tigerlake
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: CPU: 7 PID: 131 Comm: kswapd0 Tainted: G        W         5.13.5-051305-generic #202107251335
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: Hardware name: Dell Inc. Latitude 7420/09RW31, BIOS 1.4.1 02/22/2021
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RIP: 0010:kfence_unprotect+0x3e/0xa0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: Code: fc 48 8d 75 ec 48 83 ec 10 65 48 8b 04 25 28 00 00 00 48 89 45 f0 31 c0 e8 ff 57 d9 ff 48 85 c0 74 06 83 7d ec 01 74 0f 0f 0b <0f> 0b c6 05 69 7c e6 01 00 31 c0 eb 35 48 89 c2 48 8b 38 48 89 f8
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RSP: 0000:ffffbd6140e538e8 EFLAGS: 00010046
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RAX: 0000000000000000 RBX: 0000000000000000 RCX: ffffffffa8210000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RDX: ffffbd6140e538ec RSI: 0000000000000000 RDI: ffffffffa8210000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RBP: ffffbd6140e53900 R08: 0000000000000000 R09: 0000000000000000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: 0000000000000000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: R13: ffffbd6140e53a08 R14: ffffbd6140e53a08 R15: 0000000000000000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: FS:  0000000000000000(0000) GS:ffff9d977f7c0000(0000) knlGS:0000000000000000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: CR2: 0000000000000018 CR3: 0000000120b28003 CR4: 0000000000770ee0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: PKRU: 55555554
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: Call Trace:
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  kfence_handle_page_fault+0x97/0x250
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  page_fault_oops+0x88/0x130
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  do_user_addr_fault+0x308/0x650
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  exc_page_fault+0x7d/0x170
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  asm_exc_page_fault+0x1e/0x30
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RIP: 0010:set_page_dirty+0x38/0xc0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: Code: e8 0d af 01 00 49 8b 54 24 08 48 8d 4a ff 83 e2 01 4c 0f 45 e1 48 85 c0 74 56 49 8b 4c 24 08 48 8b 40 68 48 8d 51 ff 83 e1 01 <48> 8b 40 18 49 0f 44 d4 48 8b 12 f7 c2 00 00 04 00 75 1c 48 85 c0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RSP: 0000:ffffbd6140e53ab8 EFLAGS: 00010246
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RAX: 0000000000000000 RBX: ffffbd6140e53bc0 RCX: 0000000000000000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RDX: dead0000000000ff RSI: 0000000000082020 RDI: ffffe25e919732c0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: RBP: ffffbd6140e53ac0 R08: 0000000000000130 R09: 0000000000000000
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: R10: 000000000000001d R11: 000000000000001d R12: ffffe25e919732c0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: R13: 0000000000160001 R14: 0000000000000001 R15: ffffe25e919732c0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  ? set_page_dirty+0x13/0xc0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  add_to_swap+0x51/0x80
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  shrink_page_list+0x995/0xb60
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  shrink_inactive_list+0x163/0x430
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  shrink_lruvec+0x218/0x300
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  ? shrink_slab+0x109/0x120
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  ? shrink_slab+0x109/0x120
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  shrink_node_memcgs+0x17b/0x1d0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  shrink_node+0x14f/0x540
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  balance_pgdat+0x270/0x590
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  kswapd+0x106/0x1a0
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  kthread+0x11c/0x140
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  ? balance_pgdat+0x590/0x590
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  ? set_kthread_struct+0x50/0x50
Sep 01 13:26:25 VIE4-DEV-NB010 kernel:  ret_from_fork+0x1f/0x30
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: ---[ end trace 7c67d721c83d089d ]---
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: BUG: kernel NULL pointer dereference, address: 0000000000000018
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: #PF: supervisor read access in kernel mode
Sep 01 13:26:25 VIE4-DEV-NB010 kernel: #PF: error_code(0x0000) - not-present page
Comment 18 ukro 2022-01-28 07:39:41 UTC
Experience similar issue:
Jan 28 07:33:54 matrix kernel: BUG: kernel NULL pointer dereference, address: 0000000000000000
Jan 28 07:33:54 matrix kernel: #PF: supervisor instruction fetch in kernel mode
Jan 28 07:33:54 matrix kernel: #PF: error_code(0x0010) - not-present page
Jan 28 07:33:54 matrix kernel: PGD 0 P4D 0 
Jan 28 07:33:54 matrix kernel: Oops: 0010 [#1] SMP NOPTI
Jan 28 07:33:54 matrix kernel: CPU: 7 PID: 1544 Comm: dp_sync_taskq Tainted: P    B      O      5.13.19-3-pve #1
Jan 28 07:33:54 matrix kernel: Hardware name: ASUS System Product Name/PRIME B460M-A, BIOS 1605 04/07/2021
Jan 28 07:33:54 matrix kernel: RIP: 0010:0x0
Jan 28 07:33:54 matrix kernel: Code: Unable to access opcode bytes at RIP 0xffffffffffffffd6.
Jan 28 07:33:54 matrix kernel: RSP: 0018:ffffbcd5c299bd80 EFLAGS: 00010246
Jan 28 07:33:54 matrix kernel: RAX: 0000000000000000 RBX: ffff9faa1c0907c0 RCX: 0000000000000000
Jan 28 07:33:54 matrix kernel: RDX: ffff9fa4484e4b00 RSI: ffff9faa1c0907c0 RDI: ffff9fa4493d1430
Jan 28 07:33:54 matrix kernel: RBP: ffffbcd5c299bda0 R08: 0000000000000001 R09: 0000000000000001
Jan 28 07:33:54 matrix kernel: R10: ffff9fa53279c400 R11: ffff9fa53279c400 R12: ffff9faa1c0907c0
Jan 28 07:33:54 matrix kernel: R13: ffff9fa4493d1430 R14: ffff9fab5f371bc0 R15: ffff9fa4493d3280
Jan 28 07:33:54 matrix kernel: FS:  0000000000000000(0000) GS:ffff9fab8f180000(0000) knlGS:0000000000000000
Jan 28 07:33:54 matrix kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 28 07:33:54 matrix kernel: CR2: ffffffffffffffd6 CR3: 000000022229a006 CR4: 00000000007726e0
Jan 28 07:33:54 matrix kernel: PKRU: 55555554
Jan 28 07:33:54 matrix kernel: Call Trace:
Jan 28 07:33:54 matrix kernel:  <TASK>
Jan 28 07:33:54 matrix kernel:  multilist_insert+0x1d/0xd0 [zfs]
Jan 28 07:33:54 matrix kernel:  sync_dnodes_task+0x6e/0xb0 [zfs]
Jan 28 07:33:54 matrix kernel:  taskq_thread+0x2b7/0x500 [spl]
Jan 28 07:33:54 matrix kernel:  ? wake_up_q+0xa0/0xa0
Jan 28 07:33:54 matrix kernel:  ? taskq_thread_spawn+0x60/0x60 [spl]
Jan 28 07:33:54 matrix kernel:  kthread+0x128/0x150
Jan 28 07:33:54 matrix kernel:  ? set_kthread_struct+0x50/0x50
Jan 28 07:33:54 matrix kernel:  ret_from_fork+0x1f/0x30
Jan 28 07:33:54 matrix kernel:  </TASK>
Jan 28 07:33:54 matrix kernel: Modules linked in: act_police cls_basic sch_ingress sch_htb tcp_diag inet_diag nfsd auth_rpcgss nfs_acl lockd grace veth ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter bpfilter snd_hda_codec_hdmi softdog snd_sof_pci_intel_cnl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence intel_rapl_msr snd_hda_codec_realtek snd_sof_intel_hda intel_rapl_common snd_hda_codec_generic intel_tcc_cooling snd_sof_pci x86_pkg_temp_thermal snd_sof_xtensa_dsp intel_powerclamp snd_sof snd_soc_hdac_hda snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi soundwire_bus ledtrig_audio snd_soc_core snd_compress ppdev kvm_intel ac97_bus snd_pcm_dmaengine nfnetlink_log zfs(PO) nfnetlink kvm snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi zunicode(PO) crct10dif_pclmul snd_hda_codec ghash_clmulni_intel zzstd(O) mei_hdcp aesni_intel zlua(O) i915 snd_hda_core zavl(PO) crypto_simd cryptd snd_hwdep icp(PO) rapl snd_pcm
Jan 28 07:33:54 matrix kernel:  snd_timer drm_kms_helper intel_cstate cec rc_core snd joydev input_leds i2c_algo_bit eeepc_wmi mei_me soundcore fb_sys_fops asus_wmi syscopyarea sysfillrect mei sparse_keymap pcspkr mxm_wmi sysimgblt efi_pstore wmi_bmof ee1004 parport_pc zcommon(PO) parport znvpair(PO) mac_hid acpi_tad acpi_pad spl(O) vhost_net vhost vhost_iotlb tap ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi wireguard curve25519_x86_64 libchacha20poly1305 chacha_x86_64 poly1305_x86_64 libblake2s blake2s_x86_64 libcurve25519_generic libchacha libblake2s_generic ip6_udp_tunnel udp_tunnel nct6775 hwmon_vid coretemp vfio_pci vfio_virqfd irqbypass vfio_iommu_type1 vfio drm sunrpc ip_tables x_tables autofs4 btrfs blake2b_generic xor zstd_compress raid6_pq hid_generic usbkbd uas usbhid usb_storage hid dm_thin_pool dm_persistent_data dm_bio_prison dm_bufio libcrc32c crc32_pclmul nvme xhci_pci i2c_i801 xhci_pci_renesas i2c_smbus r8169 nvme_core ahci realtek xhci_hcd
Jan 28 07:33:54 matrix kernel:  libahci wmi video pinctrl_sunrisepoint
Jan 28 07:33:54 matrix kernel: CR2: 0000000000000000
Jan 28 07:33:54 matrix kernel: ---[ end trace 5baa21cdf89f62ad ]---
Jan 28 07:33:54 matrix kernel: general protection fault, probably for non-canonical address 0xffff9fa6963b80: 0000 [#2] SMP NOPTI
Jan 28 07:33:54 matrix kernel: CPU: 10 PID: 1549 Comm: dp_sync_taskq Tainted: P    B D    O      5.13.19-3-pve #1
Jan 28 07:33:54 matrix kernel: Hardware name: ASUS System Product Name/PRIME B460M-A, BIOS 1605 04/07/2021
Jan 28 07:33:54 matrix kernel: RIP: 0010:zpl_get_file_info+0x52/0x240 [zfs]
Jan 28 07:33:54 matrix kernel: Code: c0 83 ff 11 74 09 83 ff 2c 0f 85 09 01 00 00 49 c7 44 24 10 00 00 00 00 48 85 db 0f 84 65 01 00 00 83 ff 11 0f 84 1e 01 00 00 <8b> 03 85 c0 0f 84 93 00 00 00 48 8b 13 48 89 55 d0 3d 00 2f 50 5a
Jan 28 07:33:54 matrix kernel: RSP: 0018:ffffbcd5c29c3c20 EFLAGS: 00010206
Jan 28 07:33:54 matrix kernel: RAX: 0000000000000000 RBX: 00ffff9fa6963b80 RCX: 00ffff9fa6963b80
Jan 28 07:33:54 matrix kernel: RDX: ffffbcd5c29c3c80 RSI: 00ffff9fa6963b80 RDI: 000000000000002c
Jan 28 07:33:54 matrix kernel: RBP: ffffbcd5c29c3c50 R08: 0000000000001388 R09: 0000000000000000
Jan 28 07:33:54 matrix kernel: R10: ffff9fa53279f001 R11: ffff9fa53279f000 R12: ffffbcd5c29c3c80
Jan 28 07:33:54 matrix kernel: R13: ffff9faa1c0903e0 R14: ffff9fa4493d1000 R15: ffff9faa1c090508
Jan 28 07:33:54 matrix kernel: FS:  0000000000000000(0000) GS:ffff9fab8f300000(0000) knlGS:0000000000000000
Jan 28 07:33:54 matrix kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 28 07:33:54 matrix kernel: CR2: 00007f1ecaa1051d CR3: 000000022229a006 CR4: 00000000007726e0
Jan 28 07:33:54 matrix kernel: PKRU: 55555554
Jan 28 07:33:54 matrix kernel: Call Trace:
Jan 28 07:33:54 matrix kernel:  <TASK>
Jan 28 07:33:54 matrix kernel:  dmu_objset_userquota_get_ids+0x11f/0x490 [zfs]
Jan 28 07:33:54 matrix kernel:  ? __raw_callee_save___native_queued_spin_unlock+0x15/0x23
Jan 28 07:33:54 matrix kernel:  dnode_sync+0x123/0xa80 [zfs]
Jan 28 07:33:54 matrix kernel:  sync_dnodes_task+0x79/0xb0 [zfs]
Jan 28 07:33:54 matrix kernel:  taskq_thread+0x2b7/0x500 [spl]
Jan 28 07:33:54 matrix kernel:  ? wake_up_q+0xa0/0xa0
Jan 28 07:33:54 matrix kernel:  ? taskq_thread_spawn+0x60/0x60 [spl]
Jan 28 07:33:54 matrix kernel:  kthread+0x128/0x150
Jan 28 07:33:54 matrix kernel:  ? set_kthread_struct+0x50/0x50
Jan 28 07:33:54 matrix kernel:  ret_from_fork+0x1f/0x30
Jan 28 07:33:54 matrix kernel:  </TASK>
Jan 28 07:33:54 matrix kernel: Modules linked in: act_police cls_basic sch_ingress sch_htb tcp_diag inet_diag nfsd auth_rpcgss nfs_acl lockd grace veth ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter bpfilter snd_hda_codec_hdmi softdog snd_sof_pci_intel_cnl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence intel_rapl_msr snd_hda_codec_realtek snd_sof_intel_hda intel_rapl_common snd_hda_codec_generic intel_tcc_cooling snd_sof_pci x86_pkg_temp_thermal snd_sof_xtensa_dsp intel_powerclamp snd_sof snd_soc_hdac_hda snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi soundwire_bus ledtrig_audio snd_soc_core snd_compress ppdev kvm_intel ac97_bus snd_pcm_dmaengine nfnetlink_log zfs(PO) nfnetlink kvm snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi zunicode(PO) crct10dif_pclmul snd_hda_codec ghash_clmulni_intel zzstd(O) mei_hdcp aesni_intel zlua(O) i915 snd_hda_core zavl(PO) crypto_simd cryptd snd_hwdep icp(PO) rapl snd_pcm
Jan 28 07:33:54 matrix kernel:  snd_timer drm_kms_helper intel_cstate cec rc_core snd joydev input_leds i2c_algo_bit eeepc_wmi mei_me soundcore fb_sys_fops asus_wmi syscopyarea sysfillrect mei sparse_keymap pcspkr mxm_wmi sysimgblt efi_pstore wmi_bmof ee1004 parport_pc zcommon(PO) parport znvpair(PO) mac_hid acpi_tad acpi_pad spl(O) vhost_net vhost vhost_iotlb tap ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi wireguard curve25519_x86_64 libchacha20poly1305 chacha_x86_64 poly1305_x86_64 libblake2s blake2s_x86_64 libcurve25519_generic libchacha libblake2s_generic ip6_udp_tunnel udp_tunnel nct6775 hwmon_vid coretemp vfio_pci vfio_virqfd irqbypass vfio_iommu_type1 vfio drm sunrpc ip_tables x_tables autofs4 btrfs blake2b_generic xor zstd_compress raid6_pq hid_generic usbkbd uas usbhid usb_storage hid dm_thin_pool dm_persistent_data dm_bio_prison dm_bufio libcrc32c crc32_pclmul nvme xhci_pci i2c_i801 xhci_pci_renesas i2c_smbus r8169 nvme_core ahci realtek xhci_hcd
Jan 28 07:33:54 matrix kernel:  libahci wmi video pinctrl_sunrisepoint
Jan 28 07:33:54 matrix kernel: BUG: unable to handle page fault for address: 0000000061f0cba9
Jan 28 07:33:54 matrix kernel: ---[ end trace 5baa21cdf89f62ae ]---
Jan 28 07:33:55 matrix kernel: #PF: supervisor read access in kernel mode
Jan 28 07:33:55 matrix kernel: #PF: error_code(0x0000) - not-present page
Jan 28 07:33:55 matrix kernel: PGD 0 P4D 0 
Jan 28 07:33:55 matrix kernel: RIP: 0010:0x0
Jan 28 07:33:55 matrix kernel: Oops: 0000 [#3] SMP NOPTI
Jan 28 07:33:55 matrix kernel: Code: Unable to access opcode bytes at RIP 0xffffffffffffffd6.
Jan 28 07:33:55 matrix kernel: CPU: 4 PID: 1548 Comm: dp_sync_taskq Tainted: P    B D    O      5.13.19-3-pve #1
Jan 28 07:33:55 matrix kernel: RSP: 0018:ffffbcd5c299bd80 EFLAGS: 00010246
Jan 28 07:33:55 matrix kernel: Hardware name: ASUS System Product Name/PRIME B460M-A, BIOS 1605 04/07/2021
Jan 28 07:33:55 matrix kernel: 
Jan 28 07:33:55 matrix kernel: RAX: 0000000000000000 RBX: ffff9faa1c0907c0 RCX: 0000000000000000
Jan 28 07:33:55 matrix kernel: RIP: 0010:sync_dnodes_task+0x3b/0xb0 [zfs]
Jan 28 07:33:55 matrix kernel: RDX: ffff9fa4484e4b00 RSI: ffff9faa1c0907c0 RDI: ffff9fa4493d1430
Jan 28 07:33:55 matrix kernel: Code: 55 41 54 53 8b 77 08 48 8b 3f e8 50 e3 05 00 49 8b 5e 18 49 89 c7 eb 50 49 8b 44 24 50 4c 89 e6 4c 89 ff 48 8b 80 30 01 00 00 <48> 8b 40 18 49 89 84 24 30 03 00 00 e8 c4 e4 05 00 49 8b 44 24 40
Jan 28 07:33:55 matrix kernel: RBP: ffffbcd5c299bda0 R08: 0000000000000001 R09: 0000000000000001
Jan 28 07:33:55 matrix kernel: RSP: 0018:ffffbcd5c29bbdb0 EFLAGS: 00010282
Jan 28 07:33:55 matrix kernel: R10: ffff9fa53279c400 R11: ffff9fa53279c400 R12: ffff9faa1c0907c0
Jan 28 07:33:55 matrix kernel: 
Jan 28 07:33:55 matrix kernel: RAX: 0000000061f0cb91 RBX: ffff9fa83f577c80 RCX: 0000000000000000
Jan 28 07:33:55 matrix kernel: R13: ffff9fa4493d1430 R14: ffff9fab5f371bc0 R15: ffff9fa4493d3280
Jan 28 07:33:55 matrix kernel: RDX: ffff9fa49fbe8c98 RSI: ffff9fa49fbe8bb0 RDI: ffff9fa4493d3480
Jan 28 07:33:55 matrix kernel: FS:  0000000000000000(0000) GS:ffff9fab8f180000(0000) knlGS:0000000000000000
Jan 28 07:33:55 matrix kernel: RBP: ffffbcd5c29bbdd8 R08: 0000000000000001 R09: 0000000000000001
Jan 28 07:33:55 matrix kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 28 07:33:55 matrix kernel: R10: ffff9faa40b33200 R11: ffff9faa40b33200 R12: ffff9fa49fbe8bb0
Jan 28 07:33:55 matrix kernel: CR2: ffffffffffffffd6 CR3: 000000022229a006 CR4: 00000000007726e0
Jan 28 07:33:55 matrix kernel: R13: ffff9fa4493d13b0 R14: ffff9fab5f371320 R15: ffff9fa4493d3480
Jan 28 07:33:55 matrix kernel: PKRU: 55555554
Jan 28 07:33:55 matrix kernel: FS:  0000000000000000(0000) GS:ffff9fab8f000000(0000) knlGS:0000000000000000
Jan 28 07:33:55 matrix kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 28 07:33:55 matrix kernel: CR2: 0000000061f0cba9 CR3: 000000034fc10002 CR4: 00000000007726e0
Jan 28 07:33:55 matrix kernel: PKRU: 55555554
Jan 28 07:33:55 matrix kernel: Call Trace:
Jan 28 07:33:55 matrix kernel:  <TASK>
Jan 28 07:33:55 matrix kernel:  taskq_thread+0x2b7/0x500 [spl]
Jan 28 07:33:55 matrix kernel:  ? wake_up_q+0xa0/0xa0
Jan 28 07:33:55 matrix kernel:  ? taskq_thread_spawn+0x60/0x60 [spl]
Jan 28 07:33:55 matrix kernel:  kthread+0x128/0x150
Jan 28 07:33:55 matrix kernel: RIP: 0010:0x0
Jan 28 07:33:55 matrix kernel:  ? set_kthread_struct+0x50/0x50
Jan 28 07:33:55 matrix kernel: Code: Unable to access opcode bytes at RIP 0xffffffffffffffd6.
Jan 28 07:33:55 matrix kernel:  ret_from_fork+0x1f/0x30
Jan 28 07:33:55 matrix kernel: RSP: 0018:ffffbcd5c299bd80 EFLAGS: 00010246
Jan 28 07:33:55 matrix kernel:  </TASK>
Jan 28 07:33:55 matrix kernel: Modules linked in:
Jan 28 07:33:55 matrix kernel: RAX: 0000000000000000 RBX: ffff9faa1c0907c0 RCX: 0000000000000000
Jan 28 07:33:55 matrix kernel:  act_police
Jan 28 07:33:55 matrix kernel: RDX: ffff9fa4484e4b00 RSI: ffff9faa1c0907c0 RDI: ffff9fa4493d1430
Jan 28 07:33:55 matrix kernel:  cls_basic sch_ingress
Jan 28 07:33:55 matrix kernel: RBP: ffffbcd5c299bda0 R08: 0000000000000001 R09: 0000000000000001
Jan 28 07:33:55 matrix kernel:  sch_htb
Jan 28 07:33:55 matrix kernel: R10: ffff9fa53279c400 R11: ffff9fa53279c400 R12: ffff9faa1c0907c0
Jan 28 07:33:55 matrix kernel:  tcp_diag
Jan 28 07:33:55 matrix kernel: R13: ffff9fa4493d1430 R14: ffff9fab5f371bc0 R15: ffff9fa4493d3280
Jan 28 07:33:55 matrix kernel:  inet_diag nfsd auth_rpcgss nfs_acl lockd grace veth ebtable_filter ebtables ip_set ip6table_raw iptable_raw ip6table_filter ip6_tables iptable_filter bpfilter snd_hda_codec_hdmi softdog snd_sof_pci_intel_cnl snd_sof_intel_hda_common soundwire_intel soundwire_generic_allocation soundwire_cadence intel_rapl_msr snd_hda_codec_realtek snd_sof_intel_hda
Jan 28 07:33:55 matrix kernel: FS:  0000000000000000(0000) GS:ffff9fab8f300000(0000) knlGS:0000000000000000
Jan 28 07:33:55 matrix kernel:  intel_rapl_common
Jan 28 07:33:55 matrix kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 28 07:33:55 matrix kernel:  snd_hda_codec_generic
Jan 28 07:33:55 matrix kernel: CR2: ffffffffffffffd6 CR3: 000000022229a001 CR4: 00000000007726e0
Jan 28 07:33:55 matrix kernel:  intel_tcc_cooling
Jan 28 07:33:55 matrix kernel: PKRU: 55555554
Jan 28 07:33:55 matrix kernel:  snd_sof_pci x86_pkg_temp_thermal snd_sof_xtensa_dsp intel_powerclamp snd_sof snd_soc_hdac_hda snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi soundwire_bus ledtrig_audio snd_soc_core snd_compress ppdev kvm_intel ac97_bus snd_pcm_dmaengine nfnetlink_log zfs(PO) nfnetlink kvm snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi zunicode(PO) crct10dif_pclmul snd_hda_codec ghash_clmulni_intel zzstd(O) mei_hdcp aesni_intel zlua(O) i915 snd_hda_core zavl(PO) crypto_simd cryptd snd_hwdep icp(PO) rapl snd_pcm snd_timer drm_kms_helper intel_cstate cec rc_core snd joydev input_leds i2c_algo_bit eeepc_wmi mei_me soundcore fb_sys_fops asus_wmi syscopyarea sysfillrect mei sparse_keymap pcspkr mxm_wmi sysimgblt efi_pstore wmi_bmof ee1004 parport_pc zcommon(PO) parport znvpair(PO) mac_hid acpi_tad acpi_pad spl(O) vhost_net vhost vhost_iotlb tap ib_iser rdma_cm iw_cm ib_cm ib_core iscsi_tcp libiscsi_tcp libiscsi scsi_transport_iscsi wireguard curve25519_x86_64 libchacha20poly1305
Comment 19 Dmitry K 2023-08-29 05:03:05 UTC
Aug 29 07:37:28 laptop kernel: BUG: unable to handle page fault for address: ffff93785a5d3904
Aug 29 07:37:28 laptop kernel: #PF: supervisor read access in kernel mode
Aug 29 07:37:28 laptop kernel: #PF: error_code(0x0000) - not-present page
Aug 29 07:37:28 laptop kernel: PGD 0 P4D 0 
Aug 29 07:37:28 laptop kernel: Oops: 0000 [#5] PREEMPT SMP NOPTI
Aug 29 07:37:28 laptop kernel: CPU: 3 PID: 26282 Comm: mc Tainted: P      D W  OE      6.2.0-26-generic #26~22.04.1-Ubuntu
Aug 29 07:37:28 laptop kernel: Hardware name: Dell Inc. XPS 15 9520/0MWGD4, BIOS 1.16.0 07/07/2023
Aug 29 07:37:28 laptop kernel: RIP: 0010:__d_lookup_rcu+0x8c/0x170
Aug 29 07:37:28 laptop kernel: Code: ff ff ff ff 48 c1 ee 20 41 89 f1 49 89 f2 41 83 e1 07 42 8d 0c cd 00 00 00 00 49 d3 e3 49 f7 d3 eb 08 48 8b 00 48 85 c0 74 7b <44> 8b 40 fc 4c 39 60 10 75 ee 48 83 78 08 00 74 e7 48 39 58 1>
Aug 29 07:37:28 laptop kernel: RSP: 0018:ffffa2e25f2efb40 EFLAGS: 00010287
Aug 29 07:37:28 laptop kernel: RAX: ffff93785a5d3908 RBX: 00000011dfdd8c58 RCX: 0000000000000008
Aug 29 07:37:28 laptop kernel: RDX: 00000000006feec6 RSI: 0000000000000011 RDI: ffff978403ab5780
Aug 29 07:37:28 laptop kernel: RBP: ffffa2e25f2efb70 R08: 0000000000000000 R09: 0000000000000001
Aug 29 07:37:28 laptop kernel: R10: 0000000000000011 R11: 00000000000000ff R12: ffff978403ab5780
Aug 29 07:37:28 laptop kernel: R13: ffffa2e25f2efc64 R14: ffff9775811d0053 R15: ffffa2e25f2efc30
Aug 29 07:37:28 laptop kernel: FS:  00007f702d905b80(0000) GS:ffff9784cf4c0000(0000) knlGS:0000000000000000
Aug 29 07:37:28 laptop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 29 07:37:28 laptop kernel: CR2: ffff93785a5d3904 CR3: 0000000116dae000 CR4: 0000000000750ee0
Aug 29 07:37:28 laptop kernel: PKRU: 55555554
Aug 29 07:37:28 laptop kernel: Call Trace:
Aug 29 07:37:28 laptop kernel:  <TASK>
Aug 29 07:37:28 laptop kernel:  lookup_fast+0x2c/0x100
Aug 29 07:37:28 laptop kernel:  walk_component+0x2c/0x190
Aug 29 07:37:28 laptop kernel:  path_lookupat+0x6e/0x1c0
Aug 29 07:37:28 laptop kernel:  ? try_to_unlazy+0x60/0xe0
Aug 29 07:37:28 laptop kernel:  filename_lookup+0xcf/0x1e0
Aug 29 07:37:28 laptop kernel:  ? check_heap_object+0x14d/0x1a0
Aug 29 07:37:28 laptop kernel:  ? __check_object_size.part.0+0x72/0x150
Aug 29 07:37:28 laptop kernel:  vfs_statx+0x91/0x170
Aug 29 07:37:28 laptop kernel:  vfs_fstatat+0x59/0x80
Aug 29 07:37:28 laptop kernel:  __do_sys_newfstatat+0x2b/0x70
Aug 29 07:37:28 laptop kernel:  ? __do_sys_newfstatat+0x3e/0x70
Aug 29 07:37:28 laptop kernel:  __x64_sys_newfstatat+0x1c/0x30
Aug 29 07:37:28 laptop kernel:  do_syscall_64+0x59/0x90
Aug 29 07:37:28 laptop kernel:  ? exit_to_user_mode_prepare+0x3b/0xd0
Aug 29 07:37:28 laptop kernel:  ? syscall_exit_to_user_mode+0x2a/0x50
Aug 29 07:37:28 laptop kernel:  ? do_syscall_64+0x69/0x90
Aug 29 07:37:28 laptop kernel:  ? syscall_exit_to_user_mode+0x2a/0x50
Aug 29 07:37:28 laptop kernel:  ? do_syscall_64+0x69/0x90
Aug 29 07:37:28 laptop kernel:  ? do_syscall_64+0x69/0x90
Aug 29 07:37:28 laptop kernel:  ? do_syscall_64+0x69/0x90
Aug 29 07:37:28 laptop kernel:  ? do_syscall_64+0x69/0x90
Aug 29 07:37:28 laptop kernel:  entry_SYSCALL_64_after_hwframe+0x72/0xdc
Aug 29 07:37:28 laptop kernel: RIP: 0033:0x7f702d313eee
Aug 29 07:37:28 laptop kernel: Code: 48 89 f2 b9 00 01 00 00 48 89 fe bf 9c ff ff ff e9 07 00 00 00 0f 1f 80 00 00 00 00 f3 0f 1e fa 41 89 ca b8 06 01 00 00 0f 05 <3d> 00 f0 ff ff 77 0b 31 c0 c3 0f 1f 84 00 00 00 00 00 48 8b 1>
Aug 29 07:37:28 laptop kernel: RSP: 002b:00007ffcabae9e08 EFLAGS: 00000246 ORIG_RAX: 0000000000000106
Aug 29 07:37:28 laptop kernel: RAX: ffffffffffffffda RBX: 000055594a92e170 RCX: 00007f702d313eee
Aug 29 07:37:28 laptop kernel: RDX: 00007ffcabae9e40 RSI: 000055594a923cb0 RDI: 00000000ffffff9c
Aug 29 07:37:28 laptop kernel: RBP: 000055594a951a40 R08: 0000555948bd2940 R09: 0000000000000001
Aug 29 07:37:28 laptop kernel: R10: 0000000000000100 R11: 0000000000000246 R12: 000055594a95b2b0
Aug 29 07:37:28 laptop kernel: R13: 000055594a963930 R14: 000055594a94fe80 R15: 00007ffcabae9e40
Aug 29 07:37:28 laptop kernel:  </TASK>
Aug 29 07:37:28 laptop kernel: Modules linked in: tls rfcomm xt_CHECKSUM xt_MASQUERADE xt_conntrack ipt_REJECT nf_reject_ipv4 xt_tcpudp nft_compat nft_chain_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 nf_tables nfnet>
Aug 29 07:37:28 laptop kernel:  snd_intel_sdw_acpi kvm_intel sunrpc snd_hda_codec binfmt_misc snd_hda_core dell_laptop kvm iwlmvm snd_hwdep irqbypass crct10dif_pclmul mei_pxp mei_hdcp polyval_clmulni snd_pcm intel_rapl_msr joy>
Aug 29 07:37:28 laptop kernel:  nvidia_uvm(POE) acpi_thermal_rel sparse_keymap acpi_pad acpi_tad zstd sch_fq_codel zram coretemp msr parport_pc ppdev lp ramoops parport reed_solomon pstore_blk pstore_zone efi_pstore ip_tables >
Aug 29 07:37:28 laptop kernel: CR2: ffff93785a5d3904
Aug 29 07:37:28 laptop kernel: ---[ end trace 0000000000000000 ]---
Aug 29 07:37:28 laptop kernel: RIP: 0010:__d_lookup_rcu+0x8c/0x170
Aug 29 07:37:28 laptop kernel: Code: ff ff ff ff 48 c1 ee 20 41 89 f1 49 89 f2 41 83 e1 07 42 8d 0c cd 00 00 00 00 49 d3 e3 49 f7 d3 eb 08 48 8b 00 48 85 c0 74 7b <44> 8b 40 fc 4c 39 60 10 75 ee 48 83 78 08 00 74 e7 48 39 58 1>
Aug 29 07:37:28 laptop kernel: RSP: 0018:ffffa2e24f1c3b00 EFLAGS: 00010287
Aug 29 07:37:28 laptop kernel: RAX: 00000000000064e0 RBX: 000000157a3c23e9 RCX: 0000000000000028
Aug 29 07:37:28 laptop kernel: RDX: 00000000003d1e11 RSI: 0000000000000015 RDI: ffff977747fed000
Aug 29 07:37:28 laptop kernel: RBP: ffffa2e24f1c3b30 R08: 0000000000000000 R09: 0000000000000005
Aug 29 07:37:28 laptop kernel: R10: 0000000000000015 R11: 000000ffffffffff R12: ffff977747fed000
Aug 29 07:37:28 laptop kernel: R13: ffffa2e24f1c3c24 R14: ffff9775a18f8054 R15: ffffa2e24f1c3bf0
Aug 29 07:37:28 laptop kernel: FS:  00007f702d905b80(0000) GS:ffff9784cf4c0000(0000) knlGS:0000000000000000
Aug 29 07:37:28 laptop kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Aug 29 07:37:28 laptop kernel: CR2: ffff93785a5d3904 CR3: 0000000116dae000 CR4: 0000000000750ee0
Aug 29 07:37:28 laptop kernel: PKRU: 55555554
Comment 20 Erhard F. 2023-08-30 23:47:16 UTC
Closing here as the original issue on my machine was fixed back in 2020 and was connected to radeon/ttm modules.

Apart from the issue taz.007 got with his card + nouveau driver the other "BUG: unable to handle page fault for address" likely have different origins.

If your issues persist please read https://docs.kernel.org/admin-guide/reporting-issues.html#step-by-step-guide-how-to-report-issues-to-the-kernel-maintainers and open new bugs as stated there.

If your are not running a vanilla kernel but a distribution kernel better report your problems to your Linux Distribution Maintainers first. ;)