Bug 218818 - BUG: unable to handle page fault for address: 00000000000a0955
Summary: BUG: unable to handle page fault for address: 00000000000a0955
Status: NEW
Alias: None
Product: Memory Management
Classification: Unclassified
Component: Page Allocator (show other bugs)
Hardware: All Linux
: P3 normal
Assignee: Andrew Morton
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2024-05-08 06:58 UTC by Jean-Louis Dupond
Modified: 2024-11-16 13:41 UTC (History)
5 users (show)

See Also:
Kernel Version:
Subsystem:
Regression: No
Bisected commit-id:


Attachments

Description Jean-Louis Dupond 2024-05-08 06:58:16 UTC
I've hit the following crash/hang for the 4th time or so. Mostly on shutting down firefox or another quite memory intensive piece of software.
It started quite recently, afaik only since 6.8.x kernel.

apr 17 08:24:17 xxx kernel: BUG: unable to handle page fault for address: 00000000000a0955
apr 17 08:24:17 xxx kernel: #PF: supervisor read access in kernel mode
apr 17 08:24:17 xxx kernel: #PF: error_code(0x0000) - not-present page
apr 17 08:24:17 xxx kernel: PGD 0 P4D 0
apr 17 08:24:17 xxx kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI
apr 17 08:24:17 xxx kernel: CPU: 1 PID: 354474 Comm: Isolated Web Co Tainted: P        W  OE      6.8.4-arch1-1 #1 7ea0d8fced45b5f098eb034690645970f116c34c
apr 17 08:24:17 xxx kernel: Hardware name: Dell Inc. XPS 15 9530/01WV13, BIOS 1.9.0 11/13/2023
apr 17 08:24:17 xxx kernel: RIP: 0010:zswap_lru_del+0x4e/0x80
apr 17 08:24:17 xxx kernel: Code: e1 32 01 48 01 d0 48 c1 e8 0c 48 c1 e0 06 48 03 05 ff e0 32 01 4c 8b 20 e8 cf 92 dc ff 48 8b 4d 38 49 c1 ec 3b 48 85 c9 74 04 <48> 8b 49 10 48 8d 75 40 44 89 e2 48 89 df e8 9f 46 fa ff 5b 5d 41
apr 17 08:24:17 xxx kernel: RSP: 0018:ffffa744240dba40 EFLAGS: 00010202
apr 17 08:24:17 xxx kernel: RAX: 0000000000000002 RBX: ffff8f9c49e6a9f0 RCX: 00000000000a0945
apr 17 08:24:17 xxx kernel: RDX: ffff8f9cd944ea80 RSI: ffff8fa1b720c910 RDI: ffff8f9c49e6a9f0
apr 17 08:24:17 xxx kernel: RBP: ffff8fa1b720c910 R08: ffff8f9c44bb6770 R09: 0000000000000000
apr 17 08:24:17 xxx kernel: R10: ffff8fa1b720c050 R11: 0000000000000000 R12: 0000000000000000
apr 17 08:24:17 xxx kernel: R13: 000000000034f1d6 R14: 000000000034f1d5 R15: 000000000034f1d5
apr 17 08:24:17 xxx kernel: FS:  0000000000000000(0000) GS:ffff8fa3ae040000(0000) knlGS:0000000000000000
apr 17 08:24:17 xxx kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
apr 17 08:24:17 xxx kernel: CR2: 00000000000a0955 CR3: 0000000160020000 CR4: 0000000000f50ef0
apr 17 08:24:17 xxx kernel: PKRU: 55555554
apr 17 08:24:17 xxx kernel: Call Trace:
apr 17 08:24:17 xxx kernel:  <TASK>
apr 17 08:24:17 xxx kernel:  ? __die+0x23/0x70
apr 17 08:24:17 xxx kernel:  ? page_fault_oops+0x171/0x4e0
apr 17 08:24:17 xxx kernel:  ? __slab_free+0xdf/0x320
apr 17 08:24:17 xxx kernel:  ? exc_page_fault+0x7f/0x180
apr 17 08:24:17 xxx kernel:  ? asm_exc_page_fault+0x26/0x30
apr 17 08:24:17 xxx kernel:  ? zswap_lru_del+0x4e/0x80
apr 17 08:24:17 xxx kernel:  zswap_free_entry+0x77/0x110
apr 17 08:24:17 xxx kernel:  zswap_invalidate+0x73/0x80
apr 17 08:24:17 xxx kernel:  swap_range_free+0x7e/0xd0
apr 17 08:24:17 xxx kernel:  swapcache_free_entries+0x10e/0x270
apr 17 08:24:17 xxx kernel:  free_swap_slot+0xb4/0xc0
apr 17 08:24:17 xxx kernel:  __swap_entry_free+0x72/0x90
apr 17 08:24:17 xxx kernel:  free_swap_and_cache+0x57/0x100
apr 17 08:24:17 xxx kernel:  unmap_page_range+0x7d1/0x1110
apr 17 08:24:17 xxx kernel:  unmap_vmas+0xb5/0x190
apr 17 08:24:17 xxx kernel:  exit_mmap+0xf7/0x3a0
apr 17 08:24:17 xxx kernel:  __mmput+0x3e/0x130
apr 17 08:24:17 xxx kernel:  do_exit+0x31c/0xb60
apr 17 08:24:17 xxx kernel:  do_group_exit+0x31/0x80
apr 17 08:24:17 xxx kernel:  __x64_sys_exit_group+0x18/0x20
apr 17 08:24:17 xxx kernel:  do_syscall_64+0x86/0x170
apr 17 08:24:17 xxx kernel:  ? do_user_addr_fault+0x304/0x670
apr 17 08:24:17 xxx kernel:  ? exc_page_fault+0x7f/0x180
apr 17 08:24:17 xxx kernel: entry_SYSCALL_64_after_hwframe+0x73/0x7b
apr 17 08:24:17 xxx kernel: RIP: 0033:0x75dc81af366d
apr 17 08:24:17 xxx kernel: Code: Unable to access opcode bytes at 0x75dc81af3643.
apr 17 08:24:17 xxx kernel: RSP: 002b:00007ffd45231bb8 EFLAGS: 00000246 ORIG_RAX: 00000000000000e7
apr 17 08:24:17 xxx kernel: RAX: ffffffffffffffda RBX: 000075dc818b0130 RCX: 000075dc81af366d
apr 17 08:24:17 xxx kernel: RDX: 00000000000000e7 RSI: ffffffffffffff40 RDI: 0000000000000000
apr 17 08:24:17 xxx kernel: RBP: 0000000000000003 R08: 00000000c36daaec R09: 000075dc502001c8
apr 17 08:24:17 xxx kernel: R10: 000075dc762df4d0 R11: 0000000000000246 R12: 000075dc818b0258
apr 17 08:24:17 xxx kernel: R13: 000075dc7488d878 R14: 000075dc818b0130 R15: 000075dc818b0258
apr 17 08:24:17 xxx kernel:  </TASK>
apr 17 08:24:17 xxx kernel: Modules linked in: ufs uas usb_storage wireguard curve25519_x86_64 libchacha20poly1305 chacha_x86_64 poly1305_x86_64 libcurve25519_generic libchacha ip6_udp_tunnel udp_tunnel tun snd_seq_dummy snd_hrtimer snd_seq ccm algif_aead crypto_null des3_ede_x86_64 des_generic libdes algif_skcipher cmac md4 algif_h>
apr 17 08:24:17 xxx kernel:  snd_compress coretemp ac97_bus pkcs8_key_parser nvidia_drm(POE) snd_pcm_dmaengine nvidia_uvm(POE) mac80211 uvcvideo btusb snd_hda_intel nvidia_modeset(POE) snd_usb_audio videobuf2_vmalloc libarc4 r8153_ecm btrtl snd_intel_dspcfg hid_sensor_hub uvc kvm_intel ptp iTCO_wdt spi_pxa2xx_platform snd_usbmidi_li>
apr 17 08:24:17 xxx kernel:  pcspkr snd_pcm wmi_bmof snd_hda_scodec_cs35l41 drm_exec dell_wmi_descriptor firmware_attributes_class i2c_i801 intel_lpss_pci ucsi_acpi mtd mc ecdh_generic mii intel_rapl_common i2c_smbus joydev cfg80211 snd_timer intel_lpss intel_ish_ipc mei_me typec_ucsi gpu_sched snd_hda_cs_dsp_ctls processor_thermal_>
apr 17 08:24:17 xxx kernel:  gf128mul ghash_clmulni_intel sha512_ssse3 serio_raw sha256_ssse3 atkbd nvme sha1_ssse3 rtsx_pci_sdmmc libps2 aesni_intel mmc_core nvme_core spi_intel_pci vivaldi_fmap xhci_pci crypto_simd cryptd xhci_pci_renesas spi_intel rtsx_pci nvme_auth i8042 serio i915 i2c_algo_bit drm_buddy video wmi ttm intel_gtt >
apr 17 08:24:17 xxx kernel: CR2: 00000000000a0955
apr 17 08:24:17 xxx kernel: ---[ end trace 0000000000000000 ]---
apr 17 08:24:17 xxx kernel: RIP: 0010:zswap_lru_del+0x4e/0x80
apr 17 08:24:17 xxx kernel: Code: e1 32 01 48 01 d0 48 c1 e8 0c 48 c1 e0 06 48 03 05 ff e0 32 01 4c 8b 20 e8 cf 92 dc ff 48 8b 4d 38 49 c1 ec 3b 48 85 c9 74 04 <48> 8b 49 10 48 8d 75 40 44 89 e2 48 89 df e8 9f 46 fa ff 5b 5d 41
apr 17 08:24:17 xxx kernel: RSP: 0018:ffffa744240dba40 EFLAGS: 00010202
apr 17 08:24:17 xxx kernel: RAX: 0000000000000002 RBX: ffff8f9c49e6a9f0 RCX: 00000000000a0945
apr 17 08:24:17 xxx kernel: RDX: ffff8f9cd944ea80 RSI: ffff8fa1b720c910 RDI: ffff8f9c49e6a9f0
apr 17 08:24:17 xxx kernel: RBP: ffff8fa1b720c910 R08: ffff8f9c44bb6770 R09: 0000000000000000
apr 17 08:24:17 xxx kernel: R10: ffff8fa1b720c050 R11: 0000000000000000 R12: 0000000000000000
apr 17 08:24:17 xxx kernel: R13: 000000000034f1d6 R14: 000000000034f1d5 R15: 000000000034f1d5
apr 17 08:24:17 xxx kernel: FS:  0000000000000000(0000) GS:ffff8fa3ae040000(0000) knlGS:0000000000000000
apr 17 08:24:17 xxx kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
apr 17 08:24:17 xxx kernel: CR2: 00000000000a0955 CR3: 0000000160020000 CR4: 0000000000f50ef0
apr 17 08:24:17 xxx kernel: PKRU: 55555554
apr 17 08:24:17 xxx kernel: note: Isolated Web Co[354474] exited with irqs disabled
apr 17 08:24:17 xxx kernel: note: Isolated Web Co[354474] exited with preempt_count 4
apr 17 08:24:17 xxx kernel: Fixing recursive fault but reboot is needed!
apr 17 08:24:17 xxx kernel: BUG: scheduling while atomic: Isolated Web Co/354474/0x00000000
apr 17 08:24:18 xxx kernel: Modules linked in: ufs uas usb_storage wireguard curve25519_x86_64 libchacha20poly1305 chacha_x86_64 poly1305_x86_64 libcurve25519_generic libchacha ip6_udp_tunnel udp_tunnel tun snd_seq_dummy snd_hrtimer snd_seq ccm algif_aead crypto_null des3_ede_x86_64 des_generic libdes algif_skcipher cmac md4 algif_h>
apr 17 08:24:18 xxx kernel:  snd_compress coretemp ac97_bus pkcs8_key_parser nvidia_drm(POE) snd_pcm_dmaengine nvidia_uvm(POE) mac80211 uvcvideo btusb snd_hda_intel nvidia_modeset(POE) snd_usb_audio videobuf2_vmalloc libarc4 r8153_ecm btrtl snd_intel_dspcfg hid_sensor_hub uvc kvm_intel ptp iTCO_wdt spi_pxa2xx_platform snd_usbmidi_li>
apr 17 08:24:18 xxx kernel:  pcspkr snd_pcm wmi_bmof snd_hda_scodec_cs35l41 drm_exec dell_wmi_descriptor firmware_attributes_class i2c_i801 intel_lpss_pci ucsi_acpi mtd mc ecdh_generic mii intel_rapl_common i2c_smbus joydev cfg80211 snd_timer intel_lpss intel_ish_ipc mei_me typec_ucsi gpu_sched snd_hda_cs_dsp_ctls processor_thermal_>
apr 17 08:24:18 xxx kernel:  gf128mul ghash_clmulni_intel sha512_ssse3 serio_raw sha256_ssse3 atkbd nvme sha1_ssse3 rtsx_pci_sdmmc libps2 aesni_intel mmc_core nvme_core spi_intel_pci vivaldi_fmap xhci_pci crypto_simd cryptd xhci_pci_renesas spi_intel rtsx_pci nvme_auth i8042 serio i915 i2c_algo_bit drm_buddy video wmi ttm intel_gtt >
apr 17 08:24:18 xxx kernel: CPU: 1 PID: 354474 Comm: Isolated Web Co Tainted: P      D W  OE      6.8.4-arch1-1 #1 7ea0d8fced45b5f098eb034690645970f116c34c
apr 17 08:24:18 xxx kernel: Hardware name: Dell Inc. XPS 15 9530/01WV13, BIOS 1.9.0 11/13/2023
apr 17 08:24:18 xxx kernel: Call Trace:
apr 17 08:24:18 xxx kernel:  <TASK>
apr 17 08:24:18 xxx kernel:  dump_stack_lvl+0x47/0x60
apr 17 08:24:18 xxx kernel:  __schedule_bug+0x56/0x70
apr 17 08:24:18 xxx kernel:  __schedule+0x10f0/0x1520
apr 17 08:24:18 xxx kernel:  ? __wake_up_klogd.part.0+0x3c/0x60
apr 17 08:24:18 xxx kernel:  ? vprintk_emit+0x175/0x2b0
apr 17 08:24:18 xxx kernel:  ? _printk+0x64/0x80
apr 17 08:24:18 xxx kernel:  do_task_dead+0x43/0x50
apr 17 08:24:18 xxx kernel:  make_task_dead+0x151/0x170
apr 17 08:24:18 xxx kernel:  rewind_stack_and_make_dead+0x17/0x20
apr 17 08:24:18 xxx kernel: RIP: 0033:0x75dc81af366d
apr 17 08:24:18 xxx kernel: Code: Unable to access opcode bytes at 0x75dc81af3643.
apr 17 08:24:18 xxx kernel: RSP: 002b:00007ffd45231bb8 EFLAGS: 00000246 ORIG_RAX: 00000000000000e7
apr 17 08:24:18 xxx kernel: RAX: ffffffffffffffda RBX: 000075dc818b0130 RCX: 000075dc81af366d
apr 17 08:24:18 xxx kernel: RDX: 00000000000000e7 RSI: ffffffffffffff40 RDI: 0000000000000000
apr 17 08:24:18 xxx kernel: RBP: 0000000000000003 R08: 00000000c36daaec R09: 000075dc502001c8
apr 17 08:24:18 xxx kernel: R10: 000075dc762df4d0 R11: 0000000000000246 R12: 000075dc818b0258
apr 17 08:24:18 xxx kernel: R13: 000075dc7488d878 R14: 000075dc818b0130 R15: 000075dc818b0258
apr 17 08:24:18 xxx kernel:  </TASK>
apr 17 08:24:18 xxx kernel: ------------[ cut here ]------------
apr 17 08:24:18 xxx kernel: Voluntary context switch within RCU read-side critical section!
apr 17 08:24:18 xxx kernel: WARNING: CPU: 1 PID: 354474 at kernel/rcu/tree_plugin.h:320 rcu_note_context_switch+0x591/0x680
apr 17 08:24:18 xxx kernel: Modules linked in: ufs uas usb_storage wireguard curve25519_x86_64 libchacha20poly1305 chacha_x86_64 poly1305_x86_64 libcurve25519_generic libchacha ip6_udp_tunnel udp_tunnel tun snd_seq_dummy snd_hrtimer snd_seq ccm algif_aead crypto_null des3_ede_x86_64 des_generic libdes algif_skcipher cmac md4 algif_h>
apr 17 08:24:18 xxx kernel:  snd_compress coretemp ac97_bus pkcs8_key_parser nvidia_drm(POE) snd_pcm_dmaengine nvidia_uvm(POE) mac80211 uvcvideo btusb snd_hda_intel nvidia_modeset(POE) snd_usb_audio videobuf2_vmalloc libarc4 r8153_ecm btrtl snd_intel_dspcfg hid_sensor_hub uvc kvm_intel ptp iTCO_wdt spi_pxa2xx_platform snd_usbmidi_li>
apr 17 08:24:18 xxx kernel:  pcspkr snd_pcm wmi_bmof snd_hda_scodec_cs35l41 drm_exec dell_wmi_descriptor firmware_attributes_class i2c_i801 intel_lpss_pci ucsi_acpi mtd mc ecdh_generic mii intel_rapl_common i2c_smbus joydev cfg80211 snd_timer intel_lpss intel_ish_ipc mei_me typec_ucsi gpu_sched snd_hda_cs_dsp_ctls processor_thermal_>
apr 17 08:24:18 xxx kernel:  gf128mul ghash_clmulni_intel sha512_ssse3 serio_raw sha256_ssse3 atkbd nvme sha1_ssse3 rtsx_pci_sdmmc libps2 aesni_intel mmc_core nvme_core spi_intel_pci vivaldi_fmap xhci_pci crypto_simd cryptd xhci_pci_renesas spi_intel rtsx_pci nvme_auth i8042 serio i915 i2c_algo_bit drm_buddy video wmi ttm intel_gtt >
apr 17 08:24:18 xxx kernel: CPU: 1 PID: 354474 Comm: Isolated Web Co Tainted: P      D W  OE      6.8.4-arch1-1 #1 7ea0d8fced45b5f098eb034690645970f116c34c
apr 17 08:24:18 xxx kernel: Hardware name: Dell Inc. XPS 15 9530/01WV13, BIOS 1.9.0 11/13/2023
apr 17 08:24:18 xxx kernel: RIP: 0010:rcu_note_context_switch+0x591/0x680
apr 17 08:24:18 xxx kernel: Code: ff 45 85 ed 0f 84 14 fc ff ff 4d 89 bc 24 a0 00 00 00 e9 07 fc ff ff 48 c7 c7 d8 be 23 ab c6 05 79 f7 e4 01 01 e8 2f de f3 ff <0f> 0b e9 c6 fa ff ff 48 89 fe 4c 89 ff e8 ed ab 4f 00 49 83 bc 24
apr 17 08:24:18 xxx kernel: RSP: 0018:ffffa744240dbe30 EFLAGS: 00010082
apr 17 08:24:18 xxx kernel: RAX: 0000000000000000 RBX: ffff8fa3ae0756c0 RCX: 0000000000000027
apr 17 08:24:18 xxx kernel: RDX: ffff8fa3ae0619c8 RSI: 0000000000000001 RDI: ffff8fa3ae0619c0
apr 17 08:24:18 xxx kernel: RBP: 0000000000000000 R08: 0000000000000000 R09: ffffa744240dbcc0
apr 17 08:24:18 xxx kernel: R10: ffffa744240dbcb8 R11: 0000000000000003 R12: ffff8f9cd944ea80
apr 17 08:24:18 xxx kernel: R13: ffff8f9cd944ea80 R14: 0000000000000000 R15: 0000000000000000
apr 17 08:24:18 xxx kernel: FS:  0000000000000000(0000) GS:ffff8fa3ae040000(0000) knlGS:0000000000000000
apr 17 08:24:18 xxx kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
apr 17 08:24:18 xxx kernel: CR2: 00000000000a0955 CR3: 0000000160020000 CR4: 0000000000f50ef0
apr 17 08:24:18 xxx kernel: PKRU: 55555554
apr 17 08:24:18 xxx kernel: Call Trace:
apr 17 08:24:18 xxx kernel:  <TASK>
apr 17 08:24:18 xxx kernel:  ? rcu_note_context_switch+0x591/0x680
apr 17 08:24:18 xxx kernel:  ? __warn+0x81/0x130
apr 17 08:24:18 xxx kernel:  ? rcu_note_context_switch+0x591/0x680
apr 17 08:24:18 xxx kernel:  ? report_bug+0x171/0x1a0
apr 17 08:24:18 xxx kernel:  ? console_unlock+0x78/0x120
apr 17 08:24:18 xxx kernel:  ? handle_bug+0x3c/0x80
apr 17 08:24:18 xxx kernel:  ? exc_invalid_op+0x17/0x70
apr 17 08:24:18 xxx kernel:  ? asm_exc_invalid_op+0x1a/0x20
apr 17 08:24:18 xxx kernel:  ? rcu_note_context_switch+0x591/0x680
apr 17 08:24:18 xxx kernel:  __schedule+0xbe/0x1520
apr 17 08:24:18 xxx kernel:  ? __wake_up_klogd.part.0+0x3c/0x60
apr 17 08:24:18 xxx kernel:  ? vprintk_emit+0x175/0x2b0
apr 17 08:24:18 xxx kernel:  ? _printk+0x64/0x80
apr 17 08:24:18 xxx kernel:  do_task_dead+0x43/0x50
apr 17 08:24:18 xxx kernel:  make_task_dead+0x151/0x170
apr 17 08:24:18 xxx kernel:  rewind_stack_and_make_dead+0x17/0x20
apr 17 08:24:18 xxx kernel: RIP: 0033:0x75dc81af366d
apr 17 08:24:18 xxx kernel: Code: Unable to access opcode bytes at 0x75dc81af3643.
apr 17 08:24:18 xxx kernel: RSP: 002b:00007ffd45231bb8 EFLAGS: 00000246 ORIG_RAX: 00000000000000e7
apr 17 08:24:18 xxx kernel: RAX: ffffffffffffffda RBX: 000075dc818b0130 RCX: 000075dc81af366d
apr 17 08:24:18 xxx kernel: RDX: 00000000000000e7 RSI: ffffffffffffff40 RDI: 0000000000000000
apr 17 08:24:18 xxx kernel: RBP: 0000000000000003 R08: 00000000c36daaec R09: 000075dc502001c8
apr 17 08:24:18 xxx kernel: R10: 000075dc762df4d0 R11: 0000000000000246 R12: 000075dc818b0258
apr 17 08:24:18 xxx kernel: R13: 000075dc7488d878 R14: 000075dc818b0130 R15: 000075dc818b0258
apr 17 08:24:18 xxx kernel:  </TASK>
apr 17 08:24:18 xxx kernel: ---[ end trace 0000000000000000 ]---


Found also some other reports: https://bbs.archlinux.org/viewtopic.php?pid=2161733#p2161733

Had the crash again yesterday on 6.8.9

Let me know if you need additional information! 🙂

Thanks
Jean-Louis
Comment 1 Jean-Louis Dupond 2024-05-08 07:02:21 UTC
yesterday it was:
mei 07 16:38:54 xxx kernel: BUG: unable to handle page fault for address: 00000000000f2bf3
mei 07 16:38:54 xxx kernel: #PF: supervisor read access in kernel mode
mei 07 16:38:54 xxx kernel: #PF: error_code(0x0000) - not-present page

Again when closing Firefox
Comment 2 Artem S. Tashkinov 2024-05-08 13:32:04 UTC
Please run memtest86 or memtest86+ for at least an hour.
Comment 3 Jean-Louis Dupond 2024-05-09 18:39:19 UTC
(In reply to Artem S. Tashkinov from comment #2)
> Please run memtest86 or memtest86+ for at least an hour.

Did run it for +9 hours today, no errors.
So seems like memory itself is fine.
Comment 4 The Linux kernel's regression tracker (Thorsten Leemhuis) 2024-05-10 05:49:53 UTC
Is going back to some earlier kernel series (6.6.y?) for a few days a option to rule out problems of the hardware or some other software?

And if it happens again, could you save the whole error please? That would allow checking if the backtraces are similar or differ each time.
Comment 5 Jean-Louis Dupond 2024-05-10 06:56:13 UTC
(In reply to The Linux kernel's regression tracker (Thorsten Leemhuis) from comment #4)
> Is going back to some earlier kernel series (6.6.y?) for a few days a option
> to rule out problems of the hardware or some other software?

I doubt, thing is that this happens like twice a month. So no good reproducer.
So if I run 6.6.x for example for a week or 2, and it didn't occur. Can we say it's not there or I was just lucky :)

> 
> And if it happens again, could you save the whole error please? That would
> allow checking if the backtraces are similar or differ each time.

I'll do. The last one on the 7th of may just didn't log the whole stacktrace. So I didnt have it.

BUT I think we have some more info in https://bbs.archlinux.org/viewtopic.php?id=294475
The stacktrace there looks similar. And somebody reports it didn't occur anymore since he disabled zswap.

A similar one here also: https://www.reddit.com/r/linux_gaming/comments/1b7qxjp/whenever_i_play_rdr2_for_a_while_and_exit_the/

But I also stumbled upon https://forums.developer.nvidia.com/t/series-550-freezes-laptop/284772
A whole lot of people reporting similar crashes, but all with Nvidia driver installed (just like me).
Comment 6 daniil 2024-05-10 07:01:31 UTC
Check your hard drive/ssd. You may ran out of storage space because its fails on swap
Comment 7 The Linux kernel's regression tracker (Thorsten Leemhuis) 2024-05-10 07:18:18 UTC
> all with Nvidia driver installed (just like me).

Well, then chances are slim that a developer will look into this; and the zswap developers are unlikely to see this report here anyway. For details see:

https://linux-regtracking.leemhuis.info/post/frequent-reasons-why-linux-kernel-bug-reports-are-ignored/
Comment 8 nvaert1986 2024-06-07 09:25:59 UTC
I'm experiencing the exact same issue on a Dell Precision 3581. It also happens on kernel 6.6.32, but it's rare. I'm currently running kernel 6.9.3 where it happens frequently and when it occurs, it always occurs when shutting down the laptop.
Comment 9 Jean-Louis Dupond 2024-06-07 09:27:15 UTC
(In reply to nvaert1986 from comment #8)
> I'm experiencing the exact same issue on a Dell Precision 3581. It also
> happens on kernel 6.6.32, but it's rare. I'm currently running kernel 6.9.3
> where it happens frequently and when it occurs, it always occurs when
> shutting down the laptop.

Also running the nvidia module or not?
For me it seems to be fixed (at least it didn't occur anymore) since the switch to nvidia-open drivers.
Comment 10 nvaert1986 2024-06-07 11:01:11 UTC
(In reply to nvaert1986 from comment #8)
> I'm experiencing the exact same issue on a Dell Precision 3581. It also
> happens on kernel 6.6.32, but it's rare. I'm currently running kernel 6.9.3
> where it happens frequently and when it occurs, it always occurs when
> shutting down the laptop.

I've updated to te latest 550 series nvidia-drivers in ~amd64 for now to see what that does. Hopefully it'll fix the annoying bug.
Comment 11 Alexey Morar 2024-11-16 13:41:20 UTC
Hi.
I have the same problem on Arch Linux 

This host is used as hypervisor, however, all the VM are shutdown. My network is set up as a bridge. 
Interestingly, today I hit this error after adding the second IP address with a different CIDR and start ping devices on that network. 

[root@archlinux ~]# [ 6186.630109] BUG: unable to handle page fault for address: 0000000000204fa0
[ 6186.630491] #PF: supervisor write access in kernel mode
[ 6186.630566] #PF: error_code(0x0002) - not-present page
[ 6186.630640] PGD 0 P4D 0
[ 6186.630685] Oops: Oops: 0002 [#1] PREEMPT SMP PTI
[ 6186.630760] CPU: 3 UID: 0 PID: 0 Comm: swapper/3 Not tainted 6.11.7-arch1-1 #1 1400000003000000474e5500ee13b5ab63fad4da
[ 6186.630931] Hardware name: HP HP EliteDesk 800 G3 SFF/8299, BIOS P01 Ver. 02.50 07/17/2024
[ 6186.631044] RIP: 0010:_raw_spin_lock+0x17/0x30
[ 6186.631115] Code: 0f 1f 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 f3 0f 1e fa 0f 1f 44 00 00 65 ff 05 a8 c2 1b 5c 31 c0 ba 01 00 00 00 <f0> 0f b1 17 75 05 c3 cc cc cc cc 89 c6 e8 f7 01 00 00 90 c3 cc cc
[ 6186.631350] RSP: 0018:ffffba6400204f98 EFLAGS: 00010046
[ 6186.631427] RAX: 0000000000000000 RBX: ffff8f6cc104cc00 RCX: 00000001001af23f
[ 6186.631524] RDX: 0000000000000001 RSI: 76ffffffa3e69dc6 RDI: 0000000000204fa0
[ 6186.631622] RBP: 0000000000000001 R08: 0000160bcfc9116a R09: 30c7a363b7c70caf
[ 6186.632020] R10: 0000000000000000 R11: ffffba6400204ff8 R12: ffff8f6cc104cce4
[ 6186.632114] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[ 6186.632211] FS:  0000000000000000(0000) GS:ffff8f71dbb80000(0000) knlGS:0000000000000000
[ 6186.632322] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 6186.632402] CR2: 0000000000204fa0 CR3: 0000000122476006 CR4: 00000000003706f0
[ 6186.632498] Call Trace:
[ 6186.632539]  <IRQ>
[ 6186.632575]  ? __die_body.cold+0x19/0x27
[ 6186.632637]  ? page_fault_oops+0x15a/0x2d0
[ 6186.632705]  ? exc_page_fault+0x81/0x190
[ 6186.632765]  ? asm_exc_page_fault+0x26/0x30
[ 6186.632831]  ? _raw_spin_lock+0x17/0x30
[ 6186.632894]  handle_irq_event+0x56/0x90
[ 6186.632958]  handle_edge_irq+0x9a/0x260
[ 6186.633019]  __common_interrupt+0x3e/0xa0
[ 6186.633084]  common_interrupt+0x80/0xa0
[ 6186.633196]  </IRQ>
[ 6186.633233]  <TASK>
[ 6186.633266]  asm_common_interrupt+0x26/0x40
[ 6186.633693] RIP: 0010:cpuidle_enter_state+0xc6/0x420
[ 6186.633767] Code: 00 00 e8 7d 53 2d ff e8 28 f1 ff ff 49 89 c5 0f 1f 44 00 00 31 ff e8 39 34 2c ff 45 84 ff 0f 85 aa 01 00 00 fb 0f 1f 44 00 00 <45> 85 f6 0f 88 84 01 00 00 49 63 d6 48 8d 04 52 48 8d 04 82 49 8d
[ 6186.634011] RSP: 0018:ffffba6400147e80 EFLAGS: 00000246
[ 6186.634091] RAX: ffff8f71dbb80000 RBX: 0000000000000002 RCX: 0000000000000000
[ 6186.634196] RDX: 000005a06fe37981 RSI: fffffffdbfe789de RDI: 0000000000000000
[ 6186.634294] RBP: ffff8f71dbbc12c8 R08: 0000000000000004 R09: 000000000000004e
[ 6186.634390] R10: 0000000000000018 R11: ffff8f71dbbb4be4 R12: ffffffffa5152b80
[ 6186.634487] R13: 000005a06fe37981 R14: 0000000000000002 R15: 0000000000000000
[ 6186.634593]  cpuidle_enter+0x2d/0x40
[ 6186.634654]  do_idle+0x1b0/0x210
[ 6186.634711]  cpu_startup_entry+0x29/0x30
[ 6186.634771]  start_secondary+0x11c/0x140
[ 6186.634832]  common_startup_64+0x13e/0x141
[ 6186.634904]  </TASK>
[ 6186.635314] Modules linked in: xt_nat xt_tcpudp veth xt_conntrack xt_MASQUERADE nf_conntrack_netlink xfrm_user xfrm_algo ip6table_nat ip6table_filter ip6_tables iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c xt_addrtype iptable_filter overlay rpcrdma rdma_cm iw_cm ib_cm ib_core bridge stp llc vfat fat intel_rapl_msr intel_rapl_common intel_uncore_frequency intel_uncore_frequency_common intel_pmc_core_pltdrv intel_pmc_core intel_vsec pmt_telemetry pmt_class intel_tcc_cooling x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_soc_avs snd_soc_hda_codec snd_hda_ext_core snd_hda_codec_hdmi kvm snd_hda_codec_conexant snd_soc_core snd_hda_codec_generic crct10dif_pclmul crc32_pclmul snd_compress polyval_clmulni ac97_bus polyval_generic snd_pcm_dmaengine ghash_clmulni_intel snd_hda_intel snd_usb_audio sha512_ssse3 snd_intel_dspcfg sha256_ssse3 snd_intel_sdw_acpi snd_usbmidi_lib sha1_ssse3 snd_ump snd_hda_codec snd_rawmidi aesni_intel iwlwifi snd_hda_core gf128mul snd_seq_device crypto_simd
[ 6186.635473]  snd_hwdep cryptd mc mei_wdt mei_hdcp mei_pxp r8169 snd_pcm rapl cfg80211 realtek intel_cstate snd_timer hp_wmi mdio_devres platform_profileintel_uncore mei_me sparse_keymap psmouse wmi_bmof snd libphy pcspkr rfkill mei soundcore acpi_pad mousedev joydev mac_hid nfsd auth_rpcgss nfs_acl lockd grace crypto_user loop dm_mod sunrpc nfnetlink ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 hid_logitech_hidpp hid_logitech_dj hid_generic usbhid nouveau i915 drm_ttm_helper gpu_sched serio_raw drm_gpuvm atkbd drm_exec libps2 mxm_wmi vivaldi_fmap drm_buddy i2c_algo_bit nvme crc32c_intel intel_gtt ttm nvme_core drm_display_helper nvme_auth xhci_pci cec xhci_pci_renesas video i8042 serio wmi vfio_pci vfio_pci_core vfio_iommu_type1 vfio iommufd
[ 6186.637799] CR2: 0000000000204fa0
[ 6186.637851] ---[ end trace 0000000000000000 ]---
[ 6186.637921] RIP: 0010:_raw_spin_lock+0x17/0x30
[ 6186.637989] Code: 0f 1f 00 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 90 f3 0f 1e fa 0f 1f 44 00 00 65 ff 05 a8 c2 1b 5c 31 c0 ba 01 00 00 00 <f0> 0f b1 17 75 05 c3 cc cc cc cc 89 c6 e8 f7 01 00 00 90 c3 cc cc
[ 6186.638453] RSP: 0018:ffffba6400204f98 EFLAGS: 00010046
[ 6186.638528] RAX: 0000000000000000 RBX: ffff8f6cc104cc00 RCX: 00000001001af23f
[ 6186.638628] RDX: 0000000000000001 RSI: 76ffffffa3e69dc6 RDI: 0000000000204fa0
[ 6186.638724] RBP: 0000000000000001 R08: 0000160bcfc9116a R09: 30c7a363b7c70caf
[ 6186.638818] R10: 0000000000000000 R11: ffffba6400204ff8 R12: ffff8f6cc104cce4
[ 6186.638914] R13: 0000000000000000 R14: 0000000000000000 R15: 0000000000000000
[ 6186.639011] FS:  0000000000000000(0000) GS:ffff8f71dbb80000(0000) knlGS:0000000000000000
[ 6186.639121] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[ 6186.639201] CR2: 0000000000204fa0 CR3: 0000000122476006 CR4: 00000000003706f0
[ 6186.639299] Kernel panic - not syncing: Fatal exception in interrupt
[ 6186.639445] Kernel Offset: 0x22000000 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffffbfffffff)
[ 6186.639706] ---[ end Kernel panic - not syncing: Fatal exception in interrupt ]---

Note You need to log in before you can comment on or make changes to this bug.