Bug 211323

Summary: Ops mens
Product: Memory Management Reporter: Joilnen (joilnen)
Component: Page AllocatorAssignee: Andrew Morton (akpm)
Status: NEW ---    
Severity: normal CC: kernel, koalp, milan.plzik
Priority: P1    
Hardware: IA-64   
OS: Linux   
Kernel Version: Linux version 5.10.7-arch1-1 Subsystem:
Regression: No Bisected commit-id:
Attachments: dmesg content after triggering the issue.

Description Joilnen 2021-01-24 03:27:07 UTC
------------[ cut here ]------------
Jan 23 23:19:35 dijkstra kernel: kernel BUG at include/linux/mm.h:707!
Jan 23 23:19:35 dijkstra kernel: invalid opcode: 0000 [#1] PREEMPT SMP PTI
Jan 23 23:19:36 dijkstra kernel: CPU: 0 PID: 64876 Comm: kworker/0:2 Tainted: G        W  OE     5.10.7-arch1-1 #1
Jan 23 23:19:36 dijkstra kernel: Hardware name: LENOVO 80UJ/Nano 4A2, BIOS 1GCN19WW 10/03/2016
Jan 23 23:19:36 dijkstra kernel: Workqueue: zswap-shrink shrink_worker
Jan 23 23:19:36 dijkstra kernel: RIP: 0010:__free_pages+0x83/0x90
Jan 23 23:19:36 dijkstra kernel: Code: 33 e4 ff ff eb d1 85 f6 75 09 5b 5d 41 5c e9 a4 fe ff ff 5b 31 d2 5d 41 5c e9 19 e4 ff ff 48 c7 c
6 00 02 94 a1 e8 dd 64 fd ff <0f> 0b 5b 5d 41 5c c3 66 0f 1f 44 00 00 0f 1f 44 00 00 48 85 ff 75
Jan 23 23:19:36 dijkstra kernel: RSP: 0018:ffffab14c2f4fd60 EFLAGS: 00010286
Jan 23 23:19:36 dijkstra kernel: RAX: 0000000000000000 RBX: ffff8f51f7101c00 RCX: ffff8f5239218bb8
Jan 23 23:19:36 dijkstra kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff8f5239218bb0
Jan 23 23:19:36 dijkstra kernel: RBP: ffffcbed40dcff40 R08: 0000000000000000 R09: ffffab14c2f4fa60
Jan 23 23:19:36 dijkstra kernel: R10: ffffab14c2f4fa58 R11: ffffffffa20cb228 R12: ffffcbed40dcff40
Jan 23 23:19:36 dijkstra kernel: R13: ffffcbed40dcff48 R14: ffff8f51f73fd000 R15: ffffcbed40dcff48
Jan 23 23:19:36 dijkstra kernel: FS:  0000000000000000(0000) GS:ffff8f5239200000(0000) knlGS:0000000000000000
Jan 23 23:19:36 dijkstra kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jan 23 23:19:36 dijkstra kernel: CR2: 00007f80dd71e000 CR3: 0000000006e10000 CR4: 00000000001006f0
Jan 23 23:19:36 dijkstra kernel: Call Trace:
Jan 23 23:19:36 dijkstra kernel:  z3fold_zpool_shrink+0x146/0x720
Jan 23 23:19:36 dijkstra kernel:  ? sched_clock+0x5/0x10
Jan 23 23:19:36 dijkstra kernel:  ? sched_clock_cpu+0xc/0xb0
Jan 23 23:19:36 dijkstra kernel:  shrink_worker+0x19/0x50
Jan 23 23:19:36 dijkstra kernel:  process_one_work+0x1d6/0x3a0
Jan 23 23:19:36 dijkstra kernel:  worker_thread+0x4d/0x3d0
Jan 23 23:19:36 dijkstra kernel:  ? rescuer_thread+0x410/0x410
Jan 23 23:19:36 dijkstra kernel:  kthread+0x133/0x150
Jan 23 23:19:36 dijkstra kernel:  ? __kthread_bind_mask+0x60/0x60
Jan 23 23:19:36 dijkstra kernel:  ret_from_fork+0x22/0x30
Jan 23 23:19:36 dijkstra kernel: Modules linked in: 8021q garp mrp stp llc ccm rfcomm cmac algif_hash algif_skcipher af_alg bnep intel_r
apl_msr intel_rapl_common intel_powerclamp coretemp kvm_intel hid_logitech_hidpp kvm snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_co
dec_generic ledtrig_audio hid_logitech_dj rtsx_usb_sdmmc irqbypass mmc_core btusb rtsx_usb_ms btrtl btbcm crct10dif_pclmul memstick crc3
2_pclmul uvcvideo btintel ghash_clmulni_intel bluetooth iTCO_wdt aesni_intel videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 intel_pmc
_bxt iTCO_vendor_support wmi_bmof videobuf2_common mei_hdcp crypto_simd videodev cryptd glue_helper mc ecdh_generic intel_cstate ecc snd
_hda_intel snd_intel_dspcfg ath10k_pci soundwire_intel soundwire_generic_allocation ath10k_core rtsx_usb usbhid ath soundwire_cadence sn
d_hda_codec snd_hda_core mac80211 snd_hwdep soundwire_bus intel_xhci_usb_role_switch i915 snd_soc_core cfg80211 roles snd_compress ac97_
bus joydev pcspkr mousedev snd_pcm_dmaengine snd_pcm snd_timer snd r8169
Jan 23 23:19:36 dijkstra kernel:  soundcore libarc4 i2c_algo_bit realtek mdio_devres ideapad_laptop sparse_keymap libphy drm_kms_helper 
rfkill cec wmi intel_gtt mei_txe tpm_crb mei syscopyarea sysfillrect sysimgblt fb_sys_fops mac_hid tpm_tis tpm_tis_core tpm rng_core i2c
_i801 i2c_smbus video lpc_ich intel_int0002_vgpio vboxnetflt(OE) vboxnetadp(OE) vboxdrv(OE) drm fuse agpgart crypto_user bpf_preload ip_
tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 crc32c_intel serio_raw xhci_pci xhci_pci_renesas
Jan 23 23:19:36 dijkstra kernel: ---[ end trace 20875c79439e4309 ]---
Comment 1 Milan Plzik 2021-01-27 19:11:17 UTC
Hello, I'm hitting this same issue. The complete kernel stack trace is attached. The system is in a semi-frozen state after this, with some processes responding and some being stuck. I believe this happens under a heavy swap load, but can't confirm that due to the lack of the HDD activity LED.
Comment 2 Milan Plzik 2021-01-27 19:12:17 UTC
Created attachment 294975 [details]
dmesg content after triggering the issue.
Comment 3 Stefan Kuhn 2021-02-09 16:16:46 UTC
I have the same issue (https://bugzilla.kernel.org/show_bug.cgi?id=211653). I am able to reproduce it running Prime95 (mprime) in stress test mode.
Comment 4 Stefan Kuhn 2021-02-09 16:17:12 UTC
*** Bug 211653 has been marked as a duplicate of this bug. ***
Comment 5 Stefan Kuhn 2021-02-10 08:50:53 UTC
I did try it with my work laptop, which has a similar setup as well.

The problem also occurs:

-- Journal begins at Wed 2021-01-20 17:26:36 CET, ends at Wed 2021-02-10 09:46:33 CET. --
Feb 09 19:14:40 kernel: process '/usr/bin/mprime' started with executable stack
Feb 09 20:14:52 kernel: page:000000004c9ab2e6 refcount:0 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x13388e
Feb 09 20:14:52 kernel: flags: 0x2ffff0000000000()
Feb 09 20:14:52 kernel: raw: 02ffff0000000000 ffffe814c4f7c088 ffff96449d7b2170 0000000000000000
Feb 09 20:14:52 kernel: raw: 0000000000000000 0000000000000011 00000000ffffffff 0000000000000000
Feb 09 20:14:52 kernel: page dumped because: VM_BUG_ON_PAGE(page_ref_count(page) == 0)
Feb 09 20:14:52 kernel: ------------[ cut here ]------------
Feb 09 20:14:52 kernel: kernel BUG at include/linux/mm.h:707!
Feb 09 20:14:52 kernel: invalid opcode: 0000 [#1] PREEMPT SMP NOPTI
Feb 09 20:14:52 kernel: CPU: 14 PID: 582 Comm: kworker/14:2 Not tainted 5.10.13-2-MANJARO #1
Feb 09 20:14:52 kernel: Hardware name: LENOVO 20TQCTO1WW/20TQCTO1WW, BIOS N30ET33W (1.16 ) 12/17/2020
Feb 09 20:14:52 kernel: Workqueue: zswap-shrink shrink_worker
Feb 09 20:14:52 kernel: RIP: 0010:__free_pages+0x83/0x90
Feb 09 20:14:52 kernel: Code: 33 e4 ff ff eb d1 85 f6 75 09 5b 5d 41 5c e9 a4 fe ff ff 5b 31 d2 5d 41 5c e9 19 e4 ff ff 48 c7 c6 c0 f0 73 bc e8 cd 64 fd ff <0f> 0b 5b 5d 41 5c c3 66 0f 1f 44 00 00 0f 1f 44 00 00 48 85 ff 75
Feb 09 20:14:52 kernel: RSP: 0000:ffffb6d044753d60 EFLAGS: 00010286
Feb 09 20:14:52 kernel: RAX: 0000000000000000 RBX: ffff963d40dc4600 RCX: ffff96449d798bb8
Feb 09 20:14:52 kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff96449d798bb0
Feb 09 20:14:52 kernel: RBP: ffffe814c4ce2380 R08: 0000000000000000 R09: ffffb6d044753a60
Feb 09 20:14:52 kernel: R10: ffffb6d044753a58 R11: ffffffffbcacb228 R12: ffffe814c4ce2380
Feb 09 20:14:52 kernel: R13: ffffe814c4ce2388 R14: ffff963d7388e000 R15: ffffe814c4ce2388
Feb 09 20:14:52 kernel: FS:  0000000000000000(0000) GS:ffff96449d780000(0000) knlGS:0000000000000000
Feb 09 20:14:52 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 09 20:14:52 kernel: CR2: 00007fde3ef28860 CR3: 0000000194c10004 CR4: 00000000007706e0
Feb 09 20:14:52 kernel: PKRU: 55555554
Feb 09 20:14:52 kernel: Call Trace:
Feb 09 20:14:52 kernel:  z3fold_zpool_shrink+0x146/0x720
Feb 09 20:14:52 kernel:  ? sched_clock+0x5/0x10
Feb 09 20:14:52 kernel:  ? sched_clock_cpu+0xc/0xb0
Feb 09 20:14:52 kernel:  shrink_worker+0x19/0x50
Feb 09 20:14:52 kernel:  process_one_work+0x1d6/0x3a0
Feb 09 20:14:52 kernel:  worker_thread+0x4d/0x3d0
Feb 09 20:14:52 kernel:  ? rescuer_thread+0x410/0x410
Feb 09 20:14:52 kernel:  kthread+0x133/0x150
Feb 09 20:14:52 kernel:  ? __kthread_bind_mask+0x60/0x60
Feb 09 20:14:52 kernel:  ret_from_fork+0x1f/0x30
Feb 09 20:14:52 kernel: Modules linked in: ccm rfcomm cmac algif_hash algif_skcipher af_alg bnep uvcvideo btusb btrtl videobuf2_vmalloc btbcm videobuf2_memops videobuf2_v4l2 btintel videobuf2_common bluetooth videodev joydev snd_hda_codec_hdmi mousedev mc ecdh_generic ecc snd_hda_codec_realtek snd_hda_codec_generic snd_sof_pci snd_sof_intel_byt snd_sof_intel_ipc snd_sof_intel_hda_common snd_soc_hdac_hda snd_sof_xtensa_dsp snd_sof_intel_hda snd_sof snd_soc_skl snd_soc_sst_ipc snd_soc_sst_dsp snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi snd_hda_intel iTCO_wdt intel_pmc_bxt snd_intel_dspcfg ee1004 iTCO_vendor_support mei_hdcp mei_wdt intel_rapl_msr soundwire_intel wmi_bmof soundwire_generic_allocation soundwire_cadence intel_wmi_thunderbolt x86_pkg_temp_thermal iwlmvm intel_powerclamp snd_hda_codec coretemp mac80211 i915 squashfs kvm_intel snd_hda_core snd_hwdep libarc4 soundwire_bus vfat kvm fat iwlwifi snd_soc_core irqbypass rapl snd_compress ac97_bus intel_cstate loop snd_pcm_dmaengine
Feb 09 20:14:52 kernel:  i2c_algo_bit intel_uncore snd_pcm psmouse drm_kms_helper pcspkr i2c_i801 cfg80211 e1000e tpm_crb thunderbolt snd_timer i2c_smbus cec mei_me thinkpad_acpi mei intel_gtt ucsi_acpi syscopyarea processor_thermal_device intel_lpss_pci typec_ucsi sysfillrect intel_rapl_common tpm_tis sysimgblt ledtrig_audio intel_lpss idma64 rfkill intel_pch_thermal intel_soc_dts_iosf fb_sys_fops typec tpm_tis_core wmi snd soundcore int3403_thermal int340x_thermal_zone video mac_hid int3400_thermal acpi_pad acpi_thermal_rel drm uinput sg fuse crypto_user agpgart ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 dm_crypt cbc encrypted_keys dm_mod trusted tpm rng_core crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel sdhci_pci crypto_simd cqhci cryptd sdhci glue_helper serio_raw mmc_core xhci_pci
Feb 09 20:14:52 kernel: ---[ end trace bf7896908c608ed4 ]---
Feb 09 20:14:52 kernel: ------------[ cut here ]------------
Feb 09 20:14:52 kernel: WARNING: CPU: 14 PID: 582 at kernel/rcu/tree_plugin.h:297 rcu_note_context_switch+0x42/0x480
Feb 09 20:14:52 kernel: Modules linked in: ccm rfcomm cmac algif_hash algif_skcipher af_alg bnep uvcvideo btusb btrtl videobuf2_vmalloc btbcm videobuf2_memops videobuf2_v4l2 btintel videobuf2_common bluetooth videodev joydev snd_hda_codec_hdmi mousedev mc ecdh_generic ecc snd_hda_codec_realtek snd_hda_codec_generic snd_sof_pci snd_sof_intel_byt snd_sof_intel_ipc snd_sof_intel_hda_common snd_soc_hdac_hda snd_sof_xtensa_dsp snd_sof_intel_hda snd_sof snd_soc_skl snd_soc_sst_ipc snd_soc_sst_dsp snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi snd_hda_intel iTCO_wdt intel_pmc_bxt snd_intel_dspcfg ee1004 iTCO_vendor_support mei_hdcp mei_wdt intel_rapl_msr soundwire_intel wmi_bmof soundwire_generic_allocation soundwire_cadence intel_wmi_thunderbolt x86_pkg_temp_thermal iwlmvm intel_powerclamp snd_hda_codec coretemp mac80211 i915 squashfs kvm_intel snd_hda_core snd_hwdep libarc4 soundwire_bus vfat kvm fat iwlwifi snd_soc_core irqbypass rapl snd_compress ac97_bus intel_cstate loop snd_pcm_dmaengine
Feb 09 20:14:52 kernel:  i2c_algo_bit intel_uncore snd_pcm psmouse drm_kms_helper pcspkr i2c_i801 cfg80211 e1000e tpm_crb thunderbolt snd_timer i2c_smbus cec mei_me thinkpad_acpi mei intel_gtt ucsi_acpi syscopyarea processor_thermal_device intel_lpss_pci typec_ucsi sysfillrect intel_rapl_common tpm_tis sysimgblt ledtrig_audio intel_lpss idma64 rfkill intel_pch_thermal intel_soc_dts_iosf fb_sys_fops typec tpm_tis_core wmi snd soundcore int3403_thermal int340x_thermal_zone video mac_hid int3400_thermal acpi_pad acpi_thermal_rel drm uinput sg fuse crypto_user agpgart ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 dm_crypt cbc encrypted_keys dm_mod trusted tpm rng_core crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel sdhci_pci crypto_simd cqhci cryptd sdhci glue_helper serio_raw mmc_core xhci_pci
Feb 09 20:14:52 kernel: CPU: 14 PID: 582 Comm: kworker/14:2 Tainted: G      D           5.10.13-2-MANJARO #1
Feb 09 20:14:52 kernel: Hardware name: LENOVO 20TQCTO1WW/20TQCTO1WW, BIOS N30ET33W (1.16 ) 12/17/2020
Feb 09 20:14:52 kernel: Workqueue: zswap-shrink shrink_worker
Feb 09 20:14:52 kernel: RIP: 0010:rcu_note_context_switch+0x42/0x480
Feb 09 20:14:52 kernel: Code: c3 40 cf 02 00 65 48 03 1d 73 f2 af 44 0f 1f 44 00 00 40 84 ed 75 15 65 48 8b 04 25 c0 7b 01 00 8b 90 b0 03 00 00 85 d2 7e 02 <0f> 0b 65 48 8b 04 25 c0 7b 01 00 8b 80 b0 03 00 00 85 c0 7e 0a 41
Feb 09 20:14:52 kernel: RSP: 0000:ffffb6d0447537b0 EFLAGS: 00010002
Feb 09 20:14:52 kernel: RAX: ffff963d49935e80 RBX: ffff96449d7acf40 RCX: 0000000000000000
Feb 09 20:14:52 kernel: RDX: 0000000000000001 RSI: ffffffffbbe81fcb RDI: 0000000000000000
Feb 09 20:14:52 kernel: RBP: 0000000000000000 R08: 0000000000000001 R09: 0000000000000387
Feb 09 20:14:52 kernel: R10: abcc77118461cefd R11: ffffb6d044753a9f R12: 000000000002c180
Feb 09 20:14:52 kernel: R13: ffff963d49935e80 R14: ffff963d49935e80 R15: ffff96449d7ac180
Feb 09 20:14:52 kernel: FS:  0000000000000000(0000) GS:ffff96449d780000(0000) knlGS:0000000000000000
Feb 09 20:14:52 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 09 20:14:52 kernel: CR2: 00007fde3ef28860 CR3: 0000000194c10004 CR4: 00000000007706e0
Feb 09 20:14:52 kernel: PKRU: 55555554
Feb 09 20:14:52 kernel: Call Trace:
Feb 09 20:14:52 kernel:  __schedule+0xae/0x810
Feb 09 20:14:52 kernel:  schedule+0x5b/0xc0
Feb 09 20:14:52 kernel:  schedule_timeout+0x11c/0x160
Feb 09 20:14:52 kernel:  wait_for_completion+0x9e/0x100
Feb 09 20:14:52 kernel:  virt_efi_query_variable_info+0x141/0x150
Feb 09 20:14:52 kernel:  efi_query_variable_store+0x7b/0x1e0
Feb 09 20:14:52 kernel:  ? vsnprintf+0x6e/0x4f0
Feb 09 20:14:52 kernel:  efivar_entry_set_safe+0xbd/0x210
Feb 09 20:14:52 kernel:  efi_pstore_write+0x12d/0x1a0
Feb 09 20:14:52 kernel:  pstore_dump+0x1b3/0x2f0
Feb 09 20:14:52 kernel:  kmsg_dump+0xa8/0xd0
Feb 09 20:14:52 kernel:  oops_end+0x61/0xd0
Feb 09 20:14:52 kernel:  do_trap+0x8d/0x110
Feb 09 20:14:52 kernel:  ? __free_pages+0x83/0x90
Feb 09 20:14:52 kernel:  do_error_trap+0x64/0xa0
Feb 09 20:14:52 kernel:  ? __free_pages+0x83/0x90
Feb 09 20:14:52 kernel:  exc_invalid_op+0x4e/0x70
Feb 09 20:14:52 kernel:  ? __free_pages+0x83/0x90
Feb 09 20:14:52 kernel:  asm_exc_invalid_op+0x12/0x20
Feb 09 20:14:52 kernel: RIP: 0010:__free_pages+0x83/0x90
Feb 09 20:14:52 kernel: Code: 33 e4 ff ff eb d1 85 f6 75 09 5b 5d 41 5c e9 a4 fe ff ff 5b 31 d2 5d 41 5c e9 19 e4 ff ff 48 c7 c6 c0 f0 73 bc e8 cd 64 fd ff <0f> 0b 5b 5d 41 5c c3 66 0f 1f 44 00 00 0f 1f 44 00 00 48 85 ff 75
Feb 09 20:14:52 kernel: RSP: 0000:ffffb6d044753d60 EFLAGS: 00010286
Feb 09 20:14:52 kernel: RAX: 0000000000000000 RBX: ffff963d40dc4600 RCX: ffff96449d798bb8
Feb 09 20:14:52 kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff96449d798bb0
Feb 09 20:14:52 kernel: RBP: ffffe814c4ce2380 R08: 0000000000000000 R09: ffffb6d044753a60
Feb 09 20:14:52 kernel: R10: ffffb6d044753a58 R11: ffffffffbcacb228 R12: ffffe814c4ce2380
Feb 09 20:14:52 kernel: R13: ffffe814c4ce2388 R14: ffff963d7388e000 R15: ffffe814c4ce2388
Feb 09 20:14:52 kernel:  z3fold_zpool_shrink+0x146/0x720
Feb 09 20:14:52 kernel:  ? sched_clock+0x5/0x10
Feb 09 20:14:52 kernel:  ? sched_clock_cpu+0xc/0xb0
Feb 09 20:14:52 kernel:  shrink_worker+0x19/0x50
Feb 09 20:14:52 kernel:  process_one_work+0x1d6/0x3a0
Feb 09 20:14:52 kernel:  worker_thread+0x4d/0x3d0
Feb 09 20:14:52 kernel:  ? rescuer_thread+0x410/0x410
Feb 09 20:14:52 kernel:  kthread+0x133/0x150
Feb 09 20:14:52 kernel:  ? __kthread_bind_mask+0x60/0x60
Feb 09 20:14:52 kernel:  ret_from_fork+0x1f/0x30
Feb 09 20:14:52 kernel: ---[ end trace bf7896908c608ed5 ]---
Feb 09 20:14:52 kernel: RIP: 0010:__free_pages+0x83/0x90
Feb 09 20:14:52 kernel: Code: 33 e4 ff ff eb d1 85 f6 75 09 5b 5d 41 5c e9 a4 fe ff ff 5b 31 d2 5d 41 5c e9 19 e4 ff ff 48 c7 c6 c0 f0 73 bc e8 cd 64 fd ff <0f> 0b 5b 5d 41 5c c3 66 0f 1f 44 00 00 0f 1f 44 00 00 48 85 ff 75
Feb 09 20:14:52 kernel: RSP: 0000:ffffb6d044753d60 EFLAGS: 00010286
Feb 09 20:14:52 kernel: RAX: 0000000000000000 RBX: ffff963d40dc4600 RCX: ffff96449d798bb8
Feb 09 20:14:52 kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff96449d798bb0
Feb 09 20:14:52 kernel: RBP: ffffe814c4ce2380 R08: 0000000000000000 R09: ffffb6d044753a60
Feb 09 20:14:52 kernel: R10: ffffb6d044753a58 R11: ffffffffbcacb228 R12: ffffe814c4ce2380
Feb 09 20:14:52 kernel: R13: ffffe814c4ce2388 R14: ffff963d7388e000 R15: ffffe814c4ce2388
Feb 09 20:14:52 kernel: FS:  0000000000000000(0000) GS:ffff96449d780000(0000) knlGS:0000000000000000
Feb 09 20:14:52 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 09 20:14:52 kernel: CR2: 00007fde3d0dc000 CR3: 00000001581a8001 CR4: 00000000007706e0
Feb 09 20:14:52 kernel: PKRU: 55555554
Comment 6 Alois Nespor 2021-02-10 17:18:02 UTC
i am affected with this "kernel BUG at include/linux/mm.h:707!" bug too.
still happens with kernel 5.10.14-zen, Ryzen 3400G. 
I can't reset / shut down.  Unfortunately, it doesn't save me the output from dmesg. I'll add as soon as I can.
Comment 7 Stefan Kuhn 2021-02-12 15:06:17 UTC
still with 5.10.15. It froze the DE, yet I was able to log in on tty and enter a reboot. Sadly, my laptop wasn't able to properly shut down even after 3 hours.


-- Journal begins at Tue 2020-04-21 22:11:16 CEST, ends at Fri 2021-02-12 16:01:30 CET. --
Feb 12 13:05:09 kernel: page:000000000742d55c refcount:0 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x15b94b
Feb 12 13:05:09 kernel: flags: 0x2ffff0000000000()
Feb 12 13:05:09 kernel: raw: 02ffff0000000000 fffff71f0470a7c8 ffff900c31532170 0000000000000000
Feb 12 13:05:09 kernel: raw: 0000000000000000 0000000000000011 00000000ffffffff 0000000000000000
Feb 12 13:05:09 kernel: page dumped because: VM_BUG_ON_PAGE(page_ref_count(page) == 0)
Feb 12 13:05:09 kernel: ------------[ cut here ]------------
Feb 12 13:05:09 kernel: kernel BUG at include/linux/mm.h:707!
Feb 12 13:05:09 kernel: invalid opcode: 0000 [#1] PREEMPT SMP NOPTI
Feb 12 13:05:09 kernel: CPU: 4 PID: 15998 Comm: kworker/4:1 Not tainted 5.10.15-1-MANJARO #1
Feb 12 13:05:09 kernel: Hardware name: LENOVO 20NBCTO1WW/20NBCTO1WW, BIOS R0YET46W (1.29 ) 10/09/2020
Feb 12 13:05:09 kernel: Workqueue: zswap-shrink shrink_worker
Feb 12 13:05:09 kernel: RIP: 0010:__free_pages+0x83/0x90
Feb 12 13:05:09 kernel: Code: 33 e4 ff ff eb d1 85 f6 75 09 5b 5d 41 5c e9 a4 fe ff ff 5b 31 d2 5d 41 5c e9 19 e4 ff ff 48 c7 c6 60 f3 73 b8 e8 cd 64 fd ff <0f> 0b 5b 5d 41 5c c3 66 0f 1f 44 00 00 0f 1f 44 00 00 48 85 ff 75
Feb 12 13:05:09 kernel: RSP: 0000:ffffb0a347d3bd60 EFLAGS: 00010286
Feb 12 13:05:09 kernel: RAX: 0000000000000000 RBX: ffff9008c4775d80 RCX: ffff900c31518bb8
Feb 12 13:05:09 kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff900c31518bb0
Feb 12 13:05:09 kernel: RBP: fffff71f056e52c0 R08: 0000000000000000 R09: ffffb0a347d3ba60
Feb 12 13:05:09 kernel: R10: ffffb0a347d3ba58 R11: ffffffffb8acb228 R12: fffff71f056e52c0
Feb 12 13:05:09 kernel: R13: fffff71f056e52c8 R14: ffff90091b94b000 R15: fffff71f056e52c8
Feb 12 13:05:09 kernel: FS:  0000000000000000(0000) GS:ffff900c31500000(0000) knlGS:0000000000000000
Feb 12 13:05:09 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 12 13:05:09 kernel: CR2: 00007f1b1d19332c CR3: 000000028a210003 CR4: 00000000003706e0
Feb 12 13:05:09 kernel: Call Trace:
Feb 12 13:05:09 kernel:  z3fold_zpool_shrink+0x146/0x720
Feb 12 13:05:09 kernel:  ? sched_clock+0x5/0x10
Feb 12 13:05:09 kernel:  ? sched_clock_cpu+0xc/0xb0
Feb 12 13:05:09 kernel:  shrink_worker+0x19/0x50
Feb 12 13:05:09 kernel:  process_one_work+0x1d6/0x3a0
Feb 12 13:05:09 kernel:  worker_thread+0x4d/0x3d0
Feb 12 13:05:09 kernel:  ? rescuer_thread+0x3c0/0x3c0
Feb 12 13:05:09 kernel:  kthread+0x133/0x150
Feb 12 13:05:09 kernel:  ? __kthread_bind_mask+0x60/0x60
Feb 12 13:05:09 kernel:  ret_from_fork+0x1f/0x30
Feb 12 13:05:09 kernel: Modules linked in: rfcomm ccm snd_hda_codec_hdmi snd_hda_codec_conexant snd_hda_codec_generic cmac algif_hash algif_skcipher af_alg bnep squashfs joydev mousedev vfat fat snd_sof_pci snd_sof_intel_byt snd_sof_intel_ipc loop snd_sof_intel_hda_common snd_soc_hdac_hda snd_sof_xtensa_dsp snd_sof_intel_hda snd_sof snd_soc_skl x86_pkg_temp_thermal intel_powerclamp snd_soc_sst_ipc coretemp snd_soc_sst_dsp iTCO_wdt snd_hda_ext_core intel_pmc_bxt mei_hdcp intel_rapl_msr ee1004 iTCO_vendor_support snd_soc_acpi_intel_match iwlmvm wmi_bmof snd_soc_acpi intel_wmi_thunderbolt kvm_intel snd_hda_intel snd_intel_dspcfg mac80211 uvcvideo soundwire_intel kvm soundwire_generic_allocation soundwire_cadence btusb videobuf2_vmalloc videobuf2_memops btrtl videobuf2_v4l2 btbcm libarc4 snd_hda_codec irqbypass btintel videobuf2_common rapl snd_hda_core intel_cstate i915 intel_uncore bluetooth iwlwifi videodev snd_hwdep soundwire_bus ecdh_generic psmouse ecc snd_soc_core mc cfg80211 thinkpad_acpi
Feb 12 13:05:09 kernel:  snd_compress ac97_bus ledtrig_audio i2c_algo_bit snd_pcm_dmaengine drm_kms_helper snd_pcm rfkill cec snd_timer intel_gtt syscopyarea snd sysfillrect processor_thermal_device sysimgblt mei_me intel_rapl_common int3403_thermal fb_sys_fops soundcore i2c_i801 mei int340x_thermal_zone video i2c_smbus intel_soc_dts_iosf intel_pch_thermal int3400_thermal ucsi_acpi typec_ucsi tpm_crb acpi_thermal_rel mac_hid typec tpm_tis tpm_tis_core wmi acpi_pad drm uinput sg fuse crypto_user agpgart ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 dm_crypt cbc encrypted_keys dm_mod trusted tpm rng_core crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd sdhci_pci cryptd cqhci glue_helper sdhci serio_raw mmc_core xhci_pci
Feb 12 13:05:09 kernel: ---[ end trace 066bd4a6c1c4050d ]---
Feb 12 13:05:09 kernel: ------------[ cut here ]------------
Feb 12 13:05:09 kernel: WARNING: CPU: 4 PID: 15998 at kernel/rcu/tree_plugin.h:297 rcu_note_context_switch+0x42/0x480
Feb 12 13:05:09 kernel: Modules linked in: rfcomm ccm snd_hda_codec_hdmi snd_hda_codec_conexant snd_hda_codec_generic cmac algif_hash algif_skcipher af_alg bnep squashfs joydev mousedev vfat fat snd_sof_pci snd_sof_intel_byt snd_sof_intel_ipc loop snd_sof_intel_hda_common snd_soc_hdac_hda snd_sof_xtensa_dsp snd_sof_intel_hda snd_sof snd_soc_skl x86_pkg_temp_thermal intel_powerclamp snd_soc_sst_ipc coretemp snd_soc_sst_dsp iTCO_wdt snd_hda_ext_core intel_pmc_bxt mei_hdcp intel_rapl_msr ee1004 iTCO_vendor_support snd_soc_acpi_intel_match iwlmvm wmi_bmof snd_soc_acpi intel_wmi_thunderbolt kvm_intel snd_hda_intel snd_intel_dspcfg mac80211 uvcvideo soundwire_intel kvm soundwire_generic_allocation soundwire_cadence btusb videobuf2_vmalloc videobuf2_memops btrtl videobuf2_v4l2 btbcm libarc4 snd_hda_codec irqbypass btintel videobuf2_common rapl snd_hda_core intel_cstate i915 intel_uncore bluetooth iwlwifi videodev snd_hwdep soundwire_bus ecdh_generic psmouse ecc snd_soc_core mc cfg80211 thinkpad_acpi
Feb 12 13:05:09 kernel:  snd_compress ac97_bus ledtrig_audio i2c_algo_bit snd_pcm_dmaengine drm_kms_helper snd_pcm rfkill cec snd_timer intel_gtt syscopyarea snd sysfillrect processor_thermal_device sysimgblt mei_me intel_rapl_common int3403_thermal fb_sys_fops soundcore i2c_i801 mei int340x_thermal_zone video i2c_smbus intel_soc_dts_iosf intel_pch_thermal int3400_thermal ucsi_acpi typec_ucsi tpm_crb acpi_thermal_rel mac_hid typec tpm_tis tpm_tis_core wmi acpi_pad drm uinput sg fuse crypto_user agpgart ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 dm_crypt cbc encrypted_keys dm_mod trusted tpm rng_core crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel crypto_simd sdhci_pci cryptd cqhci glue_helper sdhci serio_raw mmc_core xhci_pci
Feb 12 13:05:09 kernel: CPU: 4 PID: 15998 Comm: kworker/4:1 Tainted: G      D           5.10.15-1-MANJARO #1
Feb 12 13:05:09 kernel: Hardware name: LENOVO 20NBCTO1WW/20NBCTO1WW, BIOS R0YET46W (1.29 ) 10/09/2020
Feb 12 13:05:09 kernel: Workqueue: zswap-shrink shrink_worker
Feb 12 13:05:09 kernel: RIP: 0010:rcu_note_context_switch+0x42/0x480
Feb 12 13:05:09 kernel: Code: c3 40 cf 02 00 65 48 03 1d 53 ef af 48 0f 1f 44 00 00 40 84 ed 75 15 65 48 8b 04 25 c0 7b 01 00 8b 90 b0 03 00 00 85 d2 7e 02 <0f> 0b 65 48 8b 04 25 c0 7b 01 00 8b 80 b0 03 00 00 85 c0 7e 0a 41
Feb 12 13:05:09 kernel: RSP: 0000:ffffb0a347d3b7b0 EFLAGS: 00010002
Feb 12 13:05:09 kernel: RAX: ffff9009e1d7ddc0 RBX: ffff900c3152cf40 RCX: 0000000000000000
Feb 12 13:05:09 kernel: RDX: 0000000000000001 RSI: ffffffffb7e82f3b RDI: 0000000000000000
Feb 12 13:05:09 kernel: RBP: 0000000000000000 R08: 0000000000000001 R09: 0000000000000348
Feb 12 13:05:09 kernel: R10: abcc77118461cefd R11: ffffb0a347d3ba9f R12: 000000000002c180
Feb 12 13:05:09 kernel: R13: ffff9009e1d7ddc0 R14: ffff9009e1d7ddc0 R15: ffff900c3152c180
Feb 12 13:05:09 kernel: FS:  0000000000000000(0000) GS:ffff900c31500000(0000) knlGS:0000000000000000
Feb 12 13:05:09 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 12 13:05:09 kernel: CR2: 00007ffdd4cb73d8 CR3: 0000000102d6a003 CR4: 00000000003706e0
Feb 12 13:05:09 kernel: Call Trace:
Feb 12 13:05:09 kernel:  __schedule+0xae/0x810
Feb 12 13:05:09 kernel:  schedule+0x5b/0xc0
Feb 12 13:05:09 kernel:  schedule_timeout+0x11c/0x160
Feb 12 13:05:09 kernel:  wait_for_completion+0x9e/0x100
Feb 12 13:05:09 kernel:  virt_efi_query_variable_info+0x141/0x150
Feb 12 13:05:09 kernel:  efi_query_variable_store+0x7b/0x1e0
Feb 12 13:05:09 kernel:  ? vsnprintf+0x6e/0x4f0
Feb 12 13:05:09 kernel:  efivar_entry_set_safe+0xbd/0x210
Feb 12 13:05:09 kernel:  efi_pstore_write+0x12d/0x1a0
Feb 12 13:05:09 kernel:  pstore_dump+0x1b3/0x2f0
Feb 12 13:05:09 kernel:  kmsg_dump+0xa8/0xd0
Feb 12 13:05:09 kernel:  oops_end+0x61/0xd0
Feb 12 13:05:09 kernel:  do_trap+0x8d/0x110
Feb 12 13:05:09 kernel:  ? __free_pages+0x83/0x90
Feb 12 13:05:09 kernel:  do_error_trap+0x64/0xa0
Feb 12 13:05:09 kernel:  ? __free_pages+0x83/0x90
Feb 12 13:05:09 kernel:  exc_invalid_op+0x4e/0x70
Feb 12 13:05:09 kernel:  ? __free_pages+0x83/0x90
Feb 12 13:05:09 kernel:  asm_exc_invalid_op+0x12/0x20
Feb 12 13:05:09 kernel: RIP: 0010:__free_pages+0x83/0x90
Feb 12 13:05:09 kernel: Code: 33 e4 ff ff eb d1 85 f6 75 09 5b 5d 41 5c e9 a4 fe ff ff 5b 31 d2 5d 41 5c e9 19 e4 ff ff 48 c7 c6 60 f3 73 b8 e8 cd 64 fd ff <0f> 0b 5b 5d 41 5c c3 66 0f 1f 44 00 00 0f 1f 44 00 00 48 85 ff 75
Feb 12 13:05:09 kernel: RSP: 0000:ffffb0a347d3bd60 EFLAGS: 00010286
Feb 12 13:05:09 kernel: RAX: 0000000000000000 RBX: ffff9008c4775d80 RCX: ffff900c31518bb8
Feb 12 13:05:09 kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff900c31518bb0
Feb 12 13:05:09 kernel: RBP: fffff71f056e52c0 R08: 0000000000000000 R09: ffffb0a347d3ba60
Feb 12 13:05:09 kernel: R10: ffffb0a347d3ba58 R11: ffffffffb8acb228 R12: fffff71f056e52c0
Feb 12 13:05:09 kernel: R13: fffff71f056e52c8 R14: ffff90091b94b000 R15: fffff71f056e52c8
Feb 12 13:05:09 kernel:  z3fold_zpool_shrink+0x146/0x720
Feb 12 13:05:09 kernel:  ? sched_clock+0x5/0x10
Feb 12 13:05:09 kernel:  ? sched_clock_cpu+0xc/0xb0
Feb 12 13:05:09 kernel:  shrink_worker+0x19/0x50
Feb 12 13:05:09 kernel:  process_one_work+0x1d6/0x3a0
Feb 12 13:05:09 kernel:  worker_thread+0x4d/0x3d0
Feb 12 13:05:09 kernel:  ? rescuer_thread+0x3c0/0x3c0
Feb 12 13:05:09 kernel:  kthread+0x133/0x150
Feb 12 13:05:09 kernel:  ? __kthread_bind_mask+0x60/0x60
Feb 12 13:05:09 kernel:  ret_from_fork+0x1f/0x30
Feb 12 13:05:09 kernel: ---[ end trace 066bd4a6c1c4050e ]---
Feb 12 13:05:09 kernel: RIP: 0010:__free_pages+0x83/0x90
Feb 12 13:05:09 kernel: Code: 33 e4 ff ff eb d1 85 f6 75 09 5b 5d 41 5c e9 a4 fe ff ff 5b 31 d2 5d 41 5c e9 19 e4 ff ff 48 c7 c6 60 f3 73 b8 e8 cd 64 fd ff <0f> 0b 5b 5d 41 5c c3 66 0f 1f 44 00 00 0f 1f 44 00 00 48 85 ff 75
Feb 12 13:05:09 kernel: RSP: 0000:ffffb0a347d3bd60 EFLAGS: 00010286
Feb 12 13:05:09 kernel: RAX: 0000000000000000 RBX: ffff9008c4775d80 RCX: ffff900c31518bb8
Feb 12 13:05:09 kernel: RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff900c31518bb0
Feb 12 13:05:09 kernel: RBP: fffff71f056e52c0 R08: 0000000000000000 R09: ffffb0a347d3ba60
Feb 12 13:05:09 kernel: R10: ffffb0a347d3ba58 R11: ffffffffb8acb228 R12: fffff71f056e52c0
Feb 12 13:05:09 kernel: R13: fffff71f056e52c8 R14: ffff90091b94b000 R15: fffff71f056e52c8
Feb 12 13:05:09 kernel: FS:  0000000000000000(0000) GS:ffff900c31500000(0000) knlGS:0000000000000000
Feb 12 13:05:09 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb 12 13:05:09 kernel: CR2: 00007fc84eef7631 CR3: 00000002c8f90001 CR4: 00000000003706e0
Feb 12 13:05:22 rtkit-daemon[994]: Supervising 5 threads of 2 processes of 1 users.
Comment 8 Stefan Kuhn 2021-02-16 08:34:32 UTC
I'll start to bisect the kernel versions to find the culprit. 
`5.4.95-1-MANJARO` is stable: the bug doesn't occur.
Comment 9 Stefan Kuhn 2021-03-05 12:48:45 UTC
Still buggy for 5.11.1


Mär 05 13:37:11 kernel: page:0000000049456552 refcount:0 mapcount:0 mapping:0000000000000000 index:0x0 pfn:0x11aa43
Mär 05 13:37:11 kernel: flags: 0x2ffff0000000000()
Mär 05 13:37:11 kernel: raw: 02ffff0000000000 ffffe2f38e825f08 ffff8f98b15b2170 0000000000000000
Mär 05 13:37:11 kernel: raw: 0000000000000000 0000000000000011 00000000ffffffff 0000000000000000
Mär 05 13:37:11 kernel: page dumped because: VM_BUG_ON_PAGE(page_ref_count(page) == 0)
Mär 05 13:37:11 kernel: ------------[ cut here ]------------
Mär 05 13:37:11 kernel: kernel BUG at include/linux/mm.h:723!
Mär 05 13:37:11 kernel: invalid opcode: 0000 [#1] PREEMPT SMP NOPTI
Mär 05 13:37:11 kernel: CPU: 6 PID: 146 Comm: kworker/6:1 Not tainted 5.11.1-1-MANJARO #1
Mär 05 13:37:11 kernel: Hardware name: LENOVO 20NBCTO1WW/20NBCTO1WW, BIOS R0YET46W (1.29 ) 10/09/2020
Mär 05 13:37:11 kernel: Workqueue: zswap-shrink shrink_worker
Mär 05 13:37:11 kernel: RIP: 0010:__free_pages+0x83/0x90
Mär 05 13:37:11 kernel: Code: f3 e4 ff ff eb d1 85 f6 75 09 5b 5d 41 5c e9 a4 fe ff ff 5b 31 d2 5d 41 5c e9 d9 e4 ff ff 48 c7 c6 08 34 b7 a1 e8 1d 4b fd ff <0f> 0b 5b 5d 41 5c c3 66 0f 1f 44 00 00 0f 1f 44 00 00 48 85 ff 75
lines 4334-4390/4390 (END)
Comment 10 Milan Plzik 2021-03-15 07:37:59 UTC
Note that the hardware for this bug is misfiles, which might be lowering its visibility. This is not an IA-64 issue, but rather a x86-64 one. There doesn't seem to be a way to change this, though.
Comment 11 Stefan Kuhn 2021-03-30 13:13:53 UTC
`5.9.16-1-MANJARO` is also fine, so the problem seems to have started with 5.10