Bug 213813

Summary: general protection fault, probably for non-canonical address - snd_soc_tplg_component_remove and rcu_note_context_switch
Product: Drivers Reporter: sander44 (ionut_n2001)
Component: IOMMUAssignee: drivers_iommu
Status: NEW ---    
Severity: blocking CC: agurenko
Priority: P1    
Hardware: x86-64   
OS: Linux   
Kernel Version: 5.13.4 Subsystem:
Regression: No Bisected commit-id:
Attachments: dmesg with bug

Description sander44 2021-07-21 14:46:09 UTC
Created attachment 297985 [details]
dmesg with bug

Hi Kernel Team,

I notice this:

[  267.928616] general protection fault, probably for non-canonical address 0xed84e71d57f738be: 0000 [#1] PREEMPT SMP NOPTI
[  267.928632] CPU: 1 PID: 2995 Comm: TC2.2.8.sh Not tainted 5.13.4-1-vanilla #1
[  267.928637] Hardware name: OnLogic RXM-181/RXM-181, BIOS Z01-0001A031 05/11/2020
[  267.928641] RIP: 0010:snd_soc_tplg_component_remove+0x8f/0x290 [snd_soc_core]
[  267.928673] Code: 00 49 8b 44 24 10 49 8b 54 24 08 48 89 42 08 48 89 10 48 b8 00 01 00 00 00 00 ad de 49 89 44 24 08 48 83 c0 22 49 89 44 24 10 <49> 8b 47 08 49 8d 57 08 4d 89 fc 48 83 e8 08 48 39 d3 0f 84 b4 01
[  267.928681] RSP: 0018:ffffa84780ab7ae0 EFLAGS: 00010206
[  267.928686] RAX: ed84e71d57f738b6 RBX: ffff9ca2d41f00c8 RCX: ffffffffc0dda598
[  267.928690] RDX: 0000000000000004 RSI: 0000000000000001 RDI: ffff9ca2d41f0018
[  267.928694] RBP: ffffa84780ab7b18 R08: 0000000000000000 R09: ffffa84780ab7a50
[  267.928697] R10: 0000000000000000 R11: ffff9ca2c70363c8 R12: ffff9ca2c2e6f8b8
[  267.928701] R13: ffff9ca2d41f0018 R14: 0000000000000008 R15: ed84e71d57f738b6
[  267.928704] FS:  00007fac8e07a740(0000) GS:ffff9caa0c440000(0000) knlGS:0000000000000000
[  267.928709] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  267.928713] CR2: 000055e91e312750 CR3: 0000000106cf6002 CR4: 00000000003706e0
[  267.928717] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  267.928720] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  267.928724] Call Trace:
[  267.928730]  sof_pcm_remove+0xe/0x10 [snd_sof]
[  267.928743]  snd_soc_component_remove+0x18/0x20 [snd_soc_core]
[  267.928771]  soc_remove_component+0x96/0xa0 [snd_soc_core]
[  267.928793]  soc_cleanup_card_resources+0xbb/0x2e0 [snd_soc_core]
[  267.928815]  snd_soc_del_component_unlocked+0x134/0x170 [snd_soc_core]
[  267.928837]  snd_soc_unregister_component_by_driver+0x3b/0x50 [snd_soc_core]
[  267.928859]  devm_component_release+0x11/0x20 [snd_soc_core]
[  267.928887]  release_nodes+0x1a4/0x1f0
[  267.928896]  devres_release_group+0x7c/0xb0
[  267.928901]  component_del+0xb0/0x140
[  267.928907]  intel_audio_deinit+0x3c/0x50 [i915]
[  267.929031]  intel_display_driver_unregister+0x25/0x50 [i915]
[  267.929144]  i915_driver_remove+0x43/0x100 [i915]
[  267.929217]  i915_pci_remove+0x1e/0x30 [i915]
[  267.929290]  pci_device_remove+0x3b/0xb0
[  267.929296]  __device_release_driver+0x17e/0x240
[  267.929302]  device_driver_detach+0x41/0xa0
[  267.929307]  unbind_store+0x11e/0x130
[  267.929312]  drv_attr_store+0x21/0x30
[  267.929316]  sysfs_kf_write+0x3c/0x50
[  267.929321]  kernfs_fop_write_iter+0x138/0x1d0
[  267.929327]  new_sync_write+0x111/0x1a0
[  267.929333]  vfs_write+0x1eb/0x280
[  267.929338]  ksys_write+0x67/0xe0
[  267.929343]  __x64_sys_write+0x1a/0x20
[  267.929348]  do_syscall_64+0x40/0xb0
[  267.929353]  entry_SYSCALL_64_after_hwframe+0x44/0xae
[  267.929360] RIP: 0033:0x7fac8e167644
[  267.929365] Code: 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b3 0f 1f 80 00 00 00 00 48 8d 05 b9 60 0d 00 8b 00 85 c0 75 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 41 54 49 89 d4 55 48 89 f5 53
[  267.929372] RSP: 002b:00007fff458bbd88 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[  267.929378] RAX: ffffffffffffffda RBX: 000000000000000c RCX: 00007fac8e167644
[  267.929382] RDX: 000000000000000c RSI: 0000000001172a10 RDI: 0000000000000001
[  267.929385] RBP: 0000000001172a10 R08: 00007fac8e23a8c0 R09: 00007fac8e07a740
[  267.929389] R10: 0000000000000000 R11: 0000000000000246 R12: 00007fac8e239760
[  267.929392] R13: 000000000000000c R14: 00007fac8e234760 R15: 000000000000000c
[  267.929398] Modules linked in: pci_stub bluetooth ecdh_generic ecc cfg80211 st input_leds joydev snd_soc_skl_hda_dsp snd_soc_hdac_hdmi snd_hda_codec_hdmi snd_soc_dmic snd_hda_codec_realtek intel_rapl_msr snd_hda_codec_generic nls_iso8859_1 mei_hdcp snd_sof_pci_intel_cnl snd_sof_intel_hda_common snd_soc_hdac_hda soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci intel_rapl_common snd_sof_xtensa_dsp x86_pkg_temp_thermal intel_powerclamp coretemp snd_sof kvm_intel snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi soundwire_bus cdc_acm ledtrig_audio snd_soc_core kvm snd_compress ac97_bus snd_pcm_dmaengine crct10dif_pclmul snd_hda_intel snd_intel_dspcfg crc32_pclmul snd_intel_sdw_acpi ghash_clmulni_intel aesni_intel snd_hda_codec crypto_simd cryptd rapl intel_cstate snd_hda_core snd_hwdep snd_pcm snd_timer pcspkr wmi_bmof intel_wmi_thunderbolt mei_me efi_pstore snd soundcore 8250_dw ee1004 mei intel_pch_thermal mac_hid acpi_pad acpi_tad
[  267.929453]  binfmt_misc ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress usbkbd raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear hid_generic usbmouse usbhid hid i915 drm_kms_helper syscopyarea sysfillrect e1000e sysimgblt igb fb_sys_fops ahci cec xhci_pci intel_lpss_pci nvme intel_lpss rc_core idma64 i2c_i801 dca drm i2c_smbus nvme_core i2c_algo_bit libahci xhci_pci_renesas virt_dma wmi video pinctrl_cannonlake
[  267.929525] fbcon: Taking over console
[  267.929551] ---[ end trace a6964a558f492176 ]---
[  267.930799] ------------[ cut here ]------------
[  267.930802] WARNING: CPU: 1 PID: 2995 at kernel/rcu/tree_plugin.h:359 rcu_note_context_switch+0x48/0x470
[  267.930814] Modules linked in: pci_stub bluetooth ecdh_generic ecc cfg80211 st input_leds joydev snd_soc_skl_hda_dsp snd_soc_hdac_hdmi snd_hda_codec_hdmi snd_soc_dmic snd_hda_codec_realtek intel_rapl_msr snd_hda_codec_generic nls_iso8859_1 mei_hdcp snd_sof_pci_intel_cnl snd_sof_intel_hda_common snd_soc_hdac_hda soundwire_intel soundwire_generic_allocation soundwire_cadence snd_sof_intel_hda snd_sof_pci intel_rapl_common snd_sof_xtensa_dsp x86_pkg_temp_thermal intel_powerclamp coretemp snd_sof kvm_intel snd_hda_ext_core snd_soc_acpi_intel_match snd_soc_acpi soundwire_bus cdc_acm ledtrig_audio snd_soc_core kvm snd_compress ac97_bus snd_pcm_dmaengine crct10dif_pclmul snd_hda_intel snd_intel_dspcfg crc32_pclmul snd_intel_sdw_acpi ghash_clmulni_intel aesni_intel snd_hda_codec crypto_simd cryptd rapl intel_cstate snd_hda_core snd_hwdep snd_pcm snd_timer pcspkr wmi_bmof intel_wmi_thunderbolt mei_me efi_pstore snd soundcore 8250_dw ee1004 mei intel_pch_thermal mac_hid acpi_pad acpi_tad
[  267.930863]  binfmt_misc ip_tables x_tables autofs4 btrfs blake2b_generic zstd_compress usbkbd raid10 raid456 async_raid6_recov async_memcpy async_pq async_xor async_tx xor raid6_pq libcrc32c raid1 raid0 multipath linear hid_generic usbmouse usbhid hid i915 drm_kms_helper syscopyarea sysfillrect e1000e sysimgblt igb fb_sys_fops ahci cec xhci_pci intel_lpss_pci nvme intel_lpss rc_core idma64 i2c_i801 dca drm i2c_smbus nvme_core i2c_algo_bit libahci xhci_pci_renesas virt_dma wmi video pinctrl_cannonlake
[  267.930936] CPU: 1 PID: 2995 Comm: TC2.2.8.sh Tainted: G      D           5.13.4-1-vanilla #1
[  267.930942] Hardware name: OnLogic RXM-181/RXM-181, BIOS Z01-0001A031 05/11/2020
[  267.930946] RIP: 0010:rcu_note_context_switch+0x48/0x470
[  267.930955] Code: c3 00 de 02 00 65 48 03 1d 95 15 4e 62 0f 1f 44 00 00 45 84 e4 75 15 65 48 8b 04 25 c0 7b 01 00 8b 90 d4 07 00 00 85 d2 7e 02 <0f> 0b 65 48 8b 04 25 c0 7b 01 00 8b 80 d4 07 00 00 85 c0 7e 0a 41
[  267.930963] RSP: 0018:ffffa84780ab74c0 EFLAGS: 00010002
[  267.930968] RAX: ffff9ca2c52bd000 RBX: ffff9caa0c46de00 RCX: 0000000000000000
[  267.930972] RDX: 0000000000000001 RSI: ffffffff9e645d58 RDI: 0000000000000000
[  267.930977] RBP: ffffa84780ab74e0 R08: 0000000000000001 R09: ffff9caa0c42d170
[  267.930981] R10: 0000000000000001 R11: 0000000000000370 R12: 0000000000000000
[  267.930984] R13: 000000000002d040 R14: ffff9ca2c52bd000 R15: ffff9ca2e138e000
[  267.930989] FS:  00007fac8e07a740(0000) GS:ffff9caa0c440000(0000) knlGS:0000000000000000
[  267.930995] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  267.930999] CR2: 000055e91e312750 CR3: 0000000106cf6002 CR4: 00000000003706e0
[  267.931003] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  267.931007] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  267.931012] Call Trace:
[  267.931016]  __schedule+0xbb/0x920
[  267.931023]  ? ttwu_do_activate+0x6e/0xc0
[  267.931030]  schedule+0x68/0xe0
[  267.931036]  schedule_timeout+0x117/0x160
[  267.931042]  ? wake_up_process+0x15/0x20
[  267.931047]  ? insert_work+0x71/0x80
[  267.931054]  wait_for_completion+0x8b/0xf0
[  267.931061]  virt_efi_query_variable_info+0x146/0x160
[  267.931068]  efi_query_variable_store+0x50/0x180
[  267.931073]  ? vsnprintf+0x71/0x4e0
[  267.931081]  efivar_entry_set_safe+0xba/0x210
[  267.931091]  efi_pstore_write+0x106/0x190 [efi_pstore]
[  267.931099]  pstore_dump+0x113/0x340
[  267.931107]  kmsg_dump+0x4b/0x70
[  267.931114]  oops_exit+0x56/0x60
[  267.931118]  oops_end+0x61/0xd0
[  267.931125]  die_addr+0x51/0x60
[  267.931130]  exc_general_protection+0x1d7/0x3b0
[  267.931137]  asm_exc_general_protection+0x1e/0x30
[  267.931144] RIP: 0010:snd_soc_tplg_component_remove+0x8f/0x290 [snd_soc_core]
[  267.931178] Code: 00 49 8b 44 24 10 49 8b 54 24 08 48 89 42 08 48 89 10 48 b8 00 01 00 00 00 00 ad de 49 89 44 24 08 48 83 c0 22 49 89 44 24 10 <49> 8b 47 08 49 8d 57 08 4d 89 fc 48 83 e8 08 48 39 d3 0f 84 b4 01
[  267.931186] RSP: 0018:ffffa84780ab7ae0 EFLAGS: 00010206
[  267.931190] RAX: ed84e71d57f738b6 RBX: ffff9ca2d41f00c8 RCX: ffffffffc0dda598
[  267.931194] RDX: 0000000000000004 RSI: 0000000000000001 RDI: ffff9ca2d41f0018
[  267.931197] RBP: ffffa84780ab7b18 R08: 0000000000000000 R09: ffffa84780ab7a50
[  267.931200] R10: 0000000000000000 R11: ffff9ca2c70363c8 R12: ffff9ca2c2e6f8b8
[  267.931204] R13: ffff9ca2d41f0018 R14: 0000000000000008 R15: ed84e71d57f738b6
[  267.931211]  sof_pcm_remove+0xe/0x10 [snd_sof]
[  267.931225]  snd_soc_component_remove+0x18/0x20 [snd_soc_core]
[  267.931255]  soc_remove_component+0x96/0xa0 [snd_soc_core]
[  267.931279]  soc_cleanup_card_resources+0xbb/0x2e0 [snd_soc_core]
[  267.931304]  snd_soc_del_component_unlocked+0x134/0x170 [snd_soc_core]
[  267.931333]  snd_soc_unregister_component_by_driver+0x3b/0x50 [snd_soc_core]
[  267.931360]  devm_component_release+0x11/0x20 [snd_soc_core]
[  267.931387]  release_nodes+0x1a4/0x1f0
[  267.931395]  devres_release_group+0x7c/0xb0
[  267.931399]  component_del+0xb0/0x140
[  267.931405]  intel_audio_deinit+0x3c/0x50 [i915]
[  267.931524]  intel_display_driver_unregister+0x25/0x50 [i915]
[  267.931635]  i915_driver_remove+0x43/0x100 [i915]
[  267.931707]  i915_pci_remove+0x1e/0x30 [i915]
[  267.931781]  pci_device_remove+0x3b/0xb0
[  267.931786]  __device_release_driver+0x17e/0x240
[  267.931792]  device_driver_detach+0x41/0xa0
[  267.931797]  unbind_store+0x11e/0x130
[  267.931802]  drv_attr_store+0x21/0x30
[  267.931805]  sysfs_kf_write+0x3c/0x50
[  267.931811]  kernfs_fop_write_iter+0x138/0x1d0
[  267.931816]  new_sync_write+0x111/0x1a0
[  267.931822]  vfs_write+0x1eb/0x280
[  267.931827]  ksys_write+0x67/0xe0
[  267.931832]  __x64_sys_write+0x1a/0x20
[  267.931836]  do_syscall_64+0x40/0xb0
[  267.931840]  entry_SYSCALL_64_after_hwframe+0x44/0xae
[  267.931847] RIP: 0033:0x7fac8e167644
[  267.931850] Code: 00 f7 d8 64 89 02 48 c7 c0 ff ff ff ff eb b3 0f 1f 80 00 00 00 00 48 8d 05 b9 60 0d 00 8b 00 85 c0 75 13 b8 01 00 00 00 0f 05 <48> 3d 00 f0 ff ff 77 54 c3 0f 1f 00 41 54 49 89 d4 55 48 89 f5 53
[  267.931857] RSP: 002b:00007fff458bbd88 EFLAGS: 00000246 ORIG_RAX: 0000000000000001
[  267.931863] RAX: ffffffffffffffda RBX: 000000000000000c RCX: 00007fac8e167644
[  267.931866] RDX: 000000000000000c RSI: 0000000001172a10 RDI: 0000000000000001
[  267.931870] RBP: 0000000001172a10 R08: 00007fac8e23a8c0 R09: 00007fac8e07a740
[  267.931873] R10: 0000000000000000 R11: 0000000000000246 R12: 00007fac8e239760
[  267.931877] R13: 000000000000000c R14: 00007fac8e234760 R15: 000000000000000c
[  267.931882] ---[ end trace a6964a558f492177 ]---
[  319.236482] RIP: 0010:snd_soc_tplg_component_remove+0x8f/0x290 [snd_soc_core]
[  319.236563] Code: 00 49 8b 44 24 10 49 8b 54 24 08 48 89 42 08 48 89 10 48 b8 00 01 00 00 00 00 ad de 49 89 44 24 08 48 83 c0 22 49 89 44 24 10 <49> 8b 47 08 49 8d 57 08 4d 89 fc 48 83 e8 08 48 39 d3 0f 84 b4 01
[  319.236573] RSP: 0018:ffffa84780ab7ae0 EFLAGS: 00010206
[  319.236579] RAX: ed84e71d57f738b6 RBX: ffff9ca2d41f00c8 RCX: ffffffffc0dda598
[  319.236583] RDX: 0000000000000004 RSI: 0000000000000001 RDI: ffff9ca2d41f0018
[  319.236588] RBP: ffffa84780ab7b18 R08: 0000000000000000 R09: ffffa84780ab7a50
[  319.236592] R10: 0000000000000000 R11: ffff9ca2c70363c8 R12: ffff9ca2c2e6f8b8
[  319.236596] R13: ffff9ca2d41f0018 R14: 0000000000000008 R15: ed84e71d57f738b6
[  319.236600] FS:  00007fac8e07a740(0000) GS:ffff9caa0c400000(0000) knlGS:0000000000000000
[  319.236605] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  319.236609] CR2: 00007fff71e0a000 CR3: 0000000106cf6002 CR4: 00000000003706f0
[  319.236614] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  319.236618] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400

Step for reproduce:
BOOT_IMAGE=/vmlinuz-5.13.4-1-vanilla root=UUID=ac59dee0-90a4-4d34-8518-acbb7c28b7cc ro splash quiet intel_iommu=on

Step 1.
# systemctl stop lightdm
# modprobe pci-stub
# CONTROLLER="00:02.0"
# VENDOR=$(lspci -s $CONTROLLER -nD  | awk '{ print $3 }' | cut -d: -f1)
# PRODUCT=$(lspci -s $CONTROLLER -nD  | awk '{ print $3 }' | cut -d: -f2)
# KERNEL_MODULE=$(lspci -s $CONTROLLER -k | grep Kernel | head -2 | tail -1 | cut -d: -f2 | cut -d' ' -f2)
# echo -n "$VENDOR $PRODUCT" > /sys/bus/pci/drivers/pci-stub/new_id
# echo -n "0000:$CONTROLLER" > /sys/bus/pci/devices/0000:$CONTROLLER/driver/unbind
Segmentation fault

Step 2.
#echo 1 > /sys/bus/pci/devices/0000:$CONTROLLER/remove
This command does not complete, it remains stuck in the terminal.

I try to reboot with the "reboot" command, but it doesn't work, the system gets stuck in an incomplete state.