Bug 215117
Summary: | ucsi_acpi: kernel NULL pointer dereference | ||
---|---|---|---|
Product: | Drivers | Reporter: | Chris Hixon (linux-kernel-bugs) |
Component: | USB | Assignee: | Default virtual assignee for Drivers/USB (drivers_usb) |
Status: | NEW --- | ||
Severity: | normal | CC: | heikki.krogerus, linux-kernel-bugs, manuel.viet0, regressions |
Priority: | P1 | ||
Hardware: | x86-64 | ||
OS: | Linux | ||
Kernel Version: | 5.16-rc2 | Subsystem: | |
Regression: | Yes | Bisected commit-id: | |
Attachments: |
journal and lshw
fix proposal |
FWIW, sadly lot's of bugs filed in bugzilla.kernel.org don't reach the responsible developers. This seems to be the case here. You thus might want to report your problem by mail as described in this document, as explained on the front-page of bugzilla.kernel.org: https://www.kernel.org/doc/html/latest/admin-guide/reporting-issues.html As this seems to be a regression you might want to CC the regressions mailing list on your report, as described in that document. The bug is still present in -rc3, tested on a Ryzen7-based HP Envy x360 convertible model 13-arXXXXX. Same logs, basically. Hi, this is your Linux kernel regression tracker speaking. Parlty top-posting for once, to make this easy accessible to everyone. Heikki, below bug sounds a awful lot like a regression. I'd be glad if you could take a quick look at this, as the report seems have fallen through the cracks; somebody else today confirmed the problem is still happening with 5.16-rc3. Chris or Manuel, could you please confirm v5.15.y worked fine? On 23.11.21 04:51, bugzilla-daemon@bugzilla.kernel.org wrote: > https://bugzilla.kernel.org/show_bug.cgi?id=215117 > > Bug ID: 215117 > Summary: ucsi_acpi: kernel NULL pointer dereference > Product: Drivers > Version: 2.5 > Kernel Version: 5.16-rc2 > Hardware: x86-64 > OS: Linux > Tree: Mainline > Status: NEW > Severity: normal > Priority: P1 > Component: USB > Assignee: drivers_usb@kernel-bugs.kernel.org > Reporter: linux-kernel-bugs@hixontech.com > Regression: No > > Created attachment 299677 [details] > --> https://bugzilla.kernel.org/attachment.cgi?id=299677&action=edit > journal and lshw > > The system fails to boot completely (or shutdown properly) after kernel oops, > apparently in the ucsi_acpi module. It boots up fine with this module > blacklisted. I first noticed the issue on 5.16-rc1; the problem continues > with > 5.16-rc2. > > HW: HP ENVY x360, AMD Ryzen 7 4700U with Radeon Graphics, Renoir > > Attached: full kernel journal log and output from lshw. > > OOPS: > > Nov 22 06:44:04 kernel: BUG: kernel NULL pointer dereference, address: > 0000000000000058 > Nov 22 06:44:04 kernel: #PF: supervisor read access in kernel mode > Nov 22 06:44:04 kernel: #PF: error_code(0x0000) - not-present page > Nov 22 06:44:04 kernel: PGD 0 P4D 0 > Nov 22 06:44:04 kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI > Nov 22 06:44:04 kernel: CPU: 0 PID: 394 Comm: kworker/0:2 Not tainted > 5.16.0-rc2-1-mainline #1 4a5aa185cbfb8b63cd50dfec190bc41096ea30a5 > Nov 22 06:44:04 kernel: Hardware name: HP HP ENVY x360 Convertible > 15-ds1xxx/87A9, BIOS F.07 03/18/2021 > Nov 22 06:44:04 kernel: Workqueue: events_long ucsi_init_work [typec_ucsi] > Nov 22 06:44:04 kernel: RIP: 0010:typec_register_altmode+0x2e/0x3a0 [typec] > Nov 22 06:44:04 kernel: Code: 00 41 57 41 56 41 55 41 54 49 89 f4 55 48 89 fd > 48 8d bf 08 03 00 00 53 48 83 ec 28 65 48 8b 04 25 28 00 00 00 48 89 44 24 20 > <48> 8b 87 50 fd ff ff 48 3d e0 99 5b c0 74 18 48 8d 95 f8 02 00 00 > Nov 22 06:44:04 kernel: RSP: 0018:ffffa171c0f9fd30 EFLAGS: 00010286 > Nov 22 06:44:04 kernel: RAX: 8a5a9eb1bcae6600 RBX: ffff94994f1b7800 RCX: > 0000000000000001 > Nov 22 06:44:04 kernel: RDX: 0000000000000000 RSI: ffffa171c0f9fdd0 RDI: > 0000000000000308 > Nov 22 06:44:04 kernel: RBP: 0000000000000000 R08: 0000000000000000 R09: > 0000000000000000 > Nov 22 06:44:04 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: > ffffa171c0f9fdd0 > Nov 22 06:44:04 kernel: R13: 0000000000000000 R14: 0000000000000000 R15: > ffff94994f1b7800 > Nov 22 06:44:04 kernel: FS: 0000000000000000(0000) GS:ffff949c3f600000(0000) > knlGS:0000000000000000 > Nov 22 06:44:04 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > Nov 22 06:44:04 kernel: CR2: 0000000000000058 CR3: 0000000103c3e000 CR4: > 0000000000350ef0 > Nov 22 06:44:04 kernel: Call Trace: > Nov 22 06:44:04 kernel: <TASK> > Nov 22 06:44:04 kernel: ? ucsi_acpi_sync_write+0x4a/0x70 [ucsi_acpi > 02bdd89c7010256e11856d8931a8362b48e4c3f7] > Nov 22 06:44:04 kernel: ucsi_register_altmode.constprop.0+0x1f0/0x250 > [typec_ucsi 5c5256aa8a0bedb6e8965681f3f36303c0e1b18d] > Nov 22 06:44:04 kernel: ucsi_register_altmodes+0x161/0x1c0 [typec_ucsi > 5c5256aa8a0bedb6e8965681f3f36303c0e1b18d] > Nov 22 06:44:04 kernel: ucsi_check_altmodes+0x17/0x50 [typec_ucsi > 5c5256aa8a0bedb6e8965681f3f36303c0e1b18d] > Nov 22 06:44:04 kernel: ucsi_init_work+0x6c7/0x720 [typec_ucsi > 5c5256aa8a0bedb6e8965681f3f36303c0e1b18d] > Nov 22 06:44:04 kernel: process_one_work+0x1e8/0x3c0 > Nov 22 06:44:04 kernel: worker_thread+0x50/0x3c0 > Nov 22 06:44:04 kernel: ? rescuer_thread+0x390/0x390 > Nov 22 06:44:04 kernel: kthread+0x15c/0x180 > Nov 22 06:44:04 kernel: ? set_kthread_struct+0x50/0x50 > Nov 22 06:44:04 kernel: ret_from_fork+0x22/0x30 > Nov 22 06:44:04 kernel: </TASK> > Nov 22 06:44:04 kernel: Modules linked in: snd_hda_codec_realtek(+) fjes(-) > snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi joydev iwlmvm(+) > mousedev snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi mac80211 > nls_iso8859_1 snd_hda_codec btusb vfat amdgpu(+) libarc4 snd_hda_core btrtl > fat > snd_hwdep btbcm iwlwifi snd_pcm btintel snd_timer bluetooth snd_pci_acp5x > snd_rn_pci_acp3x k10temp gpu_sched amd_sfh snd_pci_acp3x cfg80211 snd > ecdh_generic ucsi_acpi drm_ttm_helper sp5100_tco soundcore rfkill typec_ucsi > ttm i2c_piix4 typec mac_hid roles wmi video tpm_crb tpm_tis wireless_hotkey > tpm_tis_core hp_accel acpi_cpufreq lis3lv02d amd_pmc acpi_tad 9pnet_virtio 9p > 9pnet fscache netfs sg crypto_user fuse bpf_preload ip_tables x_tables ext4 > crc32c_generic crc16 mbcache jbd2 dm_crypt cbc encrypted_keys dm_mod trusted > asn1_encoder tee tpm rtsx_pci_sdmmc mmc_core crct10dif_pclmul serio_raw > crc32_pclmul crc32c_intel ghash_clmulni_intel atkbd aesni_intel libps2 > crypto_simd cryptd ccp xhci_pci > Nov 22 06:44:04 kernel: xhci_pci_renesas rng_core rtsx_pci i8042 serio > hid_multitouch i2c_hid_acpi i2c_hid pinctrl_amd > Nov 22 06:44:04 kernel: CR2: 0000000000000058 > Nov 22 06:44:04 kernel: ---[ end trace bdd82aa217da2b8a ]--- > Nov 22 06:44:04 kernel: RIP: 0010:typec_register_altmode+0x2e/0x3a0 [typec] > Nov 22 06:44:04 kernel: Code: 00 41 57 41 56 41 55 41 54 49 89 f4 55 48 89 fd > 48 8d bf 08 03 00 00 53 48 83 ec 28 65 48 8b 04 25 28 00 00 00 48 89 44 24 20 > <48> 8b 87 50 fd ff ff 48 3d e0 99 5b c0 74 18 48 8d 95 f8 02 00 00 > Nov 22 06:44:04 kernel: RSP: 0018:ffffa171c0f9fd30 EFLAGS: 00010286 > Nov 22 06:44:04 kernel: RAX: 8a5a9eb1bcae6600 RBX: ffff94994f1b7800 RCX: > 0000000000000001 > Nov 22 06:44:04 kernel: RDX: 0000000000000000 RSI: ffffa171c0f9fdd0 RDI: > 0000000000000308 > Nov 22 06:44:04 kernel: RBP: 0000000000000000 R08: 0000000000000000 R09: > 0000000000000000 > Nov 22 06:44:04 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: > ffffa171c0f9fdd0 > Nov 22 06:44:04 kernel: R13: 0000000000000000 R14: 0000000000000000 R15: > ffff94994f1b7800 > Nov 22 06:44:04 kernel: FS: 0000000000000000(0000) GS:ffff949c3f600000(0000) > knlGS:0000000000000000 > Nov 22 06:44:04 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > Nov 22 06:44:04 kernel: CR2: 0000000000000058 CR3: 0000000103c3e000 CR4: > 0000000000350ef0 [TLDR for the rest: adding this regression to regzbot; this mail is partly compiled from a few templates paragraphs some of you might have seen already.] Adding the regression mailing list to the list of recipients, as it should be in the loop for all regressions, as explained here: https://www.kernel.org/doc/html/latest/admin-guide/reporting-issues.html To be sure this issue doesn't fall through the cracks unnoticed, I'm adding it to regzbot, my Linux kernel regression tracking bot: #regzbot ^introduced v5.15..v5.16-rc1 #regzbot title usb: ucsi_acpi: kernel NULL pointer dereference Reminder: when fixing the issue, please add a 'Link:' tag with the URL to the report (the parent of this mail), then regzbot will automatically mark the regression as resolved once the fix lands in the appropriate tree. For more details about regzbot see footer. Sending this to everyone that got the initial report, to make all aware of the tracking. I also hope that messages like this motivate people to directly get at least the regression mailing list and ideally even regzbot involved when dealing with regressions, as messages like this wouldn't be needed then. Don't worry, I'll send further messages wrt to this regression just to the lists (with a tag in the subject so people can filter them away), as long as they are intended just for regzbot. With a bit of luck no such messages will be needed anyway. Ciao, Thorsten (wearing his 'Linux kernel regression tracker' hat). P.S.: As a Linux kernel regression tracker I'm getting a lot of reports on my table. I can only look briefly into most of them. Unfortunately therefore I sometimes will get things wrong or miss something important. I hope that's not the case here; if you think it is, don't hesitate to tell me about it in a public reply. That's in everyone's interest, as what I wrote above might be misleading to everyone reading this; any suggestion I gave thus might sent someone reading this down the wrong rabbit hole, which none of us wants. BTW, I have no personal interest in this issue, which is tracked using regzbot, my Linux kernel regression tracking bot (https://linux-regtracking.leemhuis.info/regzbot/). I'm only posting this mail to get things rolling again and hence don't need to be CC on all further activities wrt to this regression. It sure seems like a regression, starting at v5.16-rc1. I haven't encountered this bug in any v5.15 version I've used, including -rc versions (v5.15-rcX), v5.15(mainline), or v5.15.y(stable). It seems like the bug still exists in v5.16-rc4, though I wasn't able to capture a kernel oops the one time I tried that. I'll soon try -rc5. Created attachment 300041 [details]
fix proposal
Most likely regression from commit 6cbe4b2d5a3f ("usb: typec: ucsi: Check the partner alt modes always if there is PD contract").
Can you guys test the patch I attached?
Hi, On Thu, Dec 16, 2021 at 10:22:17AM +0100, Thorsten Leemhuis wrote: > Hi, this is your Linux kernel regression tracker speaking. > > Parlty top-posting for once, to make this easy accessible to everyone. > > Heikki, below bug sounds a awful lot like a regression. I'd be glad if > you could take a quick look at this, as the report seems have fallen > through the cracks; somebody else today confirmed the problem is still > happening with 5.16-rc3. It is most likely regression. This commit is quite likely the culprit: cbe4b2d5a3f ("usb: typec: ucsi: Check the partner alt modes always if there is PD contract") I think this should fix it: diff --git a/drivers/usb/typec/ucsi/ucsi.c b/drivers/usb/typec/ucsi/ucsi.c index 6aa28384f77f1..08561bf7c40cd 100644 --- a/drivers/usb/typec/ucsi/ucsi.c +++ b/drivers/usb/typec/ucsi/ucsi.c @@ -1150,7 +1150,9 @@ static int ucsi_register_port(struct ucsi *ucsi, int index) ret = 0; } - if (UCSI_CONSTAT_PWR_OPMODE(con->status.flags) == UCSI_CONSTAT_PWR_OPMODE_PD) { + if (con->partner && + UCSI_CONSTAT_PWR_OPMODE(con->status.flags) == + UCSI_CONSTAT_PWR_OPMODE_PD) { ucsi_get_src_pdos(con); ucsi_check_altmodes(con); } It's also attached to the bug report. > Chris or Manuel, could you please confirm v5.15.y worked fine? > > On 23.11.21 04:51, bugzilla-daemon@bugzilla.kernel.org wrote: > > https://bugzilla.kernel.org/show_bug.cgi?id=215117 > > > > Bug ID: 215117 > > Summary: ucsi_acpi: kernel NULL pointer dereference > > Product: Drivers > > Version: 2.5 > > Kernel Version: 5.16-rc2 > > Hardware: x86-64 > > OS: Linux > > Tree: Mainline > > Status: NEW > > Severity: normal > > Priority: P1 > > Component: USB > > Assignee: drivers_usb@kernel-bugs.kernel.org > > Reporter: linux-kernel-bugs@hixontech.com > > Regression: No > > > > Created attachment 299677 [details] > > --> https://bugzilla.kernel.org/attachment.cgi?id=299677&action=edit > > journal and lshw > > > > The system fails to boot completely (or shutdown properly) after kernel > oops, > > apparently in the ucsi_acpi module. It boots up fine with this module > > blacklisted. I first noticed the issue on 5.16-rc1; the problem continues > with > > 5.16-rc2. > > > > HW: HP ENVY x360, AMD Ryzen 7 4700U with Radeon Graphics, Renoir > > > > Attached: full kernel journal log and output from lshw. > > > > OOPS: > > > > Nov 22 06:44:04 kernel: BUG: kernel NULL pointer dereference, address: > > 0000000000000058 > > Nov 22 06:44:04 kernel: #PF: supervisor read access in kernel mode > > Nov 22 06:44:04 kernel: #PF: error_code(0x0000) - not-present page > > Nov 22 06:44:04 kernel: PGD 0 P4D 0 > > Nov 22 06:44:04 kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI > > Nov 22 06:44:04 kernel: CPU: 0 PID: 394 Comm: kworker/0:2 Not tainted > > 5.16.0-rc2-1-mainline #1 4a5aa185cbfb8b63cd50dfec190bc41096ea30a5 > > Nov 22 06:44:04 kernel: Hardware name: HP HP ENVY x360 Convertible > > 15-ds1xxx/87A9, BIOS F.07 03/18/2021 > > Nov 22 06:44:04 kernel: Workqueue: events_long ucsi_init_work [typec_ucsi] > > Nov 22 06:44:04 kernel: RIP: 0010:typec_register_altmode+0x2e/0x3a0 [typec] > > Nov 22 06:44:04 kernel: Code: 00 41 57 41 56 41 55 41 54 49 89 f4 55 48 89 > fd > > 48 8d bf 08 03 00 00 53 48 83 ec 28 65 48 8b 04 25 28 00 00 00 48 89 44 24 > 20 > > <48> 8b 87 50 fd ff ff 48 3d e0 99 5b c0 74 18 48 8d 95 f8 02 00 00 > > Nov 22 06:44:04 kernel: RSP: 0018:ffffa171c0f9fd30 EFLAGS: 00010286 > > Nov 22 06:44:04 kernel: RAX: 8a5a9eb1bcae6600 RBX: ffff94994f1b7800 RCX: > > 0000000000000001 > > Nov 22 06:44:04 kernel: RDX: 0000000000000000 RSI: ffffa171c0f9fdd0 RDI: > > 0000000000000308 > > Nov 22 06:44:04 kernel: RBP: 0000000000000000 R08: 0000000000000000 R09: > > 0000000000000000 > > Nov 22 06:44:04 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: > > ffffa171c0f9fdd0 > > Nov 22 06:44:04 kernel: R13: 0000000000000000 R14: 0000000000000000 R15: > > ffff94994f1b7800 > > Nov 22 06:44:04 kernel: FS: 0000000000000000(0000) > GS:ffff949c3f600000(0000) > > knlGS:0000000000000000 > > Nov 22 06:44:04 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > > Nov 22 06:44:04 kernel: CR2: 0000000000000058 CR3: 0000000103c3e000 CR4: > > 0000000000350ef0 > > Nov 22 06:44:04 kernel: Call Trace: > > Nov 22 06:44:04 kernel: <TASK> > > Nov 22 06:44:04 kernel: ? ucsi_acpi_sync_write+0x4a/0x70 [ucsi_acpi > > 02bdd89c7010256e11856d8931a8362b48e4c3f7] > > Nov 22 06:44:04 kernel: ucsi_register_altmode.constprop.0+0x1f0/0x250 > > [typec_ucsi 5c5256aa8a0bedb6e8965681f3f36303c0e1b18d] > > Nov 22 06:44:04 kernel: ucsi_register_altmodes+0x161/0x1c0 [typec_ucsi > > 5c5256aa8a0bedb6e8965681f3f36303c0e1b18d] > > Nov 22 06:44:04 kernel: ucsi_check_altmodes+0x17/0x50 [typec_ucsi > > 5c5256aa8a0bedb6e8965681f3f36303c0e1b18d] > > Nov 22 06:44:04 kernel: ucsi_init_work+0x6c7/0x720 [typec_ucsi > > 5c5256aa8a0bedb6e8965681f3f36303c0e1b18d] > > Nov 22 06:44:04 kernel: process_one_work+0x1e8/0x3c0 > > Nov 22 06:44:04 kernel: worker_thread+0x50/0x3c0 > > Nov 22 06:44:04 kernel: ? rescuer_thread+0x390/0x390 > > Nov 22 06:44:04 kernel: kthread+0x15c/0x180 > > Nov 22 06:44:04 kernel: ? set_kthread_struct+0x50/0x50 > > Nov 22 06:44:04 kernel: ret_from_fork+0x22/0x30 > > Nov 22 06:44:04 kernel: </TASK> > > Nov 22 06:44:04 kernel: Modules linked in: snd_hda_codec_realtek(+) fjes(-) > > snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi joydev iwlmvm(+) > > mousedev snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi mac80211 > > nls_iso8859_1 snd_hda_codec btusb vfat amdgpu(+) libarc4 snd_hda_core btrtl > fat > > snd_hwdep btbcm iwlwifi snd_pcm btintel snd_timer bluetooth snd_pci_acp5x > > snd_rn_pci_acp3x k10temp gpu_sched amd_sfh snd_pci_acp3x cfg80211 snd > > ecdh_generic ucsi_acpi drm_ttm_helper sp5100_tco soundcore rfkill > typec_ucsi > > ttm i2c_piix4 typec mac_hid roles wmi video tpm_crb tpm_tis wireless_hotkey > > tpm_tis_core hp_accel acpi_cpufreq lis3lv02d amd_pmc acpi_tad 9pnet_virtio > 9p > > 9pnet fscache netfs sg crypto_user fuse bpf_preload ip_tables x_tables ext4 > > crc32c_generic crc16 mbcache jbd2 dm_crypt cbc encrypted_keys dm_mod > trusted > > asn1_encoder tee tpm rtsx_pci_sdmmc mmc_core crct10dif_pclmul serio_raw > > crc32_pclmul crc32c_intel ghash_clmulni_intel atkbd aesni_intel libps2 > > crypto_simd cryptd ccp xhci_pci > > Nov 22 06:44:04 kernel: xhci_pci_renesas rng_core rtsx_pci i8042 serio > > hid_multitouch i2c_hid_acpi i2c_hid pinctrl_amd > > Nov 22 06:44:04 kernel: CR2: 0000000000000058 > > Nov 22 06:44:04 kernel: ---[ end trace bdd82aa217da2b8a ]--- > > Nov 22 06:44:04 kernel: RIP: 0010:typec_register_altmode+0x2e/0x3a0 [typec] > > Nov 22 06:44:04 kernel: Code: 00 41 57 41 56 41 55 41 54 49 89 f4 55 48 89 > fd > > 48 8d bf 08 03 00 00 53 48 83 ec 28 65 48 8b 04 25 28 00 00 00 48 89 44 24 > 20 > > <48> 8b 87 50 fd ff ff 48 3d e0 99 5b c0 74 18 48 8d 95 f8 02 00 00 > > Nov 22 06:44:04 kernel: RSP: 0018:ffffa171c0f9fd30 EFLAGS: 00010286 > > Nov 22 06:44:04 kernel: RAX: 8a5a9eb1bcae6600 RBX: ffff94994f1b7800 RCX: > > 0000000000000001 > > Nov 22 06:44:04 kernel: RDX: 0000000000000000 RSI: ffffa171c0f9fdd0 RDI: > > 0000000000000308 > > Nov 22 06:44:04 kernel: RBP: 0000000000000000 R08: 0000000000000000 R09: > > 0000000000000000 > > Nov 22 06:44:04 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: > > ffffa171c0f9fdd0 > > Nov 22 06:44:04 kernel: R13: 0000000000000000 R14: 0000000000000000 R15: > > ffff94994f1b7800 > > Nov 22 06:44:04 kernel: FS: 0000000000000000(0000) > GS:ffff949c3f600000(0000) > > knlGS:0000000000000000 > > Nov 22 06:44:04 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > > Nov 22 06:44:04 kernel: CR2: 0000000000000058 CR3: 0000000103c3e000 CR4: > > 0000000000350ef0 > > [TLDR for the rest: adding this regression to regzbot; this mail is > partly compiled from a few templates paragraphs some of you might have > seen already.] > > Adding the regression mailing list to the list of recipients, as it > should be in the loop for all regressions, as explained here: > https://www.kernel.org/doc/html/latest/admin-guide/reporting-issues.html > > To be sure this issue doesn't fall through the cracks unnoticed, I'm > adding it to regzbot, my Linux kernel regression tracking bot: > > #regzbot ^introduced v5.15..v5.16-rc1 > #regzbot title usb: ucsi_acpi: kernel NULL pointer dereference > > Reminder: when fixing the issue, please add a 'Link:' tag with the URL > to the report (the parent of this mail), then regzbot will automatically > mark the regression as resolved once the fix lands in the appropriate > tree. For more details about regzbot see footer. > > Sending this to everyone that got the initial report, to make all aware > of the tracking. I also hope that messages like this motivate people to > directly get at least the regression mailing list and ideally even > regzbot involved when dealing with regressions, as messages like this > wouldn't be needed then. > > Don't worry, I'll send further messages wrt to this regression just to > the lists (with a tag in the subject so people can filter them away), as > long as they are intended just for regzbot. With a bit of luck no such > messages will be needed anyway. > > Ciao, Thorsten (wearing his 'Linux kernel regression tracker' hat). > > P.S.: As a Linux kernel regression tracker I'm getting a lot of reports > on my table. I can only look briefly into most of them. Unfortunately > therefore I sometimes will get things wrong or miss something important. > I hope that's not the case here; if you think it is, don't hesitate to > tell me about it in a public reply. That's in everyone's interest, as > what I wrote above might be misleading to everyone reading this; any > suggestion I gave thus might sent someone reading this down the wrong > rabbit hole, which none of us wants. > > BTW, I have no personal interest in this issue, which is tracked using > regzbot, my Linux kernel regression tracking bot > (https://linux-regtracking.leemhuis.info/regzbot/). I'm only posting > this mail to get things rolling again and hence don't need to be CC on > all further activities wrt to this regression. That patch seems to fix the bug for me, applied to v5.16-rc5. (patch attached to bug report as https://bugzilla.kernel.org/attachment.cgi?id=300041 ) Thanks! |
Created attachment 299677 [details] journal and lshw The system fails to boot completely (or shutdown properly) after kernel oops, apparently in the ucsi_acpi module. It boots up fine with this module blacklisted. I first noticed the issue on 5.16-rc1; the problem continues with 5.16-rc2. HW: HP ENVY x360, AMD Ryzen 7 4700U with Radeon Graphics, Renoir Attached: full kernel journal log and output from lshw. OOPS: Nov 22 06:44:04 kernel: BUG: kernel NULL pointer dereference, address: 0000000000000058 Nov 22 06:44:04 kernel: #PF: supervisor read access in kernel mode Nov 22 06:44:04 kernel: #PF: error_code(0x0000) - not-present page Nov 22 06:44:04 kernel: PGD 0 P4D 0 Nov 22 06:44:04 kernel: Oops: 0000 [#1] PREEMPT SMP NOPTI Nov 22 06:44:04 kernel: CPU: 0 PID: 394 Comm: kworker/0:2 Not tainted 5.16.0-rc2-1-mainline #1 4a5aa185cbfb8b63cd50dfec190bc41096ea30a5 Nov 22 06:44:04 kernel: Hardware name: HP HP ENVY x360 Convertible 15-ds1xxx/87A9, BIOS F.07 03/18/2021 Nov 22 06:44:04 kernel: Workqueue: events_long ucsi_init_work [typec_ucsi] Nov 22 06:44:04 kernel: RIP: 0010:typec_register_altmode+0x2e/0x3a0 [typec] Nov 22 06:44:04 kernel: Code: 00 41 57 41 56 41 55 41 54 49 89 f4 55 48 89 fd 48 8d bf 08 03 00 00 53 48 83 ec 28 65 48 8b 04 25 28 00 00 00 48 89 44 24 20 <48> 8b 87 50 fd ff ff 48 3d e0 99 5b c0 74 18 48 8d 95 f8 02 00 00 Nov 22 06:44:04 kernel: RSP: 0018:ffffa171c0f9fd30 EFLAGS: 00010286 Nov 22 06:44:04 kernel: RAX: 8a5a9eb1bcae6600 RBX: ffff94994f1b7800 RCX: 0000000000000001 Nov 22 06:44:04 kernel: RDX: 0000000000000000 RSI: ffffa171c0f9fdd0 RDI: 0000000000000308 Nov 22 06:44:04 kernel: RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000 Nov 22 06:44:04 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffffa171c0f9fdd0 Nov 22 06:44:04 kernel: R13: 0000000000000000 R14: 0000000000000000 R15: ffff94994f1b7800 Nov 22 06:44:04 kernel: FS: 0000000000000000(0000) GS:ffff949c3f600000(0000) knlGS:0000000000000000 Nov 22 06:44:04 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Nov 22 06:44:04 kernel: CR2: 0000000000000058 CR3: 0000000103c3e000 CR4: 0000000000350ef0 Nov 22 06:44:04 kernel: Call Trace: Nov 22 06:44:04 kernel: <TASK> Nov 22 06:44:04 kernel: ? ucsi_acpi_sync_write+0x4a/0x70 [ucsi_acpi 02bdd89c7010256e11856d8931a8362b48e4c3f7] Nov 22 06:44:04 kernel: ucsi_register_altmode.constprop.0+0x1f0/0x250 [typec_ucsi 5c5256aa8a0bedb6e8965681f3f36303c0e1b18d] Nov 22 06:44:04 kernel: ucsi_register_altmodes+0x161/0x1c0 [typec_ucsi 5c5256aa8a0bedb6e8965681f3f36303c0e1b18d] Nov 22 06:44:04 kernel: ucsi_check_altmodes+0x17/0x50 [typec_ucsi 5c5256aa8a0bedb6e8965681f3f36303c0e1b18d] Nov 22 06:44:04 kernel: ucsi_init_work+0x6c7/0x720 [typec_ucsi 5c5256aa8a0bedb6e8965681f3f36303c0e1b18d] Nov 22 06:44:04 kernel: process_one_work+0x1e8/0x3c0 Nov 22 06:44:04 kernel: worker_thread+0x50/0x3c0 Nov 22 06:44:04 kernel: ? rescuer_thread+0x390/0x390 Nov 22 06:44:04 kernel: kthread+0x15c/0x180 Nov 22 06:44:04 kernel: ? set_kthread_struct+0x50/0x50 Nov 22 06:44:04 kernel: ret_from_fork+0x22/0x30 Nov 22 06:44:04 kernel: </TASK> Nov 22 06:44:04 kernel: Modules linked in: snd_hda_codec_realtek(+) fjes(-) snd_hda_codec_generic ledtrig_audio snd_hda_codec_hdmi joydev iwlmvm(+) mousedev snd_hda_intel snd_intel_dspcfg snd_intel_sdw_acpi mac80211 nls_iso8859_1 snd_hda_codec btusb vfat amdgpu(+) libarc4 snd_hda_core btrtl fat snd_hwdep btbcm iwlwifi snd_pcm btintel snd_timer bluetooth snd_pci_acp5x snd_rn_pci_acp3x k10temp gpu_sched amd_sfh snd_pci_acp3x cfg80211 snd ecdh_generic ucsi_acpi drm_ttm_helper sp5100_tco soundcore rfkill typec_ucsi ttm i2c_piix4 typec mac_hid roles wmi video tpm_crb tpm_tis wireless_hotkey tpm_tis_core hp_accel acpi_cpufreq lis3lv02d amd_pmc acpi_tad 9pnet_virtio 9p 9pnet fscache netfs sg crypto_user fuse bpf_preload ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 dm_crypt cbc encrypted_keys dm_mod trusted asn1_encoder tee tpm rtsx_pci_sdmmc mmc_core crct10dif_pclmul serio_raw crc32_pclmul crc32c_intel ghash_clmulni_intel atkbd aesni_intel libps2 crypto_simd cryptd ccp xhci_pci Nov 22 06:44:04 kernel: xhci_pci_renesas rng_core rtsx_pci i8042 serio hid_multitouch i2c_hid_acpi i2c_hid pinctrl_amd Nov 22 06:44:04 kernel: CR2: 0000000000000058 Nov 22 06:44:04 kernel: ---[ end trace bdd82aa217da2b8a ]--- Nov 22 06:44:04 kernel: RIP: 0010:typec_register_altmode+0x2e/0x3a0 [typec] Nov 22 06:44:04 kernel: Code: 00 41 57 41 56 41 55 41 54 49 89 f4 55 48 89 fd 48 8d bf 08 03 00 00 53 48 83 ec 28 65 48 8b 04 25 28 00 00 00 48 89 44 24 20 <48> 8b 87 50 fd ff ff 48 3d e0 99 5b c0 74 18 48 8d 95 f8 02 00 00 Nov 22 06:44:04 kernel: RSP: 0018:ffffa171c0f9fd30 EFLAGS: 00010286 Nov 22 06:44:04 kernel: RAX: 8a5a9eb1bcae6600 RBX: ffff94994f1b7800 RCX: 0000000000000001 Nov 22 06:44:04 kernel: RDX: 0000000000000000 RSI: ffffa171c0f9fdd0 RDI: 0000000000000308 Nov 22 06:44:04 kernel: RBP: 0000000000000000 R08: 0000000000000000 R09: 0000000000000000 Nov 22 06:44:04 kernel: R10: 0000000000000000 R11: 0000000000000000 R12: ffffa171c0f9fdd0 Nov 22 06:44:04 kernel: R13: 0000000000000000 R14: 0000000000000000 R15: ffff94994f1b7800 Nov 22 06:44:04 kernel: FS: 0000000000000000(0000) GS:ffff949c3f600000(0000) knlGS:0000000000000000 Nov 22 06:44:04 kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 Nov 22 06:44:04 kernel: CR2: 0000000000000058 CR3: 0000000103c3e000 CR4: 0000000000350ef0