Bug 215618
| Summary: | vblank related lockup during start of SteamVR using Valve Index HMD | | |
|---|---|---|---|
| Product: | Drivers | Reporter: | farmboy0 |
| Component: | Video(DRI - non Intel) | Assignee: | drivers_video-dri |
| Status: | NEW --- | | |
| Severity: | normal | CC: | ct.lrn, perk11, primalucegd |
| Priority: | P1 | | |
| Hardware: | All | | |
| OS: | Linux | | |
| Kernel Version: | 5.16.9 | Subsystem: | |
| Regression: | No | Bisected commit-id: | |
Description
farmboy0
2022-02-17 18:48:05 UTC
I can reproduce the crash with a Radeon 6800XT on kernel 5.17.1. The GPU is unstable after it resets, and the system must be rebooted.
Here is my callstack - it's almost the same as the previous one:
> [drm:dm_vblank_get_counter [amdgpu]] *ERROR* dc_stream_state is NULL for crtc '1'!
> [drm:dm_crtc_get_scanoutpos [amdgpu]] *ERROR* dc_stream_state is NULL for crtc '1'!
> [drm:dm_vblank_get_counter [amdgpu]] *ERROR* dc_stream_state is NULL for crtc '1'!
> ------------[ cut here ]------------
> amdgpu 0000:0b:00.0: drm_WARN_ON_ONCE(drm_drv_uses_atomic_modeset(dev))
> WARNING: CPU: 3 PID: 2263 at drivers/gpu/drm/drm_vblank.c:728 drm_crtc_vblank_helper_get_vblank_timestamp_internal+0x369/0x380
> Modules linked in: nf_tables nfnetlink snd_seq_dummy snd_hrtimer snd_seq cfg80211 8021q garp mrp stp llc nct6775 hwmon_vid eeepc_wmi intel_ra>
> crct10dif_pclmul crc32_pclmul crc32c_intel ghash_clmulni_intel aesni_intel ccp sr_mod xhci_pci crypto_simd cryptd rng_core cdrom xhci_pci_re>
> CPU: 3 PID: 2263 Comm: VulkanVblankThr Not tainted 5.17.1-arch1-1 #1 0ea933cb6bfe82a8dc16ab834a4bccdd297f98b7
> Hardware name: ASUS System Product Name/ROG CROSSHAIR VIII DARK HERO, BIOS 3601 05/26/2021
> RIP: 0010:drm_crtc_vblank_helper_get_vblank_timestamp_internal+0x369/0x380
> Code: 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 f0 6b 01 00 48 c7 c1 40 f3 d1 ad 4c 89 ea 48 c7 c7 c2 5d d1 ad 48 89 c6 e8 a3 43 3d 00 <0f> 0b e>
> RSP: 0018:ffff9beb86303b20 EFLAGS: 00010082
> RAX: 0000000000000000 RBX: ffffffffc0b7e840 RCX: 0000000000000027
> RDX: ffff8dca0eae1728 RSI: 0000000000000001 RDI: ffff8dca0eae1720
> RBP: ffff9beb86303b90 R08: 0000000000000000 R09: ffff9beb86303950
> R10: ffff9beb86303948 R11: ffff8dca2f2a9b28 R12: 0000000000000000
> R13: ffff8dc3023dae30 R14: 0000000000000000 R15: ffff8dc3376b21d8
> FS: 00007fa79444b640(0000) GS:ffff8dca0eac0000(0000) knlGS:0000000000000000
> CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
> CR2: 00007fa6ec001278 CR3: 00000001e6984000 CR4: 0000000000750ee0
> PKRU: 55555554
> Call Trace:
> <TASK>
> drm_get_last_vbltimestamp+0xb2/0xc0
> drm_update_vblank_count+0x91/0x3d0
> drm_vblank_enable+0x14b/0x180
> drm_vblank_get+0x95/0xe0
> drm_crtc_queue_sequence_ioctl+0xfd/0x2d0
> ? __check_object_size+0x46/0x140
> ? drm_crtc_get_sequence_ioctl+0x1a0/0x1a0
> drm_ioctl_kernel+0xb8/0x140
> drm_ioctl+0x22a/0x3d0
> ? drm_crtc_get_sequence_ioctl+0x1a0/0x1a0
> amdgpu_drm_ioctl+0x49/0x80 [amdgpu 08a70cd20fdf14582ce9165e3698aeaecdd8c8f8]
> __x64_sys_ioctl+0x82/0xb0
> do_syscall_64+0x5c/0x80
> ? do_user_addr_fault+0x1d7/0x690
> ? do_syscall_64+0x69/0x80
> ? exc_page_fault+0x72/0x170
> entry_SYSCALL_64_after_hwframe+0x44/0xae
> RIP: 0033:0x7fa7ac2a7e6f
> Code: 00 48 89 44 24 18 31 c0 48 8d 44 24 60 c7 04 24 10 00 00 00 48 89 44 24 08 48 8d 44 24 20 48 89 44 24 10 b8 10 00 00 00 0f 05 <41> 89 c>
> RSP: 002b:00007fa79444ab00 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
> RAX: ffffffffffffffda RBX: 00007fa79444ab90 RCX: 00007fa7ac2a7e6f
> RDX: 00007fa79444ab90 RSI: 00000000c018643c RDI: 000000000000004a
> RBP: 00000000c018643c R08: 0000000000000000 R09: 00007fa6ec000be0
> R10: 000000000000006e R11: 0000000000000246 R12: 000055d760b895b8
> R13: 000000000000004a R14: 000055d760c41d00 R15: 0000000000000000
> </TASK>
> ---[ end trace 0000000000000000 ]---
> [drm:dm_vblank_get_counter [amdgpu]] *ERROR* dc_stream_state is NULL for crtc '1'!
> [drm:dm_crtc_get_scanoutpos [amdgpu]] *ERROR* dc_stream_state is NULL for crtc '1'!
> [drm:dm_vblank_get_counter [amdgpu]] *ERROR* dc_stream_state is NULL for crtc '1'!
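In the user-space register dump above, ORIG_RAX is 0x10 (the x86_64 `ioctl` syscall) and RSI holds the request number `0xc018643c`. The following is a minimal sketch that decodes that number, assuming the generic Linux ioctl bit layout used on x86_64 (8-bit number, 8-bit type, 14-bit size, 2-bit direction). The helper name `decode_ioctl` is made up for illustration. The decoded fields (type `'d'`, number 0x3c, 24-byte argument, read and write) appear to match `DRM_IOCTL_CRTC_QUEUE_SEQUENCE`, which is consistent with `drm_crtc_queue_sequence_ioctl` in the call trace.

```python
# Decode a Linux ioctl request number.
# Assumption: the generic asm-generic/ioctl.h layout (as used on x86_64):
#   bits 0-7   command number
#   bits 8-15  type ("magic") character
#   bits 16-29 argument size in bytes
#   bits 30-31 direction (1 = write, 2 = read)
DIRECTIONS = {0: "none", 1: "write", 2: "read", 3: "read|write"}


def decode_ioctl(request: int) -> dict:
    """Split an ioctl request number into its four fields (hypothetical helper)."""
    return {
        "nr": request & 0xFF,
        "type": chr((request >> 8) & 0xFF),
        "size": (request >> 16) & 0x3FFF,
        "dir": DIRECTIONS[(request >> 30) & 0x3],
    }


# RSI value taken from the register dump in the trace above.
decoded = decode_ioctl(0xC018643C)
print(decoded)
```

The same RSI value shows up in the other traces in this report, so all of them seem to enter the kernel through the same ioctl.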
Hello, wanted to confirm that I also have gotten this issue using an HTC Vive HMD on kernel version 5.15.32. Here's /var/log/messages for the first few seconds before the crash, the actual crash log, and the few messages before the system finally shut down. This is consistent on the game "Jet Island" when attempting to load into the game world - the menus in the game prior to doing so appear to present no issues.

May 17 18:48:15 auro-desktop steam[6275]: LHR-84CF93F9: Triggered keepalive (succeeded)
May 17 18:48:15 auro-desktop steam[6275]: LHR-84CF93F9: Packet received after 0.019s, keepalive (0/1)
May 17 18:48:18 auro-desktop kernel: ------------[ cut here ]------------
May 17 18:48:18 auro-desktop kernel: amdgpu 0000:01:00.0: drm_WARN_ON_ONCE(drm_drv_uses_atomic_modeset(dev))
May 17 18:48:18 auro-desktop kernel: WARNING: CPU: 0 PID: 6896 at drivers/gpu/drm/drm_vblank.c:728 drm_crtc_vblank_helper_get_vblank_timestamp_internal+0x34b/0x360
May 17 18:48:18 auro-desktop kernel: Modules linked in: rfcomm xt_CHECKSUM xt_MASQUERADE xt_conntrack iptable_nat algif_hash algif_skcipher af_alg bnep nvidia_drm(PO) nvidia_modeset(PO) mxm_wmi x86_pkg_temp_thermal btusb nvidia(PO) btrtl btbcm btintel bluetooth kvm_intel ecdh_generic ecc iwlmvm amdgpu iwlwifi drm_ttm_helper wmi gpu_sched vboxnetflt(O) vboxnetadp(O) vboxdrv(O) v4l2loopback(O) efivarfs
May 17 18:48:18 auro-desktop kernel: CPU: 0 PID: 6896 Comm: VulkanVblankThr Tainted: P O 5.15.32-gentoo-r1 #1
May 17 18:48:18 auro-desktop kernel: Hardware name: Gigabyte Technology Co., Ltd. Z170XP-SLI/Z170XP-SLI-CF, BIOS F22d 01/11/2018
May 17 18:48:18 auro-desktop kernel: RIP: 0010:drm_crtc_vblank_helper_get_vblank_timestamp_internal+0x34b/0x360
May 17 18:48:18 auro-desktop kernel: Code: 4c 8b 6f 50 4d 85 ed 75 03 4c 8b 2f e8 ae 6b 16 00 48 c7 c1 80 47 37 9b 4c 89 ea 48 c7 c7 4b b5 36 9b 48 89 c6 e8 1f 60 77 00 <0f> 0b e9 d0 fe ff ff e8 99 d7 7e 00 66 0f 1f 84 00 00 00 00 00 0f
May 17 18:48:18 auro-desktop kernel: RSP: 0018:ffffafa8c29efbc8 EFLAGS: 00010082
May 17 18:48:18 auro-desktop kernel: RAX: 0000000000000000 RBX: ffffffffc04e80e0 RCX: 0000000000000027
May 17 18:48:18 auro-desktop kernel: RDX: ffff96a95ec1b458 RSI: 0000000000000001 RDI: ffff96a95ec1b450
May 17 18:48:18 auro-desktop kernel: RBP: ffffafa8c29efc38 R08: ffffffff9b74a9c8 R09: 00000000ffffdfff
May 17 18:48:18 auro-desktop kernel: R10: ffffffff9b66a9e0 R11: ffffffff9b66a9e0 R12: 0000000000000000
May 17 18:48:18 auro-desktop kernel: R13: ffff96a240ef4ad0 R14: 0000000000000000 R15: ffff96a24e53a9d8
May 17 18:48:18 auro-desktop kernel: FS: 00007fd630203640(0000) GS:ffff96a95ec00000(0000) knlGS:0000000000000000
May 17 18:48:18 auro-desktop kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
May 17 18:48:18 auro-desktop kernel: CR2: 00007fd60c001268 CR3: 00000001b9df0003 CR4: 00000000003706f0
May 17 18:48:18 auro-desktop kernel: DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
May 17 18:48:18 auro-desktop kernel: DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
May 17 18:48:18 auro-desktop kernel: Call Trace:
May 17 18:48:18 auro-desktop kernel: <TASK>
May 17 18:48:18 auro-desktop kernel: drm_get_last_vbltimestamp+0xaa/0xc0
May 17 18:48:18 auro-desktop kernel: drm_update_vblank_count+0x88/0x3c0
May 17 18:48:18 auro-desktop kernel: drm_vblank_enable+0x154/0x190
May 17 18:48:18 auro-desktop kernel: drm_vblank_get+0x95/0xe0
May 17 18:48:18 auro-desktop kernel: drm_crtc_queue_sequence_ioctl+0xea/0x290
May 17 18:48:18 auro-desktop kernel: ? drm_crtc_get_sequence_ioctl+0x190/0x190
May 17 18:48:18 auro-desktop kernel: drm_ioctl_kernel+0xaa/0xf0
May 17 18:48:18 auro-desktop kernel: drm_ioctl+0x1ff/0x3d0
May 17 18:48:18 auro-desktop kernel: ? drm_crtc_get_sequence_ioctl+0x190/0x190
May 17 18:48:18 auro-desktop kernel: ? selinux_file_ioctl+0x144/0x250
May 17 18:48:18 auro-desktop kernel: amdgpu_drm_ioctl+0x49/0x80 [amdgpu]
May 17 18:48:18 auro-desktop kernel: __x64_sys_ioctl+0x82/0xb0
May 17 18:48:18 auro-desktop kernel: do_syscall_64+0x3b/0x90
May 17 18:48:18 auro-desktop kernel: entry_SYSCALL_64_after_hwframe+0x44/0xae
May 17 18:48:18 auro-desktop kernel: RIP: 0033:0x7fd6502b7457
May 17 18:48:18 auro-desktop kernel: Code: 3c 1c e8 2c ff ff ff 85 c0 79 97 5b 49 c7 c4 ff ff ff ff 5d 4c 89 e0 41 5c c3 66 0f 1f 84 00 00 00 00 00 b8 10 00 00 00 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d 99 e9 0e 00 f7 d8 64 89 01 48
May 17 18:48:18 auro-desktop kernel: RSP: 002b:00007fd630202af8 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
May 17 18:48:18 auro-desktop kernel: RAX: ffffffffffffffda RBX: 00007fd630202b30 RCX: 00007fd6502b7457
May 17 18:48:18 auro-desktop kernel: RDX: 00007fd630202b30 RSI: 00000000c018643c RDI: 0000000000000056
May 17 18:48:18 auro-desktop kernel: RBP: 00000000c018643c R08: 0000000000000000 R09: 00007fd60c000bd0
May 17 18:48:18 auro-desktop kernel: R10: 00007fd64c846b70 R11: 0000000000000246 R12: 000055d3a8389898
May 17 18:48:18 auro-desktop kernel: R13: 0000000000000056 R14: 000055d3a8521b00 R15: 000055d3a8389850
May 17 18:48:18 auro-desktop kernel: </TASK>
May 17 18:48:18 auro-desktop kernel: ---[ end trace ece90a9627f14e87 ]---
May 17 18:48:26 auro-desktop crash_20220517184826_2.dmp[7049]: Uploading dump (out-of-process) /tmp/dumps/crash_20220517184826_2.dmp
May 17 18:48:26 auro-desktop kernel: RenderThread[6902]: segfault at 0 ip 000055d3a6274c7b sp 00007fd5fb5f9ee0 error 4 in vrcompositor[55d3a6035000+4bc000]
May 17 18:48:26 auro-desktop kernel: Code: f8 ff ff 66 90 66 2e 0f 1f 84 00 00 00 00 00 55 48 89 e5 41 56 41 89 ce 41 55 49 89 fd 48 89 f7 41 54 49 89 d4 53 48 83 ec 30 <48> 8b 06 ff 50 18 f6 40 1c 10 48 89 c3 0f 85 02 01 00 00 48 8d 45
May 17 18:48:42 auro-desktop crash_20220517184842_2.dmp[7178]: Uploading dump (out-of-process) /tmp/dumps/crash_20220517184842_2.dmp
May 17 18:48:42 auro-desktop kernel: traps: vrmonitor[6912] trap divide error ip:7f5e4ab3b893 sp:7ffebc5ed130 error:0 in libQt5XcbQpa.so.5[7f5e4aad5000+15b000]
May 17 18:48:43 auro-desktop crash_20220517184842_2.dmp[7178]: Finished uploading minidump (out-of-process): success = yes
May 17 18:48:43 auro-desktop crash_20220517184842_2.dmp[7178]: response: CrashID=bp-5c64aa69-8191-446a-9bbd-78d142220517
May 17 18:48:43 auro-desktop crash_20220517184842_2.dmp[7178]: file ''/tmp/dumps/crash_20220517184842_2.dmp'', upload yes: ''CrashID=bp-5c64aa69-8191-446a-9bbd-78d142220517''
May 17 18:49:17 auro-desktop crash_20220517184826_2.dmp[7049]: Finished uploading minidump (out-of-process): success = no
May 17 18:49:17 auro-desktop crash_20220517184826_2.dmp[7049]: error: HTTP response code said error
May 17 18:49:17 auro-desktop crash_20220517184826_2.dmp[7049]: file ''/tmp/dumps/crash_20220517184826_2.dmp'', upload no: ''HTTP response code said error''

Getting the same issue as the original post most times when starting SteamVR with Valve Index connected.

Linux perk11-home 5.18.6-1-MANJARO #1 SMP PREEMPT_DYNAMIC Wed Jun 22 14:16:20 UTC 2022 x86_64 GNU/Linux

Jun 29 22:38:01 perk11-home kernel: [drm:dm_vblank_get_counter [amdgpu]] *ERROR* dc_stream_state is NULL for crtc '1'!
Jun 29 22:38:01 perk11-home kernel: [drm:dm_crtc_get_scanoutpos [amdgpu]] *ERROR* dc_stream_state is NULL for crtc '1'!
Jun 29 22:38:01 perk11-home kernel: [drm:dm_vblank_get_counter [amdgpu]] *ERROR* dc_stream_state is NULL for crtc '1'!
Jun 29 22:38:01 perk11-home kernel: ------------[ cut here ]------------
Jun 29 22:38:01 perk11-home kernel: amdgpu 0000:0a:00.0: drm_WARN_ON_ONCE(drm_drv_uses_atomic_modeset(dev))
Jun 29 22:38:01 perk11-home kernel: WARNING: CPU: 0 PID: 24962 at drivers/gpu/drm/drm_vblank.c:728 drm_crtc_vblank_helper_get_vblank_timestamp_internal+0x343/0x350
Jun 29 22:38:01 perk11-home kernel: Modules linked in: cdc_acm xt_REDIRECT xt_nat xt_tcpudp veth xt_conntrack xt_MASQUERADE nf_conntrack_netlink nfnetlink xt_addrtype iptable_nat nf_nat nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 br_netfilter bridge stp>
Jun 29 22:38:01 perk11-home kernel: gpu_sched crypto_simd drm_ttm_helper cryptd ttm snd_timer rapl snd drm_dp_helper tpm_crb soundcore pcspkr tpm_tis tpm_tis_core ccp tpm joydev rng_core pinctrl_amd mac_hid sp5100_tco wmi k10temp i2c_piix4 acpi_cpufre>
Jun 29 22:38:01 perk11-home kernel: CPU: 0 PID: 24962 Comm: VulkanVblankThr Tainted: G OE 5.18.6-1-MANJARO #1 b31d9f0a4836337638fdda8256aa2395a26153d9
Jun 29 22:38:01 perk11-home kernel: Hardware name: Gigabyte Technology Co., Ltd. X570 AORUS ELITE/X570 AORUS ELITE, BIOS F37a 02/16/2022
Jun 29 22:38:01 perk11-home kernel: RIP: 0010:drm_crtc_vblank_helper_get_vblank_timestamp_internal+0x343/0x350
Jun 29 22:38:01 perk11-home kernel: Code: 48 8b 5f 50 48 85 db 75 03 48 8b 1f e8 46 5b 01 00 48 c7 c1 10 9b 14 b8 48 89 da 48 c7 c7 0c a0 0b b8 48 89 c6 e8 59 4e 3e 00 <0f> 0b e9 c7 fd ff ff e8 f1 fa 43 00 90 f3 0f 1e fa 0f 1f 44 00 00
Jun 29 22:38:01 perk11-home kernel: RSP: 0018:ffffba908622bb98 EFLAGS: 00010086
Jun 29 22:38:01 perk11-home kernel: RAX: 0000000000000000 RBX: ffff9ec78170b600 RCX: 0000000000000027
Jun 29 22:38:01 perk11-home kernel: RDX: ffff9ed67ea216a8 RSI: 0000000000000001 RDI: ffff9ed67ea216a0
Jun 29 22:38:01 perk11-home kernel: RBP: ffffba908622bc08 R08: 0000000000000000 R09: ffffba908622b9a8
Jun 29 22:38:01 perk11-home kernel: R10: 0000000000000003 R11: ffff9ed6bf3248a8 R12: ffffba908622bc68
Jun 29 22:38:01 perk11-home kernel: R13: ffffffffc0ea51e0 R14: 0000000000000003 R15: ffff9ec7a0ab41d8
Jun 29 22:38:01 perk11-home kernel: FS: 00007f4f18238640(0000) GS:ffff9ed67ea00000(0000) knlGS:0000000000000000
Jun 29 22:38:01 perk11-home kernel: CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 29 22:38:01 perk11-home kernel: CR2: 00007f4eb4001278 CR3: 00000001b7dc8000 CR4: 0000000000350ef0
Jun 29 22:38:01 perk11-home kernel: Call Trace:
Jun 29 22:38:01 perk11-home kernel: <TASK>
Jun 29 22:38:01 perk11-home kernel: drm_get_last_vbltimestamp+0xaa/0xc0
Jun 29 22:38:01 perk11-home kernel: drm_update_vblank_count+0x8f/0x3a0
Jun 29 22:38:01 perk11-home kernel: drm_vblank_enable+0x14b/0x180
Jun 29 22:38:01 perk11-home kernel: drm_vblank_get+0x97/0xe0
Jun 29 22:38:01 perk11-home kernel: drm_crtc_queue_sequence_ioctl+0xf9/0x2d0
Jun 29 22:38:01 perk11-home kernel: ? drm_ioctl+0x250/0x410
Jun 29 22:38:01 perk11-home kernel: ? drm_crtc_get_sequence_ioctl+0x1a0/0x1a0
Jun 29 22:38:01 perk11-home kernel: drm_ioctl_kernel+0xca/0x170
Jun 29 22:38:01 perk11-home kernel: drm_ioctl+0x22e/0x410
Jun 29 22:38:01 perk11-home kernel: ? drm_crtc_get_sequence_ioctl+0x1a0/0x1a0
Jun 29 22:38:01 perk11-home kernel: amdgpu_drm_ioctl+0x4e/0x80 [amdgpu 87a94d2005d4986ad8a825609f070d6f446ae712]
Jun 29 22:38:01 perk11-home kernel: __x64_sys_ioctl+0x91/0xc0
Jun 29 22:38:01 perk11-home kernel: do_syscall_64+0x5f/0x90
Jun 29 22:38:01 perk11-home kernel: ? exc_page_fault+0x74/0x170
Jun 29 22:38:01 perk11-home kernel: entry_SYSCALL_64_after_hwframe+0x44/0xae
Jun 29 22:38:01 perk11-home kernel: RIP: 0033:0x7f4f2f3077af
Jun 29 22:38:01 perk11-home kernel: Code: 00 48 89 44 24 18 31 c0 48 8d 44 24 60 c7 04 24 10 00 00 00 48 89 44 24 08 48 8d 44 24 20 48 89 44 24 10 b8 10 00 00 00 0f 05 <89> c2 3d 00 f0 ff ff 77 18 48 8b 44 24 18 64 48 2b 04 25 28 00 00
Jun 29 22:38:01 perk11-home kernel: RSP: 002b:00007f4f18237a80 EFLAGS: 00000246 ORIG_RAX: 0000000000000010
Jun 29 22:38:01 perk11-home kernel: RAX: ffffffffffffffda RBX: 0000000000000000 RCX: 00007f4f2f3077af
Jun 29 22:38:01 perk11-home kernel: RDX: 00007f4f18237b10 RSI: 00000000c018643c RDI: 0000000000000060
Jun 29 22:38:01 perk11-home kernel: RBP: 00007f4f18237b10 R08: 0000000000000000 R09: 00007f4eb4000be0
Jun 29 22:38:01 perk11-home kernel: R10: 0000000000004022 R11: 0000000000000246 R12: 00000000c018643c
Jun 29 22:38:01 perk11-home kernel: R13: 0000000000000060 R14: 000055f449aaabd0 R15: 00007f4f18138000
Jun 29 22:38:01 perk11-home kernel: </TASK>
Jun 29 22:38:01 perk11-home kernel: ---[ end trace 0000000000000000 ]---
Jun 29 22:38:01 perk11-home kernel: [drm:dm_vblank_get_counter [amdgpu]] *ERROR* dc_stream_state is NULL for crtc '1'!
Jun 29 22:38:01 perk11-home kernel: [drm:dm_crtc_get_scanoutpos [amdgpu]] *ERROR* dc_stream_state is NULL for crtc '1'!
Jun 29 22:38:01 perk11-home kernel: [drm:dm_vblank_get_counter [amdgpu]] *ERROR* dc_stream_state is NULL for crtc '1'!

The issue still happens with 5.18.9.

The issue has also been reported to mesa and someone bisected it to the following commit: https://gitlab.freedesktop.org/drm/amd/-/issues/1980#note_1400657
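For anyone comparing the traces posted in this report (kernels 5.15.32, 5.17.1 and 5.18.6), here is a minimal sketch that pulls the symbol names out of a pasted call trace and skips frames the kernel marks with `?` as unreliable. The helper `reliable_frames` and the regular expression are illustrative assumptions, not part of any kernel tooling, and the sample is a short excerpt of the 5.17.1 trace. Run on a full log, it also picks up the symbol on the `WARNING:` and `RIP:` lines.

```python
import re

# Matches "symbol+0xoffset/0xsize", optionally preceded by the "?" marker
# that the kernel prints in front of frames it considers unreliable.
FRAME = re.compile(r"(\?\s+)?([A-Za-z_][\w.]*)\+0x[0-9a-f]+/0x[0-9a-f]+")


def reliable_frames(log: str) -> list:
    """Return symbol names of frames not marked '?' (hypothetical helper)."""
    return [m.group(2) for m in FRAME.finditer(log) if not m.group(1)]


# Excerpt of the call trace from the 5.17.1 report.
sample = """
> drm_get_last_vbltimestamp+0xb2/0xc0
> drm_update_vblank_count+0x91/0x3d0
> drm_vblank_enable+0x14b/0x180
> drm_vblank_get+0x95/0xe0
> drm_crtc_queue_sequence_ioctl+0xfd/0x2d0
> ? __check_object_size+0x46/0x140
> ? drm_crtc_get_sequence_ioctl+0x1a0/0x1a0
> drm_ioctl_kernel+0xb8/0x140
"""

frames = reliable_frames(sample)
print(frames)
```

Comparing the resulting lists across the reports should show whether the same path through `drm_crtc_queue_sequence_ioctl` and `drm_vblank_get` is hit each time; the offsets differ between kernel builds, so only the symbol names are kept.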