Bug 207881
Summary: | amdgpu keeps crashing while playing | ||
---|---|---|---|
Product: | Drivers | Reporter: | Artem S. Tashkinov (aros) |
Component: | Video(Other) | Assignee: | drivers_video-other |
Status: | RESOLVED WILL_NOT_FIX | ||
Severity: | blocking | ||
Priority: | P1 | ||
Hardware: | x86-64 | ||
OS: | Linux | ||
Kernel Version: | 5.6.14 | Subsystem: | |
Regression: | No | Bisected commit-id: |
Description
Artem S. Tashkinov
2020-05-25 01:03:09 UTC
Here's a crash while idling on the desktop: gmc_v10_0_process_interrupt: 11 callbacks suppressed amdgpu 0000:09:00.0: [gfxhub] page fault (src_id:0 ring:40 vmid:0 pasid:0, for process pid 0 thread pid 0) amdgpu 0000:09:00.0: in page starting at address 0x0000000000a25000 from client 27 amdgpu 0000:09:00.0: GCVM_L2_PROTECTION_FAULT_STATUS:0x00041C50 amdgpu 0000:09:00.0: MORE_FAULTS: 0x0 amdgpu 0000:09:00.0: WALKER_ERROR: 0x0 amdgpu 0000:09:00.0: PERMISSION_FAULTS: 0x5 amdgpu 0000:09:00.0: MAPPING_ERROR: 0x0 amdgpu 0000:09:00.0: RW: 0x1 [drm:amdgpu_dm_atomic_commit_tail [amdgpu]] *ERROR* Waiting for fences timed out! [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring sdma1 timeout, signaled seq=102394, emitted seq=102396 [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process pid 0 thread pid 0 amdgpu 0000:09:00.0: GPU reset begin! amdgpu 0000:09:00.0: GPU reset succeeded, trying to resume [drm] PCIE GART of 512M enabled (table at 0x0000008000300000). [drm] VRAM is lost due to GPU reset! [drm] PSP is resuming... [drm] reserve 0x900000 from 0x817e400000 for PSP TMR amdgpu 0000:09:00.0: RAS: ras ta ucode is not available amdgpu: [powerplay] SMU is resuming... amdgpu: [powerplay] SMU is resumed successfully! [drm] kiq ring mec 2 pipe 1 q 0 [drm] VCN decode and encode initialized successfully(under DPG Mode). [drm] JPEG decode initialized successfully. amdgpu 0000:09:00.0: ring gfx_0.0.0 uses VM inv eng 0 on hub 0 amdgpu 0000:09:00.0: ring comp_1.0.0 uses VM inv eng 1 on hub 0 amdgpu 0000:09:00.0: ring comp_1.1.0 uses VM inv eng 4 on hub 0 amdgpu 0000:09:00.0: ring comp_1.2.0 uses VM inv eng 5 on hub 0 amdgpu 0000:09:00.0: ring comp_1.3.0 uses VM inv eng 6 on hub 0 amdgpu 0000:09:00.0: ring comp_1.0.1 uses VM inv eng 7 on hub 0 amdgpu 0000:09:00.0: ring comp_1.1.1 uses VM inv eng 8 on hub 0 amdgpu 0000:09:00.0: ring comp_1.2.1 uses VM inv eng 9 on hub 0 amdgpu 0000:09:00.0: ring comp_1.3.1 uses VM inv eng 10 on hub 0 amdgpu 0000:09:00.0: ring kiq_2.1.0 uses VM inv eng 11 on hub 0 amdgpu 0000:09:00.0: ring sdma0 uses VM inv eng 12 on hub 0 amdgpu 0000:09:00.0: ring sdma1 uses VM inv eng 13 on hub 0 amdgpu 0000:09:00.0: ring vcn_dec uses VM inv eng 0 on hub 1 amdgpu 0000:09:00.0: ring vcn_enc0 uses VM inv eng 1 on hub 1 amdgpu 0000:09:00.0: ring vcn_enc1 uses VM inv eng 4 on hub 1 amdgpu 0000:09:00.0: ring jpeg_dec uses VM inv eng 5 on hub 1 [drm] recover vram bo from shadow start [drm] recover vram bo from shadow done [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! amdgpu 0000:09:00.0: GPU reset(1) succeeded! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! snd_hda_intel 0000:09:00.1: azx_get_response timeout, switching to polling mode: last cmd=0x00370100 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x370100 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x370100 snd_hda_intel 0000:09:00.1: spurious response 0x233:0x0, last cmd=0x370100 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x370100 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x370100 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x370100 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x370100 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x370100 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x370100 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x370100 snd_hda_intel 0000:09:00.1: No response from codec, disabling MSI: last cmd=0x00272400 snd_hda_intel 0000:09:00.1: No response from codec, resetting bus: last cmd=0x00272400 snd_hda_intel 0000:09:00.1: azx_get_response timeout, switching to single_cmd mode: last cmd=0x002f2d00 [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx_0.0.0 timeout, signaled seq=4692299, emitted seq=4692301 [drm:amdgpu_job_timedout [amdgpu]] *ERROR* Process information: process Xorg pid 3208 thread Xorg:cs0 pid 3229 amdgpu 0000:09:00.0: GPU reset begin! [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KGQ disable failed [drm:gfx_v10_0_hw_fini [amdgpu]] *ERROR* KCQ disable failed [drm:gfx_v10_0_cp_gfx_enable.isra.0 [amdgpu]] *ERROR* failed to halt cp gfx amdgpu 0000:09:00.0: GPU reset succeeded, trying to resume [drm] PCIE GART of 512M enabled (table at 0x0000008000300000). [drm] VRAM is lost due to GPU reset! [drm] PSP is resuming... [drm] reserve 0x900000 from 0x817e400000 for PSP TMR amdgpu 0000:09:00.0: RAS: ras ta ucode is not available amdgpu: [powerplay] SMU is resuming... amdgpu: [powerplay] SMU is resumed successfully! [drm] kiq ring mec 2 pipe 1 q 0 [drm] VCN decode and encode initialized successfully(under DPG Mode). [drm] JPEG decode initialized successfully. amdgpu 0000:09:00.0: ring gfx_0.0.0 uses VM inv eng 0 on hub 0 amdgpu 0000:09:00.0: ring comp_1.0.0 uses VM inv eng 1 on hub 0 amdgpu 0000:09:00.0: ring comp_1.1.0 uses VM inv eng 4 on hub 0 amdgpu 0000:09:00.0: ring comp_1.2.0 uses VM inv eng 5 on hub 0 amdgpu 0000:09:00.0: ring comp_1.3.0 uses VM inv eng 6 on hub 0 amdgpu 0000:09:00.0: ring comp_1.0.1 uses VM inv eng 7 on hub 0 amdgpu 0000:09:00.0: ring comp_1.1.1 uses VM inv eng 8 on hub 0 amdgpu 0000:09:00.0: ring comp_1.2.1 uses VM inv eng 9 on hub 0 amdgpu 0000:09:00.0: ring comp_1.3.1 uses VM inv eng 10 on hub 0 amdgpu 0000:09:00.0: ring kiq_2.1.0 uses VM inv eng 11 on hub 0 amdgpu 0000:09:00.0: ring sdma0 uses VM inv eng 12 on hub 0 amdgpu 0000:09:00.0: ring sdma1 uses VM inv eng 13 on hub 0 amdgpu 0000:09:00.0: ring vcn_dec uses VM inv eng 0 on hub 1 amdgpu 0000:09:00.0: ring vcn_enc0 uses VM inv eng 1 on hub 1 amdgpu 0000:09:00.0: ring vcn_enc1 uses VM inv eng 4 on hub 1 amdgpu 0000:09:00.0: ring jpeg_dec uses VM inv eng 5 on hub 1 [drm] recover vram bo from shadow start [drm] recover vram bo from shadow done [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! amdgpu 0000:09:00.0: GPU reset(2) succeeded! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! [drm] Skip scheduling IBs! traps: Chrome_IOThread[247153] trap int3 ip:56060e7f8ab5 sp:7ff2ef9ef540 error:0 in chrome[56060bb2c000+7858000] azx_single_wait_for_response: 220 callbacks suppressed snd_hdac_bus_update_rirb: 176 callbacks suppressed snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x370740 snd_hda_intel 0000:09:00.1: spurious response 0x600:0x0, last cmd=0x370740 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x570740 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x770740 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x970740 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0xb70740 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0xd70740 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x377200 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x378901 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x577200 snd_hdac_bus_update_rirb: 423 callbacks suppressed snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x220000 snd_hda_intel 0000:09:00.1: spurious response 0x0:0x0, last cmd=0x220000 A very similar issue is being discussed here: https://askubuntu.com/questions/1240879/rx-5500-xt-ubuntu-20-10-instability-crashing-drmamdgpu-dm-commit-planes-cons I can confirm that the mouse is alive but the entire X.org session (no compositing BTW) crashed hard. And another similar issue: AMD Navi GPU frequent freezes on both Manjaro/Ubuntu with kernel 5.3 and mesa 19.2 -git/llvm9 https://gitlab.freedesktop.org/drm/amd/-/issues/892 Only I'm running Fedora 32 and Linux 5.6.14 vanilla. I sold the card a long time ago, AMD developers never chimed in, closing. |