Bug 216359 - [amdgpu] ring gfx timeout after waking from suspend and exiting X
Summary: [amdgpu] ring gfx timeout after waking from suspend and exiting X
Status: RESOLVED ANSWERED
Alias: None
Product: Drivers
Classification: Unclassified
Component: Video(DRI - non Intel) (show other bugs)
Hardware: All Linux
: P1 normal
Assignee: drivers_video-dri
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2022-08-14 05:55 UTC by Shlomo
Modified: 2022-08-16 16:55 UTC (History)
1 user (show)

See Also:
Kernel Version: 5.18.13, 5.19.1
Subsystem:
Regression: Yes
Bisected commit-id:


Attachments

Description Shlomo 2022-08-14 05:55:52 UTC
After waking up from suspend and exiting X (and trying to switch to a different tty, but I think the bug happens before that), I get:

Aug 14 08:27:21 up kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, but soft recovered
Aug 14 08:27:31 up kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, but soft recovered
Aug 14 08:27:42 up kernel: [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring gfx timeout, but soft recovered

The screen freezes, and I have to reboot to get a stable system.

Arch Linux 5.18.12.arch1-1 is good.
Arch Linux 5.18.14.arch1-1 is bad.
Arch Linux 5.18.15.arch1-1 is bad.
Arch Linux 5.18.16.arch1-1 is bad.
Arch Linux 5.19.1.arch2-1 is bad.

4ffcacab7145080187330accafae69e87a481eec is the first bad commit
commit 4ffcacab7145080187330accafae69e87a481eec
Author: Alex Deucher <alexander.deucher@amd.com>
Date:   Tue Jun 21 10:10:37 2022 -0400

    drm/amdgpu/display: disable prefer_shadow for generic fb helpers
    
    [ Upstream commit 3a4b1cc28fbdc2325b3e3ed7d8024995a75f9216 ]
    
    Seems to break hibernation.  Disable for now until we can root
    cause it.
    
    Fixes: 087451f372bf ("drm/amdgpu: use generic fb helpers instead of setting up AMD own's.")
    Bug: https://bugzilla.kernel.org/show_bug.cgi?id=216119
    Acked-by: Evan Quan <evan.quan@amd.com>
    Signed-off-by: Alex Deucher <alexander.deucher@amd.com>
    Signed-off-by: Sasha Levin <sashal@kernel.org>

 drivers/gpu/drm/amd/amdgpu/amdgpu_vkms.c          | 3 ++-
 drivers/gpu/drm/amd/amdgpu/dce_v10_0.c            | 3 ++-
 drivers/gpu/drm/amd/amdgpu/dce_v11_0.c            | 3 ++-
 drivers/gpu/drm/amd/amdgpu/dce_v6_0.c             | 3 ++-
 drivers/gpu/drm/amd/amdgpu/dce_v8_0.c             | 3 ++-
 drivers/gpu/drm/amd/display/amdgpu_dm/amdgpu_dm.c | 3 ++-
 6 files changed, 12 insertions(+), 6 deletions(-)
Comment 1 Shlomo 2022-08-16 05:45:03 UTC
Reposted on GitLab:

https://gitlab.freedesktop.org/drm/amd/-/issues/2124
Comment 2 Shlomo 2022-08-16 16:55:36 UTC
Moved to GitLab.

Note You need to log in before you can comment on or make changes to this bug.