Bug 208835 - amdgpu fails to resume from suspend
Summary: amdgpu fails to resume from suspend
Status: RESOLVED PATCH_ALREADY_AVAILABLE
Alias: None
Product: Drivers
Classification: Unclassified
Component: Video(DRI - non Intel) (show other bugs)
Hardware: Intel Linux
: P1 normal
Assignee: drivers_video-dri
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2020-08-06 14:27 UTC by sevenever
Modified: 2022-07-08 16:03 UTC (History)
3 users (show)

See Also:
Kernel Version: 5.8.0-050800-generic
Subsystem:
Regression: No
Bisected commit-id:


Attachments
dmesg (111.73 KB, text/plain)
2020-08-06 14:27 UTC, sevenever
Details
systemd journal (28.08 KB, text/plain)
2020-10-30 16:33 UTC, Tom
Details

Description sevenever 2020-08-06 14:27:07 UTC
Created attachment 290797 [details]
dmesg

Computer fails to resume from suspend.
From the logs it looks like AMDGPU fails to resume.

It is a thinkpad T14 AMD APU R7 4750U, connected to an external display via USB-C port thru an adapter.

I can still ssh to the machine after resume action, but the both internal display and external display stuck at black screen.
Comment 1 Alex Deucher 2020-08-06 14:38:43 UTC
Does it work without the external display connected?
Comment 2 sevenever 2020-08-07 14:04:38 UTC
(In reply to Alex Deucher from comment #1)
> Does it work without the external display connected?

Tried several times, Looks like no problem without the external display connected.

btw, I have kernel option amd_iommu=off and vt.handoff=7
Comment 3 Tom 2020-10-30 16:31:21 UTC
I am also affected by this. There are a few recent amdgpu suspend bugs open but this one is most similar.

My card is a Radeon R7 260X (Bonaire XTX [Radeon R7 260X/360]), so I'm using the experimental amdgpu driver support.

5.4.58 works,
5.4.72 does not,
5.9.1 does not.

My computer seems to be completely unresponsive after resuming. I haven't tried SSHing but it does not respond to pressing the power button to put it back into sleep. However there are systemd service logs in the journal after resume.

Kernel messages of note, which are not part of working logs:

18:14:26 amdgpu: [powerplay] 
          failed to send message 5e ret is 0
18:14:26 amdgpu: [powerplay] 
          failed to send message 145 ret is 0
------------------
18:14:26 [drm:uvd_v4_2_start [amdgpu]] *ERROR* UVD not responding, trying to reset the VCPU!!!
18:14:26 [drm:uvd_v4_2_start [amdgpu]] *ERROR* UVD not responding, giving up!!!
18:14:26 [drm:amdgpu_device_ip_set_powergating_state [amdgpu]] *ERROR* set_powergating_state of IP block <uvd_v4_2> failed -1
------------------
18:14:26 amdgpu 0000:01:00.0: [drm:amdgpu_ring_test_helper [amdgpu]] *ERROR* ring uvd test failed (-110)
18:14:26 [drm:amdgpu_device_ip_resume_phase2 [amdgpu]] *ERROR* resume of IP block <uvd_v4_2> failed -110
18:14:26 [drm:amdgpu_device_resume [amdgpu]] *ERROR* amdgpu_device_ip_resume failed (-110).
18:14:26 PM: dpm_run_callback(): pci_pm_resume+0x0/0xe0 returns -110
18:14:26 PM: Device 0000:01:00.0 failed to resume async: error -110
------------------
18:14:27 [drm] Fence fallback timer expired on ring sdma0


Full journalctl from after resume attached.
Comment 4 Tom 2020-10-30 16:33:16 UTC
Created attachment 293321 [details]
systemd journal
Comment 5 Mario Limonciello (AMD) 2022-07-07 20:48:25 UTC
Can you please re-check this on a recent 5.18.y kernel and latest linux-firmware?  I don't believe it should still be an issue.  If it is, I would like to see a new log.
Comment 6 sevenever 2022-07-08 07:27:55 UTC
I don’t hit this issue anymore, looks it has been fixed. Please close. Thank you!

Note You need to log in before you can comment on or make changes to this bug.