Bug 216673

Summary: Recurring amdgpu freeze on kernel 6.0.6 only
Product: Drivers Reporter: Stanislav Modrak (stanislav.modrak)
Component: Video(DRI - non Intel)Assignee: drivers_video-dri
Status: RESOLVED ANSWERED    
Severity: high CC: alexdeucher, micheledellaguardia
Priority: P1    
Hardware: AMD   
OS: Linux   
Kernel Version: 6.0.6-060006-generic Subsystem:
Regression: No Bisected commit-id:
Attachments: log messages

Description Stanislav Modrak 2022-11-08 17:21:03 UTC
Created attachment 303150 [details]
log messages

The monitor goes blank at 17:00:34 system time (the screen froze for a moment before going blank completely, showing this timestamp) and no response from the system with keyboard commands is visible. Forced to hard reboot Recurring issue, only on kernel 6.0.6.

Attaching the full log file here.

Identified the log at 17:00:34 exactly. There are multiple logs at this timestamp, however, showing just one.

```
Time: 17:00:34
Message: amdgpu 0000:06:00.0: amdgpu: [mmhub0] no-retry page fault (src_id:0 ring:40 vmid:4 pasid:32777, for process todoist pid 26652 thread todoist:cs0 pid 26658)
Kernel Device: +pci:0000:06:00.0
Priority: 3
```
Comment 2 Artem S. Tashkinov 2022-11-09 09:13:35 UTC
https://gitlab.freedesktop.org is where it should be anyways.
Comment 3 Stanislav Modrak 2022-11-09 13:08:34 UTC
(In reply to Artem S. Tashkinov from comment #2)
> https://gitlab.freedesktop.org is where it should be anyways.

Can you please explain why it belongs there and not here? Thx!
Comment 4 Alex Deucher 2022-11-09 14:27:30 UTC
(In reply to Stanislav Modrak from comment #3)
> (In reply to Artem S. Tashkinov from comment #2)
> > https://gitlab.freedesktop.org is where it should be anyways.
> 
> Can you please explain why it belongs there and not here? Thx!

That is where most GPU developers are and it also allows us to move bugs to other components when necessary.  E.g., a mesa or xorg bug is misfiled as kernel.
Comment 5 Michele Della Guardia 2022-11-10 14:42:39 UTC
I've got the same issue on 6.0.6 and 6.0.7 kernel, but found that the problem has already been discussed and addressed here:

https://gitlab.freedesktop.org/drm/amd/-/issues/2113

I think fixes have been already committed so we should find them in the next kernel release.
Regards
Comment 6 Michele Della Guardia 2022-11-11 10:30:04 UTC
Update: downloaded kernel 6.0.8, applied patches found here:

https://gitlab.freedesktop.org/drm/amd/-/issues/2113

and compiled. That problem seems gone. Hope to find patches applied on next kernel release.

Regards
Comment 7 Stanislav Modrak 2022-11-11 11:30:18 UTC
(In reply to Michele Della Guardia from comment #6)
> Update: downloaded kernel 6.0.8, applied patches found here:
> 
> https://gitlab.freedesktop.org/drm/amd/-/issues/2113
> 
> and compiled. That problem seems gone. Hope to find patches applied on next
> kernel release.
> 
> Regards

Thanks! I've also noticed the issue and proposed patches. Waiting for it to be included in an upcoming release.