Bug 201163

Summary: amdgpu: carrizo: Stalls using vaapi encoder
Product: Drivers
Reporter: Ricardo Ribalda (ricardo.ribalda)
Component: Video(DRI - non Intel)
Assignee: drivers_video-dri
Status: NEW
Severity: normal
CC: alexdeucher, andrey.grodzovsky, jerome, nhaehnle, ricardo.ribalda, yshuiv7
Priority: P1
Hardware: x86-64
OS: Linux
Kernel Version: 4.18.0
Subsystem:
Regression: No
Bisected commit-id:
Attachments: glxinfo, dmesg, trace-cmd, umr

Description Ricardo Ribalda 2018-09-17 12:39:36 UTC
Created attachment 278595
glxinfo

This command:

gst-launch-1.0 v4l2src io-mode=4 ! vaapih264enc ! fakesink dump=1

results in a stall that cannot be interrupted with Ctrl+C, and in the following kernel messages:

[ 1617.944590] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* ring vce0 timeout, last signaled seq=4, last emitted seq=6
[ 1617.944628] [drm] IP block:vce_v3_0 is hung!
[ 1617.944632] [drm] GPU recovery disabled.
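
For triage, the sequence numbers in the timeout line can be pulled out with a short script. The following is a minimal Python sketch, not part of the original report: the regular expression is written against the exact message quoted above (kernel 4.18) and is an assumption for other kernel versions, and `parse_ring_timeout` is a hypothetical helper name.

```python
import re

# Matches the amdgpu job-timeout message as quoted in this report.
# Assumption: other kernel versions may word this line differently.
TIMEOUT_RE = re.compile(
    r"ring (?P<ring>\S+) timeout, "
    r"last signaled seq=(?P<signaled>\d+), "
    r"last emitted seq=(?P<emitted>\d+)"
)


def parse_ring_timeout(line):
    """Return (ring, signaled, emitted, pending), or None if the line does not match."""
    m = TIMEOUT_RE.search(line)
    if m is None:
        return None
    signaled = int(m.group("signaled"))
    emitted = int(m.group("emitted"))
    # pending = fences that were emitted to the ring but never signaled
    return m.group("ring"), signaled, emitted, emitted - signaled


line = (
    "[ 1617.944590] [drm:amdgpu_job_timedout [amdgpu]] *ERROR* "
    "ring vce0 timeout, last signaled seq=4, last emitted seq=6"
)
result = parse_ring_timeout(line)
print(result)
```

For the line above this gives ring `vce0` with two fences emitted but not signaled (seq 5 and 6).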
Comment 1 Ricardo Ribalda 2018-09-17 12:40:01 UTC
Created attachment 278597
dmesg
Comment 2 Ricardo Ribalda 2018-09-17 12:40:33 UTC
Created attachment 278599
trace-cmd
Comment 3 Ricardo Ribalda 2018-09-17 12:40:57 UTC
Created attachment 278601
umr
Comment 4 Ricardo Ribalda 2018-09-17 12:42:45 UTC
Adding umr output. Obtained with these commands:

sudo umr -O verbose,follow_ib -R gfx[0:2047]
sudo umr -O many,bits  -r *.gfx80.mmGRBM_STATUS
sudo umr -O many,bits  -r *.gfx80.HEADER_DUMP
sudo umr -O many,bits  -r *.gfx80.CP_EOP


Adding trace output. Obtained with this command:
trace-cmd record -e dma_fence:* -e amdgpu:* -e gpu_scheduler:*

The dmesg and glxinfo outputs are attached as well.
Comment 5 Ricardo Ribalda 2018-09-17 12:46:16 UTC
# cat /sys/kernel/debug/dri/0/amdgpu_firmware_info
VCE feature version: 0, firmware version: 0x34040300
UVD feature version: 0, firmware version: 0x015b0b00
MC feature version: 0, firmware version: 0x00000000
ME feature version: 46, firmware version: 0x000000a1
PFP feature version: 46, firmware version: 0x000000eb
CE feature version: 46, firmware version: 0x00000086
RLC feature version: 1, firmware version: 0x0000009c
RLC SRLC feature version: 0, firmware version: 0x00000000
RLC SRLG feature version: 0, firmware version: 0x00000000
RLC SRLS feature version: 0, firmware version: 0x00000000
MEC feature version: 46, firmware version: 0x000002c1
MEC2 feature version: 46, firmware version: 0x000002c1
SOS feature version: 0, firmware version: 0x00000000
ASD feature version: 0, firmware version: 0x00000000
SMC feature version: 0, firmware version: 0x00001238
SDMA0 feature version: 0, firmware version: 0x00000022
SDMA1 feature version: 0, firmware version: 0x00000022
VCN feature version: 0, firmware version: 0x00000000
VBIOS version: 113-C75100-028
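
To compare firmware versions between machines, the listing above can be turned into a dictionary. This is a minimal Python sketch, not part of the original report: it assumes the `<name> feature version: N, firmware version: 0x...` line format shown above, `parse_firmware_info` is a hypothetical helper name, and the sample is a subset of the lines in this comment. Treating a zero firmware version as "nothing reported for this block" is also an assumption.

```python
import re

# One line of amdgpu_firmware_info as shown in this report.
# Lines in another format (for example the VBIOS line) are skipped.
FW_RE = re.compile(
    r"^(?P<name>.+?) feature version: (?P<feature>\d+), "
    r"firmware version: 0x(?P<fw>[0-9a-fA-F]+)$"
)


def parse_firmware_info(text):
    """Map block name -> (feature version, firmware version as int)."""
    info = {}
    for raw in text.splitlines():
        m = FW_RE.match(raw.strip())
        if m:
            info[m.group("name")] = (int(m.group("feature")), int(m.group("fw"), 16))
    return info


sample = """VCE feature version: 0, firmware version: 0x34040300
UVD feature version: 0, firmware version: 0x015b0b00
RLC SRLC feature version: 0, firmware version: 0x00000000
VCN feature version: 0, firmware version: 0x00000000
VBIOS version: 113-C75100-028"""

info = parse_firmware_info(sample)
# Blocks that report a nonzero firmware version.
nonzero = sorted(name for name, (_, fw) in info.items() if fw != 0)
print(nonzero)
```

On a live system the same function could be fed the contents of /sys/kernel/debug/dri/0/amdgpu_firmware_info, which typically requires root to read.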