Bug 210543

Summary: amdgpu Kernel panic:__ttm_dma_free_page.isra.0+0xac/0xe8 [ttm]
Product: Drivers Reporter: Ancheng (acyellow)
Component: Video(DRI - non Intel)Assignee: drivers_video-dri
Status: NEW ---    
Severity: blocking CC: acyellow, alexdeucher
Priority: P1    
Hardware: ARM   
OS: Linux   
Kernel Version: 4.19 Subsystem:
Regression: No Bisected commit-id:
Attachments: dmesg file
dmesg file1

Description Ancheng 2020-12-08 00:41:22 UTC
When start 16 Android VM to do Android App Monkey test, Linux kernel panic happens which causes the computer restart.
It seems AMDGPU related issue.

GPU device: 0007:01:00.0 VGA compatible controller: Advanced Micro Devices, Inc. [AMD/ATI] Device 67c7



2020-11-19 17:23:35 [ 1483.496680] pstate: 40400009 (nZcv daif +PAN -UAO)
2020-11-19 17:23:35 [ 1483.496687] pc : kfree+0x78/0x1a8
2020-11-19 17:23:35 [ 1483.496697] lr : __ttm_dma_free_page.isra.0+0xac/0xe8 [ttm]
2020-11-19 17:23:35 [ 1483.496697] sp : ffff00001f9138d0
2020-11-19 17:23:35 [ 1483.496698] x29: ffff00001f9138d0 x28: 0000000000000008
2020-11-19 17:23:35 [ 1483.496699] x27: 0000000000000000 x26: ffffc003e1d69d48
2020-11-19 17:23:35 [ 1483.496700] x25: 0000000000000001 x24: 0000000000001000
2020-11-19 17:23:35 [ 1483.496701] x23: ffff8000398340b0 x22: 00000000635f8000
2020-11-19 17:23:35 [ 1483.496702] x21: ffff0000026c9c94 x20: ffff000156085000
2020-11-19 17:23:35 [ 1483.496703] x19: ffff87ff8433fb80 x18: 0000000000000000
2020-11-19 17:23:35 [ 1483.496703] x17: 0000000000000000 x16: 0000000000000000
2020-11-19 17:23:35 [ 1483.496704] x15: 0000000000000000 x14: 0000000000000000
2020-11-19 17:23:35 [ 1483.496704] x13: 0000000000000000 x12: 0000000000000000
2020-11-19 17:23:35 [ 1483.496705] x11: 0000000000000040 x10: ffffc00359470638
2020-11-19 17:23:35 [ 1483.496705] x9 : ffff0000026c9c94 x8 : 0000000000210d00
2020-11-19 17:23:35 [ 1483.496706] x7 : ffffc004e28d4fe0 x6 : ffff7e1ffe10cfc0
2020-11-19 17:23:35 [ 1483.496707] x5 : 0000000000000019 x4 : ffff7e20029703c0
2020-11-19 17:23:35 [ 1483.496708] x3 : ffff88046b5f8d78 x2 : ffff00001f9138d0
2020-11-19 17:23:35 [ 1483.496709] x1 : 000088046286b000 x0 : ffffc004e28d4e78
2020-11-19 17:23:35 [ 1483.496710] Call trace:
2020-11-19 17:23:35 [ 1483.496712]  kfree+0x78/0x1a8
2020-11-19 17:23:35 [ 1483.496715]  __ttm_dma_free_page.isra.0+0xac/0xe8 [ttm]
2020-11-19 17:23:35 [ 1483.496719]  ttm_dma_page_put+0x5c/0x68 [ttm]
2020-11-19 17:23:35 [ 1483.496722]  ttm_dma_unpopulate+0x1f8/0x400 [ttm]
2020-11-19 17:23:35 [ 1483.496804]  amdgpu_ttm_tt_unpopulate+0x70/0xa0 [amdgpu]
2020-11-19 17:23:35 [ 1483.496814]  ttm_tt_unpopulate+0x6c/0x88 [ttm]
2020-11-19 17:23:35 [ 1483.909262]  ttm_tt_destroy.part.0+0x64/0x68 [ttm]
2020-11-19 17:23:35 [ 1483.909266]  ttm_tt_destroy+0x18/0x28 [ttm]
2020-11-19 17:23:35 [ 1483.909269]  ttm_bo_cleanup_memtype_use+0x3c/0x90 [ttm]
2020-11-19 17:23:35 [ 1483.909272]  ttm_bo_cleanup_refs_or_queue+0x1c8/0x200 [ttm]
2020-11-19 17:23:35 [ 1483.909275]  ttm_bo_put+0x80/0xb8 [ttm]
2020-11-19 17:23:35 [ 1483.909345]  amdgpu_bo_unref+0x28/0x38 [amdgpu]
2020-11-19 17:23:35 [ 1483.909406]  amdgpu_gem_object_free+0x3c/0x60 [amdgpu]
Comment 1 Alex Deucher 2020-12-08 16:24:55 UTC
Please attach your full dmesg output.
Comment 2 Ancheng 2020-12-11 06:06:13 UTC
Created attachment 294093 [details]
dmesg file
Comment 3 Ancheng 2020-12-11 06:06:37 UTC
Created attachment 294095 [details]
dmesg file1
Comment 4 Ancheng 2020-12-11 06:09:54 UTC
(In reply to Alex Deucher from comment #1)
> Please attach your full dmesg output.

Thanks for your response, attachment please find the dmesg output. There are some differences in the call trace for each hang.