Bug 105711

Summary: amdgpu: no monitor signal (modprobe amdgpu fails)
Product: Drivers Reporter: Lars W. (polynomial-c)
Component: Video(DRI - non Intel)Assignee: drivers_video-dri
Status: NEW ---    
Severity: normal CC: parker.l.reed
Priority: P1    
Hardware: x86-64   
OS: Linux   
Kernel Version: 4.5 Subsystem:
Regression: No Bisected commit-id:
Attachments: kernel dmesg output
lspci -nnv
kernel-4.3-rc4 config (gzip compressed)
kernel-4.5.0 config (gzip compressed)
kernel dmesg output

Description Lars W. 2015-10-09 17:49:31 UTC
Created attachment 189801 [details]
kernel dmesg output

As soon as the driver gets loaded the monitor looses the signal and doesn't get it back. After a while the kernel panics so even network connections get lost:


[  751.713493] Task dump for CPU 7:
[  751.713496] modprobe        R  running task        0  2994   2968 0x0000000a
[  751.713506]  ffff88063a72ae00 ffffffff818809f9 ffff88063a72ae00 0000000000000007
[  751.713513]  ffff88063a72ae00 0000000000000000 ffff8800a6658000 00000000ffffffff
[  751.713519]  0000000000000000 ffff8800a6658000 ffff880634b20000 ffffffff818814b2
[  751.713525] Call Trace:
[  751.713532]  [<ffffffff818809f9>] ? 0xffffffff818809f9
[  751.713536]  [<ffffffff818814b2>] ? 0xffffffff818814b2
[  751.713539]  [<ffffffff81885180>] ? 0xffffffff81885180
[  751.713543]  [<ffffffffa00232ca>] ? 0xffffffffa00232ca
[  751.713547]  [<ffffffff814f6c8c>] ? 0xffffffff814f6c8c
[  751.713550]  [<ffffffffa005933c>] ? 0xffffffffa005933c
[  751.713553]  [<ffffffffa005c69e>] ? 0xffffffffa005c69e
[  751.713557]  [<ffffffffa0018cba>] ? 0xffffffffa0018cba
[  751.713560]  [<ffffffffa001a5c1>] ? 0xffffffffa001a5c1
[  751.713563]  [<ffffffff815c7893>] ? 0xffffffff815c7893
[  751.713566]  [<ffffffff815c9589>] ? 0xffffffff815c9589
[  751.713569]  [<ffffffff81526b58>] ? 0xffffffff81526b58
[  751.713572]  [<ffffffff81526abf>] ? 0xffffffff81526abf
[  751.713575]  [<ffffffff81527193>] ? 0xffffffff81527193
[  751.713578]  [<ffffffff815e4334>] ? 0xffffffff815e4334
[  751.713581]  [<ffffffff815e458d>] ? 0xffffffff815e458d
[  751.713584]  [<ffffffff815e4500>] ? 0xffffffff815e4500
[  751.713586]  [<ffffffff815e2585>] ? 0xffffffff815e2585
[  751.713589]  [<ffffffff815e3af1>] ? 0xffffffff815e3af1
[  751.713592]  [<ffffffffa0088000>] ? 0xffffffffa0088000
[  751.713595]  [<ffffffff815e4b42>] ? 0xffffffff815e4b42
[  751.713598]  [<ffffffffa0088000>] ? 0xffffffffa0088000
[  751.713601]  [<ffffffff8100036b>] ? 0xffffffff8100036b
[  751.713604]  [<ffffffff8113065c>] ? 0xffffffff8113065c
[  751.713607]  [<ffffffff810fb8d4>] ? 0xffffffff810fb8d4
[  751.713610]  [<ffffffff810f94d0>] ? 0xffffffff810f94d0
[  751.713615]  [<ffffffff810fbe42>] ? 0xffffffff810fbe42
[  751.713618]  [<ffffffff81884697>] ? 0xffffffff81884697
[  751.713626] rcu_preempt kthread starved for 24577 jiffies! g2247 c2246 f0x0 s3 ->state=0x0


Same problem happens with vanilla kernel-4.2.3
Comment 1 Lars W. 2015-10-09 17:52:26 UTC
Created attachment 189811 [details]
lspci -nnv

83:00.0 VGA compatible controller [0300]: Advanced Micro Devices, Inc. [AMD/ATI] Tonga PRO [Radeon R9 285/380] [1002:6939] (prog-if 00 [VGA controller])
        Subsystem: PC Partner Limited / Sapphire Technology Tonga PRO [Radeon R9 285/380] [174b:e306]
        Flags: bus master, fast devsel, latency 0, IRQ 11
        Memory at c0000000 (64-bit, prefetchable) [size=256M]
        Memory at b0e00000 (64-bit, prefetchable) [size=2M]
        I/O ports at 6000 [size=256]
        Memory at b0b40000 (32-bit, non-prefetchable) [size=256K]
        [virtual] Expansion ROM at b0b20000 [disabled] [size=128K]
        Capabilities: [48] Vendor Specific Information: Len=08 <?>
        Capabilities: [50] Power Management version 3
        Capabilities: [58] Express Legacy Endpoint, MSI 00
        Capabilities: [a0] MSI: Enable- Count=1/1 Maskable- 64bit+
        Kernel modules: amdgpu
Comment 2 Lars W. 2015-10-09 17:54:37 UTC
Created attachment 189821 [details]
kernel-4.3-rc4 config (gzip compressed)
Comment 3 Lars W. 2016-03-19 10:43:31 UTC
Still failing with kernel-4.5.0:

[   43.006770] [drm] amdgpu kernel modesetting enabled.
[   43.007422] ACPI: PCI Interrupt Link [LK2E] enabled at IRQ 47
[   43.007760] [drm] initializing kernel modesetting (TONGA 0x1002:0x6939 0x174B:0xE306 0x00).
[   43.007789] [drm] register mmio base: 0xB0B40000
[   43.007792] [drm] register mmio size: 262144
[   43.007804] [drm] doorbell mmio base: 0xB0E00000
[   43.007806] [drm] doorbell mmio size: 2097152
[   43.007818] [drm] probing gen 2 caps for device 10de:377 = 113501/0
[   43.007822] [drm] probing mlw for device 10de:377 = 113501
[   43.007991] ATOM BIOS: E306
[   43.009420] amdgpu 0000:83:00.0: VRAM: 2048M 0x0000000000000000 - 0x000000007FFFFFFF (2048M used)
[   43.009427] amdgpu 0000:83:00.0: GTT: 2048M 0x0000000080000000 - 0x00000000FFFFFFFF
[   43.009431] [drm] Detected VRAM RAM=2048M, BAR=256M
[   43.009434] [drm] RAM width 256bits DDR
[   43.010009] [TTM] Zone  kernel: Available graphics memory: 12376642 kiB
[   43.010016] [TTM] Zone   dma32: Available graphics memory: 2097152 kiB
[   43.010019] [TTM] Initializing pool allocator
[   43.010031] [TTM] Initializing DMA pool allocator
[   43.010078] [drm] amdgpu: 2048M of VRAM memory ready
[   43.010081] [drm] amdgpu: 2048M of GTT memory ready.
[   43.010090] [drm] GART: num cpu pages 524288, num gpu pages 524288
[   43.018292] [drm] PCIE GART of 2048M enabled (table at 0x0000000000040000).
[   43.018309] [drm] Supports vblank timestamp caching Rev 2 (21.10.2013).
[   43.018309] [drm] Driver supports precise vblank timestamp query.
[   43.018348] amdgpu 0000:83:00.0: amdgpu: using MSI.
[   43.018376] [drm] amdgpu: irq initialized.
[   43.018384] Can't find requested voltage id in vdd_dep_on_sclk table!
[   43.022329] amdgpu: powerplay initialized
[   43.023020] [drm] AMDGPU Display Connectors
[   43.023021] [drm] Connector 0:
[   43.023021] [drm]   DP-1
[   43.023022] [drm]   HPD4
[   43.023023] [drm]   DDC: 0x4868 0x4868 0x4869 0x4869 0x486a 0x486a 0x486b 0x486b
[   43.023024] [drm]   Encoders:
[   43.023025] [drm]     DFP1: INTERNAL_UNIPHY1
[   43.023025] [drm] Connector 1:
[   43.023026] [drm]   HDMI-A-1
[   43.023026] [drm]   HPD5
[   43.023028] [drm]   DDC: 0x4870 0x4870 0x4871 0x4871 0x4872 0x4872 0x4873 0x4873
[   43.023028] [drm]   Encoders:
[   43.023029] [drm]     DFP2: INTERNAL_UNIPHY1
[   43.023029] [drm] Connector 2:
[   43.023030] [drm]   DVI-D-1
[   43.023030] [drm]   HPD1
[   43.023031] [drm]   DDC: 0x4878 0x4878 0x4879 0x4879 0x487a 0x487a 0x487b 0x487b
[   43.023032] [drm]   Encoders:
[   43.023033] [drm]     DFP3: INTERNAL_UNIPHY
[   43.023033] [drm] Connector 3:
[   43.023034] [drm]   DVI-I-1
[   43.023034] [drm]   HPD6
[   43.023035] [drm]   DDC: 0x487c 0x487c 0x487d 0x487d 0x487e 0x487e 0x487f 0x487f
[   43.023036] [drm]   Encoders:
[   43.023036] [drm]     DFP4: INTERNAL_UNIPHY2
[   43.023037] [drm]     CRT1: INTERNAL_KLDSCP_DAC1
[   43.023144] amdgpu 0000:83:00.0: fence driver on ring 0 use gpu addr 0x0000000080000008, cpu addr 0xffff880632c0b008
[   43.023549] amdgpu 0000:83:00.0: fence driver on ring 1 use gpu addr 0x0000000080000018, cpu addr 0xffff880632c0b018
[   43.023729] amdgpu 0000:83:00.0: fence driver on ring 2 use gpu addr 0x0000000080000028, cpu addr 0xffff880632c0b028
[   43.023870] amdgpu 0000:83:00.0: fence driver on ring 3 use gpu addr 0x0000000080000038, cpu addr 0xffff880632c0b038
[   43.025305] amdgpu 0000:83:00.0: fence driver on ring 4 use gpu addr 0x0000000080000048, cpu addr 0xffff880632c0b048
[   43.025693] amdgpu 0000:83:00.0: fence driver on ring 5 use gpu addr 0x0000000080000058, cpu addr 0xffff880632c0b058
[   43.026043] amdgpu 0000:83:00.0: fence driver on ring 6 use gpu addr 0x0000000080000068, cpu addr 0xffff880632c0b068
[   43.026248] amdgpu 0000:83:00.0: fence driver on ring 7 use gpu addr 0x0000000080000078, cpu addr 0xffff880632c0b078
[   43.026425] amdgpu 0000:83:00.0: fence driver on ring 8 use gpu addr 0x0000000080000088, cpu addr 0xffff880632c0b088
[   43.026594] amdgpu 0000:83:00.0: fence driver on ring 9 use gpu addr 0x0000000080000098, cpu addr 0xffff880632c0b098
[   43.026683] amdgpu 0000:83:00.0: fence driver on ring 10 use gpu addr 0x00000000800000a8, cpu addr 0xffff880632c0b0a8
[   43.026717] [drm] Found UVD firmware Version: 1.52 Family ID: 10
[   43.027592] amdgpu 0000:83:00.0: fence driver on ring 11 use gpu addr 0x000000000048f7b0, cpu addr 0xffffc9000584e7b0
[   43.027602] [drm] Found VCE firmware Version: 50.17 Binary ID: 3
[   43.027712] amdgpu 0000:83:00.0: fence driver on ring 12 use gpu addr 0x00000000800000c8, cpu addr 0xffff880632c0b0c8
[   43.027760] amdgpu 0000:83:00.0: fence driver on ring 13 use gpu addr 0x00000000800000d8, cpu addr 0xffff880632c0b0d8
[   43.341156] Failed to send Previous Message.
[   43.536492] Failed to send Message.
[   43.731875] Failed to send Previous Message.
[   43.927250] Failed to send Message.
[   44.122619] Failed to send Previous Message.
[   44.317925] Failed to send Message.
[   44.513224] Failed to send Previous Message.
[   44.708523] Failed to send Message.
[   44.903831] Failed to send Previous Message.
[   45.099149] Failed to send Message.
[   45.294455] Failed to send Previous Message.
[   45.489762] Failed to send Message.
[   45.685065] Failed to send Previous Message.
[   45.880361] Failed to send Message.
[   46.075684] Failed to send Previous Message.
[   46.270966] Failed to send Message.
[   46.466281] Failed to send Previous Message.
[   46.661576] Failed to send Message.
[   46.852740] [drm:0xffffffffa0059682] *ERROR* amdgpu: ring 0 test failed (scratch(0xC040)=0xCAFEDEAD)                                                         
[   46.852742] [drm:0xffffffffa00145c5] *ERROR* hw_init 5 failed -22
[   46.852744] amdgpu 0000:83:00.0: amdgpu_init failed
[   46.855013] BUG: unable to handle kernel paging request at 0000000400000000
[   46.855197] IP: [<ffffffffa007c201>] 0xffffffffa007c201
[   46.855338] PGD a72ac067 PUD 0 
[   46.855512] Oops: 0000 [#1] PREEMPT SMP 
[   46.855718] Modules linked in: amdgpu(+) ttm
[   46.855926] CPU: 0 PID: 3003 Comm: modprobe Not tainted 4.5.0 #2
[   46.856032] Hardware name: Tyan Computer Corporation  MCP55 /S2915 , BIOS 4.01d.2915 05/25/2010
[   46.856207] task: ffff88063a545e00 ti: ffff8800a709c000 task.ti: ffff8800a709c000
[   46.856314] RIP: 0010:[<ffffffffa007c201>]  [<ffffffffa007c201>] 0xffffffffa007c201
[   46.856456] RSP: 0018:ffff8800a709fa18  EFLAGS: 00010046
[   46.856561] RAX: ffff88063241b0a0 RBX: ffff880639fde000 RCX: 0000000000000000
[   46.856668] RDX: 00000000c030000c RSI: 0000000000000002 RDI: 0000000400000000
[   46.856775] RBP: ffff8806384f1bf8 R08: ffffffffa007c1a0 R09: ffffffff815dbe3e
[   46.856881] R10: ffff88063f003500 R11: ffff88063f003500 R12: ffff8806384f0000
[   46.856988] R13: ffff8806384f1cc8 R14: ffff8806384f14c4 R15: ffff88063241b080
[   46.857096] FS:  00007fa815841700(0000) GS:ffff880657c00000(0000) knlGS:0000000000000000
[   46.857204] CS:  0010 DS: 0000 ES: 0000 CR0: 000000008005003b
[   46.857309] CR2: 0000000400000000 CR3: 00000000a7385000 CR4: 00000000000006f0
[   46.857414] Stack:
[   46.857519]  0000000000000000 ffffffffa003e2b9 0000000000000282 ffff8806384f0000
[   46.857759]  0000000000000001 ffff8806384f0000 ffff8806384f0f80 ffff880639fdf800
[   46.857999]  0000000000000000 ffffffff815dc5e8 ffff8806384f0000 ffff8806384f0000
[   46.858238] Call Trace:
[   46.858341]  [<ffffffffa003e2b9>] ? 0xffffffffa003e2b9
[   46.858445]  [<ffffffff815dc5e8>] ? 0xffffffff815dc5e8
[   46.858550]  [<ffffffffa003e5da>] ? 0xffffffffa003e5da
[   46.858654]  [<ffffffffa003fdd9>] ? 0xffffffffa003fdd9
[   46.858759]  [<ffffffffa001297f>] ? 0xffffffffa001297f
[   46.858863]  [<ffffffffa00140e8>] ? 0xffffffffa00140e8
[   46.858969]  [<ffffffffa0015c01>] ? 0xffffffffa0015c01
[   46.859073]  [<ffffffff815de813>] ? 0xffffffff815de813
[   46.859176]  [<ffffffff815e0889>] ? 0xffffffff815e0889
[   46.859281]  [<ffffffff8153bf78>] ? 0xffffffff8153bf78
[   46.859384]  [<ffffffff8153bedf>] ? 0xffffffff8153bedf
[   46.859488]  [<ffffffff8153c5b3>] ? 0xffffffff8153c5b3
[   46.859592]  [<ffffffff815fbde3>] ? 0xffffffff815fbde3
[   46.859696]  [<ffffffff815fbfad>] ? 0xffffffff815fbfad
[   46.859799]  [<ffffffff815fbf20>] ? 0xffffffff815fbf20
[   46.859903]  [<ffffffff815f9eb5>] ? 0xffffffff815f9eb5
[   46.860006]  [<ffffffff815fb431>] ? 0xffffffff815fb431
[   46.860110]  [<ffffffffa00d0000>] ? 0xffffffffa00d0000
[   46.860214]  [<ffffffff815fc582>] ? 0xffffffff815fc582
[   46.860317]  [<ffffffffa00d0000>] ? 0xffffffffa00d0000
[   46.860420]  [<ffffffff8100045b>] ? 0xffffffff8100045b
[   46.860526]  [<ffffffff81125378>] ? 0xffffffff81125378
[   46.860631]  [<ffffffff810ee580>] ? 0xffffffff810ee580
[   46.860736]  [<ffffffff810ec250>] ? 0xffffffff810ec250
[   46.860842]  [<ffffffff810eeaf2>] ? 0xffffffff810eeaf2
[   46.860947]  [<ffffffff818a4357>] ? 0xffffffff818a4357
[   46.861053] Code: 4c 8b 07 89 c1 ba 0c 00 30 c0 be 02 00 00 00 41 ff 50 60 31 c0 5b c3 31 c0 5b c3 48 8b 7b 18 85 c9 ba 0c 00 30 c0 be 02 00 00 00 <48> 8b 07 74 11 ff 50 58 48 8b 7b 18 0d 00 00 00 01 4c 8b 07 eb 
[   46.863475] RIP  [<ffffffffa007c201>] 0xffffffffa007c201
[   46.863613]  RSP <ffff8800a709fa18>
[   46.863713] CR2: 0000000400000000
[   46.863817] ---[ end trace b5290aa5486ecc14 ]---
[   46.863922] note: modprobe[3003] exited with preempt_count 1
Comment 4 Lars W. 2016-03-19 10:46:46 UTC
Created attachment 209841 [details]
kernel-4.5.0 config (gzip compressed)
Comment 5 Lars W. 2016-03-19 10:48:32 UTC
Created attachment 209851 [details]
kernel dmesg output
Comment 6 Parker Reed 2016-05-09 03:12:15 UTC
Seems to be the same issue I'm having on bonaire https://bugzilla.kernel.org/show_bug.cgi?id=117151