Bug 213053 - WARNING on dcn30_hwseq.c dcn30_set_hubp_blank, AMD Radeon 6700XT
Status: NEW
Alias: None
Product: Drivers
Classification: Unclassified
Component: Video(DRI - non Intel)
Hardware: All Linux
Importance: P1 normal
Assignee: drivers_video-dri
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2021-05-14 00:59 UTC by Andrew Dorney
Modified: 2021-07-22 07:21 UTC
CC List: 4 users

See Also:
Kernel Version: 5.12.3
Subsystem:
Regression: No
Bisected commit-id:


Attachments
Full dmesg 2021-05-13 (93.12 KB, text/plain)
2021-05-14 00:59 UTC, Andrew Dorney
full dmesg (97.06 KB, text/plain)
2021-06-04 06:28 UTC, Erik Badman

Description Andrew Dorney 2021-05-14 00:59:17 UTC
Created attachment 296743
Full dmesg 2021-05-13

I recently managed to snag a Gigabyte "Eagle 12G" Radeon 6700 XT. I am running the latest Debian Testing with a 5.12.3 kernel (config based on the Debian defaults) and the latest linux-firmware from git.

After I installed the card and booted, the kernel logged the warning below to kern.log. The UI is 3D accelerated and there are no errors or warnings in Xorg.log, but there is some graphical corruption and tearing in some games and on KDE's panels, which disappears when the affected object is repainted.

Neither this WARNING nor the corruption occurs when I plug in my previous video card, a Radeon Vega 64.

Please let me know if I can provide more information or test something.



[    5.481569] ------------[ cut here ]------------
[    5.481570] WARNING: CPU: 6 PID: 165 at drivers/gpu/drm/amd/amdgpu/../display/dc/dcn30/dcn30_hwseq.c:959 dcn30_set_hubp_blank+0x214/0x220 [amdgpu]
[    5.481725] Modules linked in: hid_generic(E) usbhid(E) hid(E) sd_mod(E) t10_pi(E) amdgpu(E+) gpu_sched(E) drm_ttm_helper(E) ttm(E) drm_kms_helper(E) xhci_pci(E) ahci(E) cec(E) xhci_pci_renesas(E) mxm_wmi(E) crc32_pclmul(E) crc32c_intel(E) igb(E) libahci(E) xhci_hcd(E) i2c_piix4(E) dca(E) drm(E) libata(E) i2c_algo_bit(E) usbcore(E) scsi_mod(E) wmi(E) gpio_amdpt(E) gpio_generic(E) button(E)
[    5.481736] CPU: 6 PID: 165 Comm: systemd-udevd Tainted: G            E     5.12.3 #69
[    5.481738] Hardware name: System manufacturer System Product Name/PRIME X470-PRO, BIOS 5837 02/01/2021
[    5.481739] RIP: 0010:dcn30_set_hubp_blank+0x214/0x220 [amdgpu]
[    5.481879] Code: 31 f6 48 8b 07 48 8b 40 50 e8 58 c5 ea ed 48 8b 9b d0 02 00 00 48 85 db 75 df eb 8c 0f 0b e9 46 ff ff ff 0f 0b e9 be fe ff ff <0f> 0b e9 42 fe ff ff 0f 1f 44 00 00 0f 1f 44 00 00 41 57 41 56 45
[    5.481880] RSP: 0018:ffffae8d8126b328 EFLAGS: 00010246
[    5.481882] RAX: 0000000000000000 RBX: 0000000000000001 RCX: 0000000000000003
[    5.481882] RDX: 0000000000000000 RSI: 0000000000000000 RDI: ffff9ec790180000
[    5.481883] RBP: ffff9ec791ee01e8 R08: ffffae8d8126b314 R09: ffffae8d8126b270
[    5.481884] R10: 0000000000000002 R11: 343a656e696c2063 R12: ffff9ec791ee01e8
[    5.481884] R13: ffff9ec792980000 R14: 0000000000000000 R15: ffff9ec791ee01e8
[    5.481885] FS:  00007ff744baf8c0(0000) GS:ffff9eca9e980000(0000) knlGS:0000000000000000
[    5.481886] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    5.481887] CR2: 00007ff744b9b50f CR3: 00000001100a2000 CR4: 00000000003506e0
[    5.481888] Call Trace:
[    5.481890]  dcn10_wait_for_mpcc_disconnect+0x116/0x190 [amdgpu]
[    5.482034]  dcn20_plane_atomic_disable+0x3e/0x150 [amdgpu]
[    5.482178]  dcn20_disable_plane+0x24/0x40 [amdgpu]
[    5.482320]  dcn10_init_pipes+0x307/0x3f0 [amdgpu]
[    5.482460]  dce110_enable_accelerated_mode+0x9c/0x260 [amdgpu]
[    5.482599]  dc_commit_state+0x942/0xa40 [amdgpu]
[    5.482736]  amdgpu_dm_atomic_commit_tail+0x585/0x2610 [amdgpu]
[    5.482878]  ? kfree+0xc3/0x410
[    5.482881]  ? dcn30_validate_bandwidth+0x11f/0x270 [amdgpu]
[    5.483019]  ? dcn30_validate_bandwidth+0x11f/0x270 [amdgpu]
[    5.483155]  ? __bpf_trace_amdgpu_dm_plane_state_template+0x10/0x10 [amdgpu]
[    5.483295]  ? dm_plane_helper_prepare_fb+0x1f1/0x250 [amdgpu]
[    5.483434]  ? drm_atomic_helper_setup_commit+0x74d/0x810 [drm_kms_helper]
[    5.483446]  commit_tail+0x94/0x130 [drm_kms_helper]
[    5.483457]  drm_atomic_helper_commit+0x113/0x140 [drm_kms_helper]
[    5.483467]  drm_client_modeset_commit_atomic+0x1e4/0x220 [drm]
[    5.483488]  drm_client_modeset_commit_locked+0x56/0x150 [drm]
[    5.483506]  drm_client_modeset_commit+0x24/0x40 [drm]
[    5.483524]  drm_fb_helper_set_par+0xa5/0xd0 [drm_kms_helper]
[    5.483533]  fbcon_init+0x264/0x4f0
[    5.483536]  visual_init+0xce/0x130
[    5.483539]  do_bind_con_driver.isra.0+0x1db/0x2e0
[    5.483541]  do_take_over_console+0x116/0x180
[    5.483542]  do_fbcon_takeover+0x5c/0xc0
[    5.483543]  register_framebuffer+0x1e4/0x300
[    5.483546]  __drm_fb_helper_initial_config_and_unlock+0x321/0x4a0 [drm_kms_helper]
[    5.483556]  amdgpu_fbdev_init+0xb9/0xf0 [amdgpu]
[    5.483664]  amdgpu_device_init.cold+0x1625/0x1ad5 [amdgpu]
[    5.483805]  ? pci_conf1_read+0xa4/0x100
[    5.483807]  ? pci_bus_read_config_word+0x49/0x70
[    5.483810]  amdgpu_driver_load_kms+0x64/0x260 [amdgpu]
[    5.483919]  amdgpu_pci_probe+0x10a/0x190 [amdgpu]
[    5.484021]  local_pci_probe+0x42/0x80
[    5.484023]  ? __cond_resched+0x16/0x40
[    5.484025]  pci_device_probe+0xfa/0x1b0
[    5.484027]  really_probe+0xf2/0x440
[    5.484029]  driver_probe_device+0xe1/0x150
[    5.484031]  device_driver_attach+0xa1/0xb0
[    5.484032]  __driver_attach+0x8a/0x150
[    5.484034]  ? device_driver_attach+0xb0/0xb0
[    5.484035]  ? device_driver_attach+0xb0/0xb0
[    5.484037]  bus_for_each_dev+0x78/0xc0
[    5.484038]  bus_add_driver+0x12b/0x1e0
[    5.484040]  driver_register+0x8b/0xe0
[    5.484042]  ? 0xffffffffc0f14000
[    5.484043]  do_one_initcall+0x44/0x1d0
[    5.484045]  ? do_init_module+0x23/0x260
[    5.484047]  ? kmem_cache_alloc_trace+0xfd/0x200
[    5.484049]  do_init_module+0x5c/0x260
[    5.484050]  __do_sys_finit_module+0xb1/0x110
[    5.484052]  do_syscall_64+0x33/0x80
[    5.484054]  entry_SYSCALL_64_after_hwframe+0x44/0xae
[    5.484056] RIP: 0033:0x7ff7450689b9
[    5.484057] Code: 00 c3 66 2e 0f 1f 84 00 00 00 00 00 0f 1f 44 00 00 48 89 f8 48 89 f7 48 89 d6 48 89 ca 4d 89 c2 4d 89 c8 4c 8b 4c 24 08 0f 05 <48> 3d 01 f0 ff ff 73 01 c3 48 8b 0d a7 54 0c 00 f7 d8 64 89 01 48
[    5.484058] RSP: 002b:00007ffff1849268 EFLAGS: 00000246 ORIG_RAX: 0000000000000139
[    5.484059] RAX: ffffffffffffffda RBX: 000055a1d6cd79f0 RCX: 00007ff7450689b9
[    5.484060] RDX: 0000000000000000 RSI: 00007ff7451f3e2d RDI: 0000000000000011
[    5.484061] RBP: 0000000000020000 R08: 0000000000000000 R09: 000055a1d6cd4e40
[    5.484061] R10: 0000000000000011 R11: 0000000000000246 R12: 00007ff7451f3e2d
[    5.484062] R13: 0000000000000000 R14: 000055a1d6cd7950 R15: 000055a1d6cd79f0
[    5.484063] ---[ end trace d62e2afb8d38cfd0 ]---
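
For anyone comparing several of these logs, the WARNING header line in the trace above carries the CPU, PID, source file, line number, and function offset. The following is a minimal Python sketch for pulling those fields out. The regular expression and the `parse_warning` helper are illustrative assumptions based only on the line format seen in this report; they are not part of any kernel tooling.

```python
import re

# Matches the WARNING header as it appears in this report. Anything before
# "WARNING:" (a dmesg timestamp or a journald prefix) is ignored by search().
WARNING_RE = re.compile(
    r"WARNING: CPU: (?P<cpu>\d+) PID: (?P<pid>\d+) at (?P<path>\S+):(?P<line>\d+) "
    r"(?P<func>[\w.]+)\+(?P<offset>0x[0-9a-f]+)/(?P<size>0x[0-9a-f]+)"
    r"(?: \[(?P<module>\w+)\])?"
)


def parse_warning(log_line):
    """Return a dict describing a WARNING header, or None if the line does not match."""
    m = WARNING_RE.search(log_line)
    if m is None:
        return None
    info = m.groupdict()
    for key in ("cpu", "pid", "line"):
        info[key] = int(info[key])
    # Keep only the file name; the logged path contains a "../" hop.
    info["file"] = info.pop("path").rsplit("/", 1)[-1]
    return info


sample = (
    "[    5.481570] WARNING: CPU: 6 PID: 165 at "
    "drivers/gpu/drm/amd/amdgpu/../display/dc/dcn30/dcn30_hwseq.c:959 "
    "dcn30_set_hubp_blank+0x214/0x220 [amdgpu]"
)
info = parse_warning(sample)
print(info["file"], info["line"], info["func"], info["offset"])
# prints: dcn30_hwseq.c 959 dcn30_set_hubp_blank 0x214
```

Grouping reports by file, line, and function rather than by offset is probably the safer choice, since the offset differs between kernel builds.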
Comment 1 Erik Badman 2021-06-04 06:28:45 UTC
Created attachment 297145
full dmesg

I see the same warning in dmesg on every boot since the 5.12 kernel, on an AMD 6800 XT.
I think it started somewhere around 5.12.4.
Comment 2 Jonathan Farrugia 2021-06-11 20:05:36 UTC
I can confirm that I also see this message with an RX 6800 (non XT).

It's still present as of kernel 5.12.10.

I tried the earlier kernels 5.12.8 and 5.12.6; the error was present there too.


If it's any help, these are my specs:

Ryzen 5800X
Radeon 6800 (non XT)
Using Latest amdgpu firmware and Mesa 21.1.x


Let me know if there's anything else I can help with.
Comment 3 Alexey 2021-06-15 20:44:37 UTC
Also RX 6800 XT here:

Jun 15 23:05:38 kernel: [drm] fb depth is 24
Jun 15 23:05:38 kernel: [drm]    pitch is 10240
Jun 15 23:05:38 kernel: [drm] REG_WAIT timeout 1us * 100000 tries - mpc2_assert_idle_mpcc line:480
Jun 15 23:05:38 kernel: ------------[ cut here ]------------
Jun 15 23:05:38 kernel: WARNING: CPU: 8 PID: 650 at drivers/gpu/drm/amd/amdgpu/../display/dc/dcn30/dcn30_hwseq.c:959 dcn30_set_hubp_blank+0x20f/0x220 [amdgpu]
Jun 15 23:05:38 kernel: Modules linked in: pcc_cpufreq(-) algif_aead md4 iwlmvm joydev input_leds hid_generic bnep btusb btrtl btbcm btintel bluetooth snd_usb_audio(+) snd_usbmidi_lib snd_rawmidi snd_seq_device mc snd_hda_codec_realte>
Jun 15 23:05:38 kernel: CPU: 8 PID: 650 Comm: kworker/8:2 Tainted: G        W         5.12.10-gentoo-skobkin #1
Jun 15 23:05:38 kernel: Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./X399 Taichi, BIOS P3.90 12/04/2019
Jun 15 23:05:38 kernel: Workqueue: events work_for_cpu_fn
Jun 15 23:05:38 kernel: RIP: 0010:dcn30_set_hubp_blank+0x20f/0x220 [amdgpu]
Jun 15 23:05:38 kernel: Code: 31 f6 48 8b 07 48 8b 40 50 e8 0d e4 f1 d9 48 8b 9b d0 02 00 00 48 85 db 75 df eb 8c 0f 0b e9 46 ff ff ff 0f 0b e9 be fe ff ff <0f> 0b e9 42 fe ff ff 66 2e 0f 1f 84 00 00 00 00 00 41 57 41 56 45
Jun 15 23:05:38 kernel: RSP: 0018:ffffab11127f7480 EFLAGS: 00010246
Jun 15 23:05:38 kernel: RAX: 0000000000000000 RBX: 0000000000000001 RCX: 0000000000000008
Jun 15 23:05:38 kernel: RDX: 0000000000000000 RSI: 0000000000003ab3 RDI: ffff9e964c820000
Jun 15 23:05:38 kernel: RBP: ffff9e9655b601e8 R08: ffffab11127f746c R09: 0000000000000002
Jun 15 23:05:38 kernel: R10: 3038343a656e696c R11: 343a656e696c2063 R12: ffff9e9655b601e8
Jun 15 23:05:38 kernel: R13: ffff9e9655b601e8 R14: ffff9e964f190000 R15: 0000000000000000
Jun 15 23:05:38 kernel: FS:  0000000000000000(0000) GS:ffff9e9e1f400000(0000) knlGS:0000000000000000
Jun 15 23:05:38 kernel: CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Jun 15 23:05:38 kernel: CR2: 00007fceb9544884 CR3: 000000033a810000 CR4: 00000000003506e0
Jun 15 23:05:38 kernel: Call Trace:
Jun 15 23:05:38 kernel:  dcn10_wait_for_mpcc_disconnect+0x10d/0x190 [amdgpu]
Jun 15 23:05:38 kernel:  dcn20_plane_atomic_disable+0x39/0x110 [amdgpu]
Jun 15 23:05:38 kernel:  dcn20_disable_plane+0x1f/0x40 [amdgpu]
Jun 15 23:05:38 kernel:  dcn10_init_pipes+0x300/0x3f0 [amdgpu]
Jun 15 23:05:38 kernel:  dce110_enable_accelerated_mode+0x96/0x260 [amdgpu]
Jun 15 23:05:38 kernel:  dc_commit_state+0x97c/0xa90 [amdgpu]
Jun 15 23:05:38 kernel:  amdgpu_dm_atomic_commit_tail+0x459/0x20a0 [amdgpu]
Jun 15 23:05:38 kernel:  commit_tail+0x8d/0x120 [drm_kms_helper]
Jun 15 23:05:38 kernel:  drm_atomic_helper_commit+0x10e/0x140 [drm_kms_helper]
Jun 15 23:05:38 kernel:  drm_client_modeset_commit_atomic+0x1e3/0x230 [drm]
Jun 15 23:05:38 kernel:  drm_client_modeset_commit_locked+0x51/0x150 [drm]
Jun 15 23:05:38 kernel:  drm_client_modeset_commit+0x1f/0x40 [drm]
Jun 15 23:05:38 kernel:  drm_fb_helper_set_par+0xa0/0xd0 [drm_kms_helper]
Jun 15 23:05:38 kernel:  fbcon_init+0x29d/0x540
Jun 15 23:05:38 kernel:  visual_init+0xc4/0x120
Jun 15 23:05:38 kernel:  do_bind_con_driver.isra.0+0x1bc/0x2c0
Jun 15 23:05:38 kernel:  do_take_over_console+0x110/0x180
Jun 15 23:05:38 kernel:  do_fbcon_takeover+0x52/0xb0
Jun 15 23:05:38 kernel:  register_framebuffer+0x1ef/0x310
Jun 15 23:05:38 kernel:  __drm_fb_helper_initial_config_and_unlock+0x331/0x4a0 [drm_kms_helper]
Jun 15 23:05:38 kernel:  amdgpu_fbdev_init+0xb4/0xf0 [amdgpu]
Jun 15 23:05:38 kernel:  amdgpu_device_init.cold+0x116e/0x18ad [amdgpu]
Jun 15 23:05:38 kernel:  amdgpu_driver_load_kms+0x53/0x190 [amdgpu]
Jun 15 23:05:38 kernel:  amdgpu_pci_probe+0xb7/0x130 [amdgpu]
Jun 15 23:05:38 kernel:  local_pci_probe+0x3d/0x70
Jun 15 23:05:38 kernel:  work_for_cpu_fn+0x11/0x20
Jun 15 23:05:38 kernel:  process_one_work+0x210/0x3b0
Jun 15 23:05:38 kernel:  worker_thread+0x1f4/0x3c0
Jun 15 23:05:38 kernel:  ? rescuer_thread+0x3a0/0x3a0
Jun 15 23:05:38 kernel:  kthread+0x116/0x130
Jun 15 23:05:38 kernel:  ? kthread_associate_blkcg+0xa0/0xa0
Jun 15 23:05:38 kernel:  ret_from_fork+0x22/0x30
Jun 15 23:05:38 kernel: ---[ end trace fe613a36cb72f141 ]---
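
The log above also contains a REG_WAIT timeout line just before the WARNING. Assuming the message means "delay per poll times number of polls" (an interpretation of the log text, not something confirmed in this report), the total time spent waiting can be worked out as in the sketch below. The `reg_wait_budget_ms` helper is a hypothetical name used only for illustration.

```python
import re

# Matches lines such as:
#   [drm] REG_WAIT timeout 1us * 100000 tries - mpc2_assert_idle_mpcc line:480
REG_WAIT_RE = re.compile(
    r"REG_WAIT timeout (?P<delay_us>\d+)us \* (?P<tries>\d+) tries - "
    r"(?P<func>\w+) line:(?P<line>\d+)"
)


def reg_wait_budget_ms(log_line):
    """Return the total polling budget in milliseconds, or None if no match.

    Assumption: the budget is simply delay_us * tries, ignoring any overhead
    of the register reads themselves.
    """
    m = REG_WAIT_RE.search(log_line)
    if m is None:
        return None
    return int(m["delay_us"]) * int(m["tries"]) / 1000.0


sample = (
    "Jun 15 23:05:38 kernel: [drm] REG_WAIT timeout 1us * 100000 tries - "
    "mpc2_assert_idle_mpcc line:480"
)
print(reg_wait_budget_ms(sample))  # 100.0
```

Under that reading, the driver polled for roughly 100 ms in mpc2_assert_idle_mpcc before giving up.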
Comment 4 heuborger 2021-07-22 07:21:47 UTC
For me, this issue disappeared after updating the kernel to 5.13.
I believe this commit reverts the change that introduced this issue:
https://github.com/torvalds/linux/commit/0b7421f0a6a41a8ce60c4dadf6f9e7c62fbd2f1f#diff-80cc88d298a712966f02c4cd7f9eb372b675720a337d0cbe85385ccdfb9c5618
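
To check whether a given machine is on a kernel at or after the version where the warning reportedly went away, a release string can be compared against 5.13. This is a rough Python sketch: the 5.13 threshold rests on the single report in this comment, and `parse_release` and `at_or_after` are hypothetical helper names.

```python
import platform
import re

# Threshold taken from this comment only; it is not a confirmed fix version.
REPORTED_GONE_IN = (5, 13, 0)


def parse_release(release):
    """Turn a release string such as '5.12.10-gentoo-skobkin' into (5, 12, 10).

    Missing minor or patch components are treated as 0, so '5.13' -> (5, 13, 0).
    """
    m = re.match(r"(\d+)(?:\.(\d+))?(?:\.(\d+))?", release)
    if m is None:
        raise ValueError(f"unrecognized kernel release: {release!r}")
    return tuple(int(part) if part is not None else 0 for part in m.groups())


def at_or_after(release, threshold=REPORTED_GONE_IN):
    """Return True if the release is at or after the threshold version."""
    return parse_release(release) >= threshold


print(at_or_after("5.12.10-gentoo-skobkin"))  # False
print(at_or_after("5.13.0"))                  # True

# Check the running kernel (platform.release() returns e.g. '5.12.3' on Linux).
print(platform.release(), at_or_after(platform.release()))
```

Distribution kernels may backport or omit individual commits, so the version number alone is only a hint.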
