Bug 204849
Summary: | amdgpu (RX560X) traceboot in dmesg boot output, system instability | ||
---|---|---|---|
Product: | Drivers | Reporter: | Justin Clift (justin) |
Component: | Video(DRI - non Intel) | Assignee: | drivers_video-dri |
Status: | RESOLVED OBSOLETE | ||
Severity: | normal | CC: | andrew.wells1024, postix, robert |
Priority: | P1 | ||
Hardware: | All | ||
OS: | Linux | ||
Kernel Version: | 5.3-rc8 | Subsystem: | |
Regression: | No | Bisected commit-id: |
Description
Justin Clift
2019-09-15 06:17:08 UTC
Hi guys, I am getting a very similar issue with a Ryzen 3400G (AMDGPU). OS Fedora 31: Linux asrock 5.3.16-300.fc31.x86_64 #1 SMP Fri Dec 13 17:59:04 UTC 2019 x86_64 x86_64 x86_64 GNU/Linux The problem manifests itself every time I enter into an Xorg (LXDE) session, I guess one time per CPU core. So I am getting 10-12 errors every time I log in, as popups. Xfce also will not work at all, only LXDE (Openbox) seems to work fine. Operationally all seems relatively OK though, except if I turn off one of my 2 monitors, and turn them back on, I will get the errors again. I dont have a kernel debugger or profiler installed. -------- here is a snippet of a backtrace: ------------- WARNING: CPU: 1 PID: 917 at drivers/gpu/drm/amd/amdgpu/../display/dc/core/dc_link.c:1758 write_i2c_retimer_setting.cold+0xae/0xb5 [amdgpu] Modules linked in: serpent_avx2 serpent_avx_x86_64 serpent_sse2_x86_64 serpent_generic dm_crypt xt_CHECKSUM xt_MASQUERADE nf_nat_tftp nf_conntrack_tftp xt_CT tun bridge stp llc ip6t_REJECT nf_reject_ipv6 ip6t_rpfilter ipt_REJECT nf_reject_ipv4 xt_conntrack ebtable_nat ebtable_broute ip6table_nat ip6table_mangle ip6table_raw ip6table_security iptable_nat nf_nat iptable_mangle iptable_raw iptable_security nf_conntrack nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c ip_set nfnetlink ebtable_filter ebtables ip6table_filter ip6_tables iptable_filter sunrpc snd_hda_codec_realtek snd_hda_codec_generic edac_mce_amd ledtrig_audio snd_hda_codec_hdmi kvm_amd ccp snd_hda_intel snd_hda_codec kvm snd_hda_core snd_hwdep irqbypass snd_seq crct10dif_pclmul snd_seq_device crc32_pclmul snd_pcm joydev wmi_bmof ghash_clmulni_intel snd_timer nvme sp5100_tco nvme_core snd k10temp i2c_piix4 soundcore acpi_cpufreq ip_tables uas usb_storage amdgpu amd_iommu_v2 gpu_sched i2c_algo_bit ttm drm_kms_helper drm crc32c_intel r8169 wmi video pinctrl_amd fuse CPU: 1 PID: 917 Comm: Xorg Tainted: G W 5.3.16-300.fc31.x86_64 #1 Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./A300M-STX, BIOS P3.50 05/15/2019 RIP: 0010:write_i2c_retimer_setting.cold+0xae/0xb5 [amdgpu] Code: 6c f4 a0 c1 0f 0b e9 0e 0d fd ff 48 c7 c7 80 ee 7f c0 e8 59 f4 a0 c1 0f 0b e9 27 0b fd ff 48 c7 c7 80 ee 7f c0 e8 46 f4 a0 c1 <0f> 0b e9 fd 09 fd ff 48 c7 c7 80 ee 7f c0 e8 33 f4 a0 c1 0f 0b e9 RSP: 0018:ffffbe23c190f6a0 EFLAGS: 00010246 RAX: 0000000000000024 RBX: 0000000000000001 RCX: 0000000000000000 RDX: 0000000000000000 RSI: ffff99fc50657908 RDI: ffff99fc50657908 RBP: ffffbe23c190f71c R08: ffff99fc50657908 R09: 000000000000072a R10: 0000000000024924 R11: 0000000000000003 R12: 000000000000005d R13: 0000000000000000 R14: ffff99fc18140648 R15: 0000000000000000 FS: 00007fd0f4484f00(0000) GS:ffff99fc50640000(0000) knlGS:0000000000000000 CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 CR2: 000055800df036e8 CR3: 00000003ef71a000 CR4: 00000000003406e0 Call Trace: -------------------------- (In reply to Robert Hoffmann from comment #1) > Hi guys, > > I am getting a very similar issue with a Ryzen 3400G (AMDGPU). > > OS Fedora 31: Linux asrock 5.3.16-300.fc31.x86_64 #1 SMP Fri Dec 13 17:59:04 > UTC 2019 x86_64 x86_64 x86_64 GNU/Linux > > The problem manifests itself every time I enter into an Xorg (LXDE) session, > I guess one time per CPU core. So I am getting 10-12 errors every time I log > in, as popups. Xfce also will not work at all, only LXDE (Openbox) seems to > work fine. > > Operationally all seems relatively OK though, except if I turn off one of my > 2 monitors, and turn them back on, I will get the errors again. > > > I dont have a kernel debugger or profiler installed. > > -------- here is a snippet of a backtrace: ------------- > WARNING: CPU: 1 PID: 917 at > drivers/gpu/drm/amd/amdgpu/../display/dc/core/dc_link.c:1758 > write_i2c_retimer_setting.cold+0xae/0xb5 [amdgpu] > Modules linked in: serpent_avx2 serpent_avx_x86_64 serpent_sse2_x86_64 > serpent_generic dm_crypt xt_CHECKSUM xt_MASQUERADE nf_nat_tftp > nf_conntrack_tftp xt_CT tun bridge stp llc ip6t_REJECT nf_reject_ipv6 > ip6t_rpfilter ipt_REJECT nf_reject_ipv4 xt_conntrack ebtable_nat > ebtable_broute ip6table_nat ip6table_mangle ip6table_raw ip6table_security > iptable_nat nf_nat iptable_mangle iptable_raw iptable_security nf_conntrack > nf_defrag_ipv6 nf_defrag_ipv4 libcrc32c ip_set nfnetlink ebtable_filter > ebtables ip6table_filter ip6_tables iptable_filter sunrpc > snd_hda_codec_realtek snd_hda_codec_generic edac_mce_amd ledtrig_audio > snd_hda_codec_hdmi kvm_amd ccp snd_hda_intel snd_hda_codec kvm snd_hda_core > snd_hwdep irqbypass snd_seq crct10dif_pclmul snd_seq_device crc32_pclmul > snd_pcm joydev wmi_bmof ghash_clmulni_intel snd_timer nvme sp5100_tco > nvme_core snd k10temp i2c_piix4 soundcore acpi_cpufreq ip_tables uas > usb_storage amdgpu amd_iommu_v2 gpu_sched i2c_algo_bit ttm drm_kms_helper drm > crc32c_intel r8169 wmi video pinctrl_amd fuse > CPU: 1 PID: 917 Comm: Xorg Tainted: G W > 5.3.16-300.fc31.x86_64 #1 > Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./A300M-STX, BIOS > P3.50 05/15/2019 > RIP: 0010:write_i2c_retimer_setting.cold+0xae/0xb5 [amdgpu] > Code: 6c f4 a0 c1 0f 0b e9 0e 0d fd ff 48 c7 c7 80 ee 7f c0 e8 59 f4 a0 c1 > 0f 0b e9 27 0b fd ff 48 c7 c7 80 ee 7f c0 e8 46 f4 a0 c1 <0f> 0b e9 fd 09 fd > ff 48 c7 c7 80 ee 7f c0 e8 33 f4 a0 c1 0f 0b e9 > RSP: 0018:ffffbe23c190f6a0 EFLAGS: 00010246 > RAX: 0000000000000024 RBX: 0000000000000001 RCX: 0000000000000000 > RDX: 0000000000000000 RSI: ffff99fc50657908 RDI: ffff99fc50657908 > RBP: ffffbe23c190f71c R08: ffff99fc50657908 R09: 000000000000072a > R10: 0000000000024924 R11: 0000000000000003 R12: 000000000000005d > R13: 0000000000000000 R14: ffff99fc18140648 R15: 0000000000000000 > FS: 00007fd0f4484f00(0000) GS:ffff99fc50640000(0000) knlGS:0000000000000000 > CS: 0010 DS: 0000 ES: 0000 CR0: 0000000080050033 > CR2: 000055800df036e8 CR3: 00000003ef71a000 CR4: 00000000003406e0 > Call Trace: > -------------------------- Quick update: there is also a bug in Redhat bugzilla, which is directly repated: Bug link: https://bugzilla.redhat.com/show_bug.cgi?id=1786489 BTW, this happens both on the Display Port and the HDMI port. As an extra data point with this, the error in my case only happens when I have an external monitor plugged in via the HDMI port. This is on a laptop, with the error not showing up if only the in-built display is in use. eg the HDMI not even plugged in. Clearly no-one is ever going to look at this, so I'm just going to close it. |