Bug 219808 - Commit 5009628d8509dbb90e1b88e01eda00430fa24b4b Breaks AMDGPU
Summary: Commit 5009628d8509dbb90e1b88e01eda00430fa24b4b Breaks AMDGPU
Status: RESOLVED ANSWERED
Alias: None
Product: Drivers
Classification: Unclassified
Component: Video(DRI - non Intel) (show other bugs)
Hardware: All Linux
: P3 normal
Assignee: drivers_video-dri
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2025-02-23 18:14 UTC by Kieran
Modified: 2025-02-23 20:39 UTC (History)
0 users

See Also:
Kernel Version:
Subsystem:
Regression: No
Bisected commit-id:


Attachments
Bisect log finding commit 5009628d8509dbb90e1b88e01eda00430fa24b4b breaks my kernel (2.82 KB, text/plain)
2025-02-23 18:14 UTC, Kieran
Details

Description Kieran 2025-02-23 18:14:01 UTC
Created attachment 307704 [details]
Bisect log finding commit 5009628d8509dbb90e1b88e01eda00430fa24b4b breaks my kernel

PROBLEM:
When booting in a kernel after commit 5009628d8509dbb90e1b88e01eda00430fa24b4b was accepted, amdgpu would fail to initialize with message:

kernel: [drm:amdgpu_discove6.12.9-00103-g42ef932bad9bry_set_ip_blocks [amdgpu]] *ERROR* amdgpu_discovery_init failed
kernel: amdgpu 0000:03:00.0: amdgpu: Fatal error during GPU init
kernel: amdgpu 0000:03:00.0: amdgpu: amdgpu: finishing device.
kernel: amdgpu 0000:03:00.0: probe with driver amdgpu failed with error -22

(full boot log can be found http://0x0.st/8cPi.txt)

Performing a bisect yielded the problem commit as 5009628d8509dbb90e1b88e01eda00430fa24b4b. (Bisect log is attached)

KERNEL INFORMATION:
Kernel 6.12.9 was the last working version, and kernel 6.12.10 broke.  The last working compiled kernel is 6.12.9-00103-g42ef932bad9b.  Kernel was compiled with the standard Arch Linux .config.

VER_LINUX OUTPUT:

GNU C               	14.2.1
GNU Make            	4.4.1
Binutils            	2.43.1
Util-linux          	2.40.4
Mount               	2.40.4
Module-init-tools   	33
E2fsprogs           	1.47.2
Jfsutils            	1.1.15
Reiserfsprogs       	3.6.27
PPP                 	2.5.2
Bison               	3.8.2
Flex                	2.6.4
Linux C++ Library   	6.0.33
Dynamic linker (ldd)	2.40
Procps              	4.0.5
Kbd                 	2.7.1
Console-tools       	2.7.1
Sh-utils            	9.5
Udev                	257
Wireless-tools      	30
Modules Loaded      	aesni_intel amd_atl amdgpu amdxcp asus_wmi bluetooth btbcm btintel btmtk btrtl btusb ccm ccp cec cfg80211 crc16 crc32c_generic crc32c_intel crc32_pclmul crct10dif_pclmul cryptd crypto_simd dm_mod drm_buddy drm_display_helper drm_exec drm_suballoc_helper drm_ttm_helper eeepc_wmi ext4 fat gf128mul ghash_clmulni_intel gpio_amdpt gpio_generic gpu_sched hid_generic i2c_algo_bit i2c_piix4 i2c_smbus i8042 intel_rapl_common intel_rapl_msr ip_tables jbd2 k10temp kvm kvm_amd libarc4 libphy loop mac80211 mac_hid mbcache mdio_devres mousedev mt76 mt76_connac_lib mt7925_common mt7925e mt792x_lib nfnetlink nvme nvme_auth nvme_core pcspkr platform_profile polyval_clmulni polyval_generic r8169 radeon rapl realtek rfkill roles serio sha1_ssse3 sha256_ssse3 sha512_ssse3 snd snd_hda_codec snd_hda_codec_generic snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_core snd_hda_intel snd_hda_scodec_component snd_hwdep snd_intel_dspcfg snd_intel_sdw_acpi snd_pcm snd_timer soundcore sp5100_tco sparse_keymap spd5118 thunderbolt ttm typec typec_ucsi uas ucsi_acpi usbhid usb_storage vfat video wmi wmi_bmof x_tables

HARDWARE:
CPU: AMD Ryzen 7 8700F (16) @ 5.054GHz
GPU: AMD ATI Radeon RX 6600

NOTES:
My thinking is that the lines removed from drivers/gpu/drm/amd/display/amdgpu_dm/amdgpu_dm.c are actually need for my hardware and should be added back in.  I tested just writing the deleted lines back into the /gpu/drm/amd/display/amdgpu_dm/amdgpu_dm.c source for a kernel version I knew to be bad, and this solved the problem.
Comment 1 Artem S. Tashkinov 2025-02-23 20:39:47 UTC
Please report here https://gitlab.freedesktop.org/drm/amd/-/issues

Note You need to log in before you can comment on or make changes to this bug.