Bug 203855

Summary: System hangs at loading initramfs ASUS KGPE_D16 coreboot - AMD Opteron(tm) Processor 6380
Product: ACPI Reporter: Leon Gehling (leon.gehling)
Component: BIOSAssignee: acpi_bios
Status: NEW ---    
Severity: normal CC: ian, rui.zhang
Priority: P1    
Hardware: x86-64   
OS: Linux   
Kernel Version: 4.19.48 Subsystem:
Regression: No Bisected commit-id:
Attachments: Early Kernel Output with Panic
dmesg with no panic on similar system

Description Leon Gehling 2019-06-09 17:08:39 UTC
My File server is driven by a ASUS KGPE-D16 + Opteron 6380 Motherboard with
coreboot.
After doing an dist-upgrade from stretch to the upcoming Buster release i
experienced an System hang at kernel load with the new kernel. Its a full
freeze right after GRUB loads the kernel.
acpi=off makes the kernel boot, but the system remains unusable as it several
sata controller are not working correctly with acpi disabled.
Booting with the old 4.9.x Kernel as workaround works flawless and keeps my
system usable. But i guess this is nothing desired in the long term.
 
Also see Debian Bugreport: https://www.mail-archive.com/debian-kernel@lists.debian.org/msg115041.html
Maybe this is related to: https://bugzilla.kernel.org/show_bug.cgi?id=93171
Comment 1 Leon Gehling 2019-06-10 21:01:40 UTC
Some output i got with earlyprintk set

Loading Linux 4.19.0-5-amd64 ...Loading Linux 4.19.0-5-amd64 ..
.
Loading initial ramdisk ...Loading initial ramdisk ..
.
[    0.000000] Linux version 4.19.0-5-amd64
(debian-kernel@lists.debian.org) (g)
[    0.000000] Command line: BOOT_IMAGE=/boot/vmlinuz-4.19.0-5-amd64
root=UUID=0
[    0.000000] random: get_random_u32 called from
bsp_init_amd+0x20b/0x2b0 with0
[    0.000000] x86/fpu: Supporting XSAVE feature 0x001: 'x87 floating
point reg'
[    0.000000] x86/fpu: Supporting XSAVE feature 0x002: 'SSE registers'
[    0.000000] x86/fpu: Supporting XSAVE feature 0x004: 'AVX registers'
[    0.000000] x86/fpu: xstate_offset[2]:  576, xstate_sizes[2]:  256
[    0.000000] x86/fpu: Enabled xstate features 0x7, context size is 832
bytes,.
[    0.000000] BIOS-provided physical RAM map:
[    0.000000] BIOS-e820: [mem 0x0000000000000000-0x000000000009fbff] usable
[    0.000000] BIOS-e820: [mem 0x000000000009fc00-0x000000000009ffff]
reserved
[    0.000000] BIOS-e820: [mem 0x00000000000f0000-0x00000000000fffff]
reserved
[    0.000000] BIOS-e820: [mem 0x0000000000100000-0x00000000b7d97fff] usable
[    0.000000] BIOS-e820: [mem 0x00000000b7d98000-0x00000000b7ffffff]
reserved
[    0.000000] BIOS-e820: [mem 0x00000000b8000000-0x00000000bfffafff] usable
[    0.000000] BIOS-e820: [mem 0x00000000bfffb000-0x00000000cfffffff]
reserved
[    0.000000] BIOS-e820: [mem 0x00000000fcf00000-0x00000000fcf03fff]
reserved
[    0.000000] BIOS-e820: [mem 0x00000000feb00000-0x00000000feb00fff]
reserved
[    0.000000] BIOS-e820: [mem 0x00000000fec00000-0x00000000fec00fff]
reserved
[    0.000000] BIOS-e820: [mem 0x00000000fed00000-0x00000000fed00fff]
reserved
[    0.000000] BIOS-e820: [mem 0x00000000fed40000-0x00000000fed44fff]
reserved
[    0.000000] BIOS-e820: [mem 0x0000000100000000-0x0000000437ffffff] usable
[    0.000000] BIOS-e820: [mem 0x0000000438000000-0x000000043fffffff]
reserved
[    0.000000] bootconsole [earlyser0] enabled
[    0.000000] NX (Execute Disable) protection: active
[    0.000000] SMBIOS 2.7 present.
[    0.000000] DMI: ASUS KGPE-D16/KGPE-D16, BIOS 4.9-1859-gf3510cbe36
05/31/2019
[    0.000000] tsc: Fast TSC calibration using PIT
[    0.000000] tsc: Detected 2500.121 MHz processor
[    0.010404] AGP: No AGP bridge found
[    0.013881] last_pfn = 0x438000 max_arch_pfn = 0x400000000
[    0.021404] x86/PAT: Configuration [0-7]: WB  WC  UC- UC  WB  WP  UC-
WT 
Memory KASLR using RDTSC...
[    0.030856] last_pfn = 0xbfffb max_arch_pfn = 0x400000000
[    0.042030] Using GB pages for direct mapping
[    0.046429] RAMDISK: [mem 0x3476f000-0x363aefff]
[    0.050877] ACPI: Early table checksum verification disabled
[    0.056563] ACPI: RSDP 0x00000000000F6250 000024 (v02 COREv4)
[    0.062225] ACPI: XSDT 0x00000000B7D990E0 000074 (v01 COREv4 COREBOOT
000000)
[    0.070720] ACPI: FACP 0x00000000B7D9B9A0 0000F4 (v03 COREv4 COREBOOT
000000)
[    0.079213] ACPI: DSDT 0x00000000B7D99280 00271A (v02 COREv4 COREBOOT
000000)
[    0.087704] ACPI: FACS 0x00000000B7D99240 000040
[    0.092298] ACPI: FACS 0x00000000B7D99240 000040
[    0.096890] ACPI: SSDT 0x00000000B7D9BAA0 0020F2 (v02 COREv4 COREBOOT
000000)
[    0.105383] ACPI: MCFG 0x00000000B7D9DBA0 00003C (v01 COREv4 COREBOOT
000000)
[    0.113875] ACPI: APIC 0x00000000B7D9DBE0 0000DE (v02 COREv4 COREBOOT
000000)
[    0.122369] ACPI: SRAT 0x00000000B7D9DCC0 0001A8 (v01 COREv4 COREBOOT
000000)
[    0.130862] ACPI: SLIT 0x00000000B7D9DE68 000030 (v01 COREv4 COREBOOT
000000)
[    0.139355] ACPI: SRAT 0x00000000B7D9DEA0 0001A8 (v01 COREv4 COREBOOT
000000)
[    0.147848] ACPI: SLIT 0x00000000B7D9E048 000030 (v01 COREv4 COREBOOT
000000)
[    0.156341] ACPI: IVRS 0x00000000B7D9E080 0000BC (v01 COREv4 COREBOOT
000000)
[    0.164834] ACPI: HPET 0x00000000B7D9E140 000038 (v01 COREv4 COREBOOT
000000)
[    0.173380] SRAT: PXM 0 -> APIC 0x00 -> Node 0
[    0.177747] SRAT: PXM 0 -> APIC 0x01 -> Node 0
[    0.182167] SRAT: PXM 0 -> APIC 0x02 -> Node 0
[    0.186586] SRAT: PXM 0 -> APIC 0x03 -> Node 0
[    0.191006] SRAT: PXM 0 -> APIC 0x04 -> Node 0
[    0.195426] SRAT: PXM 0 -> APIC 0x05 -> Node 0
[    0.199845] SRAT: PXM 0 -> APIC 0x06 -> Node 0
[    0.204264] SRAT: PXM 0 -> APIC 0x07 -> Node 0
[    0.208685] SRAT: PXM 1 -> APIC 0x08 -> Node 1
[    0.213104] SRAT: PXM 1 -> APIC 0x09 -> Node 1
[    0.217525] SRAT: PXM 1 -> APIC 0x0a -> Node 1
[    0.221944] SRAT: PXM 1 -> APIC 0x0b -> Node 1
[    0.226364] SRAT: PXM 1 -> APIC 0x0c -> Node 1
[    0.230784] SRAT: PXM 1 -> APIC 0x0d -> Node 1
[    0.235204] SRAT: PXM 1 -> APIC 0x0e -> Node 1
[    0.239624] SRAT: PXM 1 -> APIC 0x0f -> Node 1
[    0.244046] ACPI: SRAT: Node 0 PXM 0 [mem 0x00000000-0x0009ffff]
[    0.250025] ACPI: SRAT: Node 1 PXM 1 [mem 0x00100000-0xbfffffff]
[    0.256004] ACPI: SRAT: Node 1 PXM 1 [mem 0x100000000-0x43fffffff]
[    0.262162] NUMA: Node 1 [mem 0x00100000-0xbfffffff] + [mem
0x100000000-0x43]
[    0.272647] NODE_DATA(1) allocated [mem 0x437ff8000-0x437ffcfff]
[    0.278660] node 0 must be removed before remove section 134
[    0.284272] Zone ranges:
[    0.286769]   DMA      [mem 0x0000000000001000-0x0000000000ffffff]
[    0.292922]   DMA32    [mem 0x0000000001000000-0x00000000ffffffff]
[    0.299075]   Normal   [mem 0x0000000100000000-0x0000000437ffffff]
[    0.305228]   Device   empty
[    0.308088] Movable zone start for each node
[    0.312335] Early memory node ranges
[    0.315888]   node   0: [mem 0x0000000000001000-0x000000000009efff]
[    0.322128]   node   1: [mem 0x0000000000100000-0x00000000b7d97fff]
[    0.328368]   node   1: [mem 0x00000000b8000000-0x00000000bfffafff]
[    0.334607]   node   1: [mem 0x0000000100000000-0x0000000437ffffff]
[    0.340850] Reserved but unavailable: 98 pages
[    0.340852] Initmem setup node 1 [mem
0x0000000000100000-0x0000000437ffffff]
[    0.466889] ACPI: PM-Timer IO Port: 0x820
[    0.470732] ACPI: LAPIC_NMI (acpi_id[0xff] dfl dfl lint[0x1])
[    0.476453] IOAPIC[0]: apic_id 32, version 33, address 0xfec00000,
GSI 0-23
[    0.483376] IOAPIC[1]: apic_id 33, version 33, address 0xfd000000,
GSI 24-55
[    0.490395] ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 dfl dfl)
[    0.496721] ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 9 low level)
[    0.503224] Using ACPI (MADT) for SMP configuration information
[    0.509114] ACPI: HPET id: 0x43538301 base: 0xfed00000
[    0.514232] smpboot: Allowing 16 CPUs, 0 hotplug CPUs
[    0.519255] NODE_DATA(0) allocated [mem 0x00093000-0x00097fff]
[    0.525066] Initmem setup node 0 [mem
0x0000000000001000-0x000000000009efff]
[    0.532110] PM: Registered nosave memory: [mem 0x00000000-0x00000fff]
[    0.538492] PM: Registered nosave memory: [mem 0x0009f000-0x0009ffff]
[    0.544905] PM: Registered nosave memory: [mem 0x000a0000-0x000effff]
[    0.551318] PM: Registered nosave memory: [mem 0x000f0000-0x000fffff]
[    0.557733] PM: Registered nosave memory: [mem 0xb7d98000-0xb7ffffff]
[    0.564146] PM: Registered nosave memory: [mem 0xbfffb000-0xcfffffff]
[    0.570558] PM: Registered nosave memory: [mem 0xd0000000-0xfcefffff]
[    0.576971] PM: Registered nosave memory: [mem 0xfcf00000-0xfcf03fff]
[    0.583384] PM: Registered nosave memory: [mem 0xfcf04000-0xfeafffff]
[    0.589797] PM: Registered nosave memory: [mem 0xfeb00000-0xfeb00fff]
[    0.596209] PM: Registered nosave memory: [mem 0xfeb01000-0xfebfffff]
[    0.602622] PM: Registered nosave memory: [mem 0xfec00000-0xfec00fff]
[    0.609035] PM: Registered nosave memory: [mem 0xfec01000-0xfecfffff]
[    0.615449] PM: Registered nosave memory: [mem 0xfed00000-0xfed00fff]
[    0.621862] PM: Registered nosave memory: [mem 0xfed01000-0xfed3ffff]
[    0.628275] PM: Registered nosave memory: [mem 0xfed40000-0xfed44fff]
[    0.634688] PM: Registered nosave memory: [mem 0xfed45000-0xffffffff]
[    0.641103] [mem 0xd0000000-0xfcefffff] available for PCI devices
[    0.647167] Booting paravirtualized kernel on bare hardware
[    0.652717] clocksource: refined-jiffies: mask: 0xffffffff
max_cycles: 0xfffs
[    0.795517] setup_percpu: NR_CPUS:512 nr_cpumask_bits:512
nr_cpu_ids:16 nr_n2
[    0.804648] percpu: Embedded 44 pages/cpu s143256 r8192 d28776 u262144
[    0.811044] Built 2 zonelists, mobility grouping on.  Total pages:
4095761
[    0.817843] Policy zone: Normal
[    0.820965] Kernel command line:
BOOT_IMAGE=/boot/vmlinuz-4.19.0-5-amd64 roo0
[    0.854959] AGP: Checking aperture...
[    0.866845] AGP: No AGP bridge found
[    0.870252] AGP: Node 0: aperture [bus addr 0xf8000000-0xfbffffff] (64MB)
[    0.877011] AGP: Node 1: aperture [bus addr 0xf8000000-0xfbffffff] (64MB)
[    0.975916] Memory: 16260640K/16643268K available (10252K kernel
code, 1242K)
[    0.989765] BUG: unable to handle kernel NULL pointer dereference at
00000000
[    0.997523] PGD 0 P4D 0
[    1.000037] Oops: 0002 [#1] SMP NOPTI
[    1.003678] CPU: 0 PID: 0 Comm: swapper Not tainted 4.19.0-5-amd64 #1
Debian3
[    1.011562] Hardware name: ASUS KGPE-D16/KGPE-D16, BIOS
4.9-1859-gf3510cbe369
[    1.019541] RIP: 0010:_raw_spin_lock+0xc/0x20
[    1.023869] Code: 01 00 00 75 05 48 89 d8 5b c3 e8 1f f7 9a ff eb f4
66 66 2b
[    1.042588] RSP: 0000:ffffffff91803e08 EFLAGS: 00010046
[    1.047788] RAX: 0000000000000000 RBX: ffff996c80083d80 RCX:
4132840b73c45ab1
[    1.054893] RDX: 0000000000000001 RSI: 0000000000000000 RDI:
0000000000000000
[    1.061999] RBP: ffffffff91803ec8 R08: ffff996c80080000 R09:
ffff996c80083d80
[    1.069107] R10: 0000000000000000 R11: ffff996c80080000 R12:
ffffed5880002000
[    1.076213] R13: 00000000802a0002 R14: 0000000000000001 R15:
ffff996c80083c00
[    1.083320] FS:  0000000000000000(0000) GS:ffff9970a7c00000(0000)
knlGS:00000
[    1.091380] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    1.097098] CR2: 0000000000000000 CR3: 0000000023c0a000 CR4:
00000000000406b0
[    1.104206] Call Trace:
[    1.106638]  deactivate_slab.isra.71+0x2d2/0x630
[    1.111232]  ? bootstrap+0x1f/0x12c
[    1.114693]  ? kmem_cache_alloc+0x1b8/0x1c0
[    1.118854]  bootstrap+0x66/0x12c
[    1.122148]  kmem_cache_init+0x8c/0x145
[    1.125962]  start_kernel+0x2a6/0x52c
[    1.129602]  secondary_startup_64+0xa4/0xb0
[    1.133759] Modules linked in:
[    1.136791] CR2: 0000000000000000
[    1.140094] ---[ end trace a6e00f6a2ecfb185 ]---
[    1.144679] RIP: 0010:_raw_spin_lock+0xc/0x20
[    1.149011] Code: 01 00 00 75 05 48 89 d8 5b c3 e8 1f f7 9a ff eb f4
66 66 2b
[    1.167730] RSP: 0000:ffffffff91803e08 EFLAGS: 00010046
[    1.172929] RAX: 0000000000000000 RBX: ffff996c80083d80 RCX:
4132840b73c45ab1
[    1.180036] RDX: 0000000000000001 RSI: 0000000000000000 RDI:
0000000000000000
[    1.187143] RBP: ffffffff91803ec8 R08: ffff996c80080000 R09:
ffff996c80083d80
[    1.194248] R10: 0000000000000000 R11: ffff996c80080000 R12:
ffffed5880002000
[    1.201355] R13: 00000000802a0002 R14: 0000000000000001 R15:
ffff996c80083c00
[    1.208462] FS:  0000000000000000(0000) GS:ffff9970a7c00000(0000)
knlGS:00000
[    1.216520] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[    1.222241] CR2: 0000000000000000 CR3: 0000000023c0a000 CR4:
00000000000406b0
[    1.229350] Kernel panic - not syncing: Attempted to kill the idle task!
[    1.236028] ---[ end Kernel panic - not syncing: Attempted to kill
the idle -
[    1.243910] PANIC: double fault, error_code: 0xffffffff9087cc95
[    1.249801] CPU: 0 PID: 0 Comm: swapper Tainted: G      D          
4.19.0-53
[    1.259073] Hardware name: ASUS KGPE-D16/KGPE-D16, BIOS
4.9-1859-gf3510cbe369
[    1.267048] RIP: 0246:0x10
[    1.269732] Code: Bad RIP value.
[    1.272939] RSP: 0000:0000000000000000 EFLAGS: ffffffff91803e58
[    1.278832] RAX: 000000000000004a RBX: ffffffff91813700 RCX:
ffffffff9184dda8
[    1.285938] RDX: 0000000000000000 RSI: 0000000000000096 RDI:
0000000000000046
[    1.293045] RBP: ffffffff91803ec8 R08: 00000000000000d2 R09:
2d2d2d5d20216b73
[    1.300151] R10: 74206c6c696b206f R11: 20656c6469206568 R12:
0000000000000000
[    1.307257] R13: 0000000000000000 R14: 0000000000000009 R15:
0000000000000001
[    1.314364] FS:  0000000000000000 GS:  0000000000000000
[    1.319564] Kernel panic - not syncing: Machine halted.
[    1.324770] ---[ end Kernel panic - not syncing: Machine halted. ]---
Comment 2 Leon Gehling 2019-06-10 21:03:04 UTC
Created attachment 283181 [details]
Early Kernel Output with Panic
Comment 3 Ian Kelling 2019-07-04 06:59:03 UTC
Created attachment 283535 [details]
dmesg with no panic on similar system

No kernel panic for me. I'm using an earlier coreboot version, and maybe different memory (I included dmidecode info). What microcode version are you using? I also have a 62xx cpu system I'll be testing on in the next few weeks.
Comment 4 Leon Gehling 2019-07-04 08:38:18 UTC
(In reply to Ian Kelling from comment #3)
> Created attachment 283535 [details]
> dmesg with no panic on similar system
> 
> No kernel panic for me. I'm using an earlier coreboot version, and maybe
> different memory (I included dmidecode info). What microcode version are you
> using? I also have a 62xx cpu system I'll be testing on in the next few
> weeks.

 I am using the latest microcode available for AMD Fam 15h:
microcode: CPU0: patch_level=0x06000852

dmidecode RAM:
 dmidecode 3.2
Getting SMBIOS data from sysfs.
SMBIOS 2.8 present.

Handle 0x0007, DMI type 17, 40 bytes
Memory Device
	Array Handle: 0x0006
	Error Information Handle: Not Provided
	Total Width: 72 bits
	Data Width: 64 bits
	Size: 8192 MB
	Form Factor: DIMM
	Set: None
	Locator: NODE 0 DIMM_C2
	Bank Locator: Not Specified
	Type: DDR3
	Type Detail: Synchronous Registered (Buffered)
	Speed: 667 MT/s
	Manufacturer: Samsung
	Serial Number: 7926BB85
	Asset Tag: Not Specified
	Part Number: M393B1K70DH0-CH9  
	Rank: 2
	Configured Memory Speed: 667 MT/s
	Minimum Voltage: 1.5 V
	Maximum Voltage: 1.5 V
	Configured Voltage: 1.5 V

Handle 0x0008, DMI type 17, 40 bytes
Memory Device
	Array Handle: 0x0006
	Error Information Handle: Not Provided
	Total Width: 72 bits
	Data Width: 64 bits
	Size: 8192 MB
	Form Factor: DIMM
	Set: None
	Locator: NODE 0 DIMM_D2
	Bank Locator: Not Specified
	Type: DDR3
	Type Detail: Synchronous Registered (Buffered)
	Speed: 667 MT/s
	Manufacturer: Hynix/Hyundai
	Serial Number: 393C405F
	Asset Tag: Not Specified
	Part Number: HMT31GR7BFR4C-H9  
	Rank: 2
	Configured Memory Speed: 667 MT/s
	Minimum Voltage: 1.5 V
	Maximum Voltage: 1.5 V
	Configured Voltage: 1.5 V