Bug 5758
Summary: | PANIC: early exception rip 10 error ffffffff805e1e9d cr2 0 on boot with (multi) dual core Opteron | ||
---|---|---|---|
Product: | Platform Specific/Hardware | Reporter: | Mark Williamson (mjw) |
Component: | x86-64 | Assignee: | Andi Kleen (andi-bz) |
Status: | CLOSED CODE_FIX | ||
Severity: | high | CC: | mark.langsdorf |
Priority: | P2 | ||
Hardware: | i386 | ||
OS: | Linux | ||
Kernel Version: | 2.6.15-rc6 | Subsystem: | |
Regression: | --- | Bisected commit-id: | |
Attachments: | Config file for the referenced kernel; System map file for the referenced kernel; Check for bad SRATs not covering all memory. |
Description
Mark Williamson
2005-12-19 03:02:04 UTC
Created attachment 6856 [details]
Config file for the referenced kernel
Created attachment 6857 [details]
System map file for the referenced kernel
Your SRAT table is broken. It reports all nodes as having zero length. Update the BIOS. But it should work with numa=noacpi, right? I will add a test for that particular breakage.

Just tried it with numa=noacpi, and yes you are correct, it does work. Thanks for your guidance on this.

For reference, the BIOS version is H8501180 06/28/2005

It is the latest one here (referred to as "Version: V180"):
http://www.iwill.net/product_2s.asp?p_id=90&tp=BIOS
as of time of bug posting.

N.B even though the date on the website says 2005/7/5 for V180, string dumping the rom:
http://www.iwill.net/product_imgs/90/H8501V180.zip
[root@f02 tmp]# strings H8501180.ROM | grep "/05"
06/28/05
gives the date seen in the bootup screen, hence the date on the website is misleading/incorrect.

Created attachment 6864 [details]
Check for bad SRATs not covering all memory.
With this patch it should work without numa=noacpi. It checks if the SRAT
covers all memory and rejects it if not.
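
The check described above can be illustrated roughly as follows. This is only a sketch of the idea, not the code from the attached patch: the `node_range` type, the function names, and the `slack_bytes` tolerance are all hypothetical and chosen for illustration. It adds up the memory the SRAT assigns to nodes and compares the total with the amount of RAM reported by e820.

```c
#include <stdbool.h>
#include <stddef.h>
#include <stdint.h>

/* Hypothetical type for this sketch: one memory range claimed by a NUMA node. */
struct node_range {
    uint64_t start; /* inclusive, in bytes */
    uint64_t end;   /* exclusive, in bytes */
};

/* Sum the bytes described by the node ranges (assumes ranges do not overlap). */
uint64_t srat_covered_bytes(const struct node_range *nodes, size_t n)
{
    uint64_t total = 0;

    for (size_t i = 0; i < n; i++) {
        /* A zero-length (or inverted) range contributes nothing. */
        if (nodes[i].end > nodes[i].start)
            total += nodes[i].end - nodes[i].start;
    }
    return total;
}

/*
 * Return true if the ranges account for all e820 RAM, allowing for an
 * assumed tolerance of slack_bytes. A caller would ignore the SRAT and
 * fall back to another NUMA discovery method when this returns false.
 */
bool srat_covers_memory(const struct node_range *nodes, size_t n,
                        uint64_t e820_ram_bytes, uint64_t slack_bytes)
{
    return srat_covered_bytes(nodes, n) + slack_bytes >= e820_ram_bytes;
}
```

With a table like the one in this report (every node 0-0, plus a single 0-9fc00 range), the covered total is far below the e820 RAM size, so the table would be rejected.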
Regarding BIOS - yes, Iwill is not very good with them. We actually had
a long conversation with them about this, but it went nowhere in the end.
Mark, FYI. Looks like the official Iwill 8 way BIOS is still broken, but now in a different way than the older ones.

Patch applies and works. Thanks again:

...SNIP....
SRAT: PXM 7 -> APIC 31 -> Node 7
SRAT: Node 0 PXM 0 0-0
SRAT: Node 1 PXM 1 0-0
SRAT: Node 2 PXM 2 0-0
SRAT: Node 3 PXM 3 0-0
SRAT: Node 4 PXM 4 0-0
SRAT: Node 5 PXM 5 0-0
SRAT: Node 6 PXM 6 0-0
SRAT: Node 7 PXM 7 0-0
SRAT: Node 0 PXM 0 0-9fc00
SRAT: PXMs only cover 0MB of your 32767MB e820 RAM. Not used.
SRAT: SRAT not used.
Scanning NUMA topology in Northbridge 24
Number of nodes 8
Node 0 MemBase 0000000000000000 Limit 0000000140000000
Node 1 MemBase 0000000140000000 Limit 0000000240000000
Node 2 MemBase 0000000240000000 Limit 0000000340000000
Node 3 MemBase 0000000340000000 Limit 0000000440000000
Node 4 MemBase 0000000440000000 Limit 0000000540000000
Node 5 MemBase 0000000540000000 Limit 0000000640000000
Node 6 MemBase 0000000640000000 Limit 0000000740000000
Node 7 MemBase 0000000740000000 Limit 0000000840000000
Using 30 for the hash shift.
Using node hash shift of 30
Bootmem setup node 0 0000000000000000-0000000140000000
Bootmem setup node 1 0000000140000000-0000000240000000
Bootmem setup node 2 0000000240000000-0000000340000000
Bootmem setup node 3 0000000340000000-0000000440000000
Bootmem setup node 4 0000000440000000-0000000540000000
Bootmem setup node 5 0000000540000000-0000000640000000
Bootmem setup node 6 0000000640000000-0000000740000000
Bootmem setup node 7 0000000740000000-0000000840000000
...SNIP....

I'll bring it up with the relevant teams.
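
As a side note on the "Using node hash shift of 30" line in the log above: every node boundary reported by the Northbridge scan is a multiple of 1 GiB (2^30 bytes), so a physical address can be mapped to its node by shifting it right by 30 bits and indexing a small table. The following is a minimal sketch of that lookup using the limits from the log. The function and array names are hypothetical, and this is not claimed to be the kernel's actual code.

```c
#include <stdint.h>

#define NODE_HASH_SHIFT 30 /* from "Using node hash shift of 30" */
#define NUM_NODES 8
/* Top of memory in the log is 0x840000000, i.e. 33 chunks of 1 GiB. */
#define MAP_ENTRIES (0x840000000ULL >> NODE_HASH_SHIFT)

/* Exclusive upper limit of each node, copied from the "Limit" column. */
static const uint64_t node_limit[NUM_NODES] = {
    0x140000000ULL, 0x240000000ULL, 0x340000000ULL, 0x440000000ULL,
    0x540000000ULL, 0x640000000ULL, 0x740000000ULL, 0x840000000ULL,
};

/* One entry per 1 GiB chunk of physical address space. */
static int8_t node_map[MAP_ENTRIES];

/* Fill the table: each 1 GiB chunk records the node that owns it. */
void build_node_map(void)
{
    int node = 0;

    for (uint64_t i = 0; i < MAP_ENTRIES; i++) {
        uint64_t addr = i << NODE_HASH_SHIFT;

        /* Advance to the node whose limit lies above this chunk. */
        while (node < NUM_NODES - 1 && addr >= node_limit[node])
            node++;
        node_map[i] = (int8_t)node;
    }
}

/* Map a physical address to its node, or -1 if it is above all nodes. */
int addr_to_node(uint64_t addr)
{
    uint64_t idx = addr >> NODE_HASH_SHIFT;

    if (idx >= MAP_ENTRIES)
        return -1;
    return node_map[idx];
}
```

After calling `build_node_map()`, an address such as 0x140000000 (the start of node 1 in the log) resolves to node 1 with a single shift and array lookup.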