Most recent kernel where this bug did not occur: 2.6.16.20 Distribution: Custom distribution based on RedHat 6.2 (glibc2.3 though) Hardware Environment: SuperMicro SuperServer 6013L-8 Intel dual Xeon with HyperThreading Problem Description: Since 2.6.17(.-1 t/m .6) booting fails on a SuperMicro SuperServer 6013L-8, with dual Intel Xeon with HyperThreading. The machine fails during booting right after the message 'PCI: Probing PCI hardware' This did NOT happen with 2.6.16.20 and lower. It also happens with 2.6.18, 2.6.18-rc2, 2.6.18-rc1-mm2 It does NOT happen with 2.6.18-rc1-mm2 when SMP is NOT compiled in the kernel. Booting with nosmp still triggers the problem though. I've managed to track down where the problem occurs, and it's on drivers/pci/probe.c -> function pci_scan_bus_parented() And then specifically, this line : b->subordinate = pci_scan_child_bus(b); When I modify this in 'b->subordinate = 2;' ... the Oops does not occur. But then it will crash due to not found harddisk devices (SCSI @ Adaptec dpt_i2o), which probably is the result of the static '2'. I have various boot messages with Oopses available, I'll attach a few of them. Steps to reproduce: Boot a system like this with a 2.6.17/2.6.18 kernel, and it will crash when pci_scan_child_bus() is called within function pci_scan_bus_parented()
Linux version 2.6.18-rc1-mm2 (root@xxx) (gcc version 3.3.2 20031022 (Red Hat Linux 3.3.2-1)) #2 SMP Thu Jul 20 10:54:16 MEST 2006 BIOS-provided physical RAM map: sanitize start sanitize end copy_e820_map() start: 0000000000000000 size: 000000000009f400 end: 000000000009 f400 type: 1 copy_e820_map() type is E820_RAM add_memory_region(0000000000000000, 000000000009f400, 1) copy_e820_map() start: 000000000009f400 size: 0000000000000400 end: 000000000009 f800 type: 2 add_memory_region(000000000009f400, 0000000000000400, 2) copy_e820_map() start: 00000000000e0000 size: 0000000000020000 end: 000000000010 0000 type: 2 add_memory_region(00000000000e0000, 0000000000020000, 2) copy_e820_map() start: 0000000000100000 size: 00000000e0000000 end: 00000000e010 0000 type: 1 copy_e820_map() type is E820_RAM add_memory_region(0000000000100000, 00000000e0000000, 1) copy_e820_map() start: 00000000fec00000 size: 0000000000004000 end: 00000000fec0 4000 type: 2 add_memory_region(00000000fec00000, 0000000000004000, 2) copy_e820_map() start: 00000000fee00000 size: 0000000000001000 end: 00000000fee0 1000 type: 2 add_memory_region(00000000fee00000, 0000000000001000, 2) copy_e820_map() start: 00000000fff80000 size: 0000000000080000 end: 000000010000 0000 type: 2 add_memory_region(00000000fff80000, 0000000000080000, 2) BIOS-e820: 0000000000000000 - 000000000009f400 (usable) BIOS-e820: 000000000009f400 - 000000000009f800 (reserved) BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved) BIOS-e820: 0000000000100000 - 00000000e0100000 (usable) BIOS-e820: 00000000fec00000 - 00000000fec04000 (reserved) BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved) BIOS-e820: 00000000fff80000 - 0000000100000000 (reserved) 2689MB HIGHMEM available. 896MB LOWMEM available. found SMP MP-table at 000ff780 DMI 2.3 present. ACPI: Unable to locate RSDP Intel MultiProcessor Specification v1.4 Virtual Wire compatibility mode. OEM ID: AMI Product ID: GCHE APIC at: 0xFEE00000 Processor #0 15:2 APIC version 20 Processor #6 15:2 APIC version 20 I/O APIC #8 Version 17 at 0xFEC00000. I/O APIC #9 Version 17 at 0xFEC01000. I/O APIC #10 Version 17 at 0xFEC02000. I/O APIC #11 Version 17 at 0xFEC03000. Enabling APIC mode: Flat. Using 4 I/O APICs Processors: 2 Allocating PCI resources starting at e2000000 (gap: e0100000:1eb00000) Detected 3056.704 MHz processor. Built 1 zonelists. Total pages: 917760 Kernel command line: BOOT_IMAGE=linux-new ro root=801 console=tty0 console=ttyS0 ,9600n8 nosmp Enabling fast FPU save and restore... done. Enabling unmasked SIMD FPU exception support... done. Initializing CPU#0 CPU 0 irqstacks, hard=c0674000 soft=c066c000 PID hash table entries: 4096 (order: 12, 16384 bytes) Console: colour VGA+ 80x25 Dentry cache hash table entries: 131072 (order: 7, 524288 bytes) Inode-cache hash table entries: 65536 (order: 6, 262144 bytes) Memory: 3634132k/3671040k available (3717k kernel code, 35688k reserved, 1495k d ata, 292k init, 2753536k highmem) Checking if this processor honours the WP bit even in supervisor mode... Ok. Calibrating delay using timer specific routine.. 6117.41 BogoMIPS (lpj=12234821) Security Framework v1.0.0 initialized Capability LSM initialized Mount-cache hash table entries: 512 CPU: Trace cache: 12K uops, L1 D cache: 8K CPU: L2 cache: 512K CPU: Hyper-Threading is disabled Compat vDSO mapped to ffffe000. Checking 'hlt' instruction... OK. Freeing SMP alternatives: 20k freed CPU0: Intel(R) Xeon(TM) CPU 3.06GHz stepping 09 SMP mode deactivated, forcing use of dummy APIC emulation. Brought up 1 CPUs migration_cost=0 NET: Registered protocol family 16 PCI: PCI BIOS revision 2.10 entry at 0xfdb51, last bus=2 Setting up standard PCI resources ACPI: Interpreter disabled. SCSI subsystem initialized PCI: Probing PCI hardware BUG: unable to handle kernel paging request at virtual address 2b20c012 printing eip: 2b20c012 *pde = 00000000 Oops: 0000 [#1] 4K_STACKS SMP last sysfs file: CPU: 0 EIP: 0060:[<2b20c012>] Not tainted VLI EFLAGS: 00010047 (2.6.18-rc1-mm2 #2) EIP is at 0x2b20c012 eax: 80008004 ebx: 00000080 ecx: 00000000 edx: 00000cfe esi: c05bff01 edi: 00000006 ebp: c2d30ea0 esp: c2d30e3c ds: 007b es: 007b ss: 0068 Process swapper (pid: 1, ti=c2d30000 task=c2d2fab0 task.ti=c2d30000) Stack: 00930002 0000b109 c00fdc8a dba00dac 0046c00f c03fc340 00000060 00000000 00000006 00000246 c2d87a00 000000d0 c03fded6 00000000 00000000 00000080 00000006 00000002 c2d30ea0 c01ff5f7 c2d87a00 00000080 00000006 00000002 Call Trace: [<c03fc340>] pci_bios_read+0x8a/0xbe [<c03fded6>] pci_read+0x22/0x26 [<c01ff5f7>] pci_bus_read_config_word+0x39/0x5c [<c020137d>] __pci_bus_find_cap+0x1f/0x5b [<c02013d1>] pci_find_capability+0x18/0x1c [<c0200b14>] pci_cfg_space_size+0xe/0x6f [<c0200ca7>] pci_scan_device+0x12d/0x18d [<c0200d95>] pci_scan_single_device+0x10/0x28 [<c0200dc4>] pci_scan_slot+0x17/0x4b [<c0200e10>] pci_scan_child_bus+0x18/0x79 [<c020108d>] pci_scan_bus_parented+0x25/0x30 [<c03fe0af>] pcibios_scan_root+0x46/0x50 [<c063f534>] pci_legacy_init+0x35/0x4f [<c061e831>] do_initcalls+0x55/0xe8 [<c0185071>] proc_mkdir+0x12/0x16 [<c013656f>] init_irq_proc+0x21/0x2e [<c01003cd>] init+0x0/0xe3 [<c010041a>] init+0x4d/0xe3 [<c0100ae9>] kernel_thread_helper+0x5/0xb Code: Bad EIP value. EIP: [<2b20c012>] 0x2b20c012 SS:ESP 0068:c2d30e3c <0>Kernel panic - not syncing: Attempted to kill init!
Linux version 2.6.17.6 (root@xxx) (gcc version 3.3.2 20031022 (Red Hat Linux 3 .3.2-1)) #4 SMP Wed Jul 19 16:26:54 MEST 2006 BIOS-provided physical RAM map: BIOS-e820: 0000000000000000 - 000000000009f400 (usable) BIOS-e820: 000000000009f400 - 000000000009f800 (reserved) BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved) BIOS-e820: 0000000000100000 - 00000000e0100000 (usable) BIOS-e820: 00000000fec00000 - 00000000fec04000 (reserved) . BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved) BIOS-e820: 00000000fff80000 - 0000000100000000 (reserved) 2689MB HIGHMEM available. 896MB LOWMEM available. found SMP MP-table at 000ff780 DMI 2.3 present. ACPI: Unable to locate RSDP Intel MultiProcessor Specification v1.4 Virtual Wire compatibility mode. OEM ID: AMI Product ID: GCHE APIC at: 0xFEE00000 Processor #0 15:2 APIC version 20 Processor #1 15:2 APIC version 20 Processor #6 15:2 APIC version 20 Processor #7 15:2 APIC version 20 I/O APIC #8 Version 17 at 0xFEC00000. I/O APIC #9 Version 17 at 0xFEC01000. I/O APIC #10 Version 17 at 0xFEC02000. I/O APIC #11 Version 17 at 0xFEC03000. Enabling APIC mode: Flat. Using 4 I/O APICs Processors: 4 Allocating PCI resources starting at e2000000 (gap: e0100000:1eb00000) Built 1 zonelists Kernel command line: auto BOOT_IMAGE=linux-new ro root=801 console=tty0 console= ttyS0,9600n8 Enabling fast FPU save and restore... done. Enabling unmasked SIMD FPU exception support... done. Initializing CPU#0 PID hash table entries: 4096 (order: 12, 16384 bytes) Detected 3057.318 MHz processor. Using tsc for high-res timesource Console: colour VGA+ 80x25 Dentry cache hash table entries: 131072 (order: 7, 524288 bytes) Inode-cache hash table entries: 65536 (order: 6, 262144 bytes) Memory: 3634144k/3671040k available (3684k kernel code, 35712k reserved, 1574k d ata, 312k init, 2753536k highmem) Checking if this processor honours the WP bit even in supervisor mode... Ok. Calibrating delay using timer specific routine.. 6124.33 BogoMIPS (lpj=12248663) Security Framework v1.0.0 initialized Capability LSM initialized Mount-cache hash table entries: 512 CPU: Trace cache: 12K uops, L1 D cache: 8K CPU: L2 cache: 512K CPU: Physical Processor ID: 0 Checking 'hlt' instruction... OK. Freeing SMP alternatives: 24k freed CPU0: Intel(R) Xeon(TM) CPU 3.06GHz stepping 09 Booting processor 1/1 eip 2000 Initializing CPU#1 Calibrating delay using timer specific routine.. 6113.43 BogoMIPS (lpj=12226870) CPU: Trace cache: 12K uops, L1 D cache: 8K CPU: L2 cache: 512K CPU: Physical Processor ID: 0 CPU1: Intel(R) Xeon(TM) CPU 3.06GHz stepping 09 Booting processor 2/6 eip 2000 Initializing CPU#2 Calibrating delay using timer specific routine.. 6113.40 BogoMIPS (lpj=12226815) CPU: Trace cache: 12K uops, L1 D cache: 8K CPU: L2 cache: 512K CPU: Physical Processor ID: 3 CPU2: Intel(R) Xeon(TM) CPU 3.06GHz stepping 09 Booting processor 3/7 eip 2000 Initializing CPU#3 Calibrating delay using timer specific routine.. 6113.59 BogoMIPS (lpj=12227196) CPU: Trace cache: 12K uops, L1 D cache: 8K CPU: L2 cache: 512K CPU: Physical Processor ID: 3 CPU3: Intel(R) Xeon(TM) CPU 3.06GHz stepping 09 Total of 4 processors activated (24464.77 BogoMIPS). ExtINT not setup in hardware but reported by MP table ENABLING IO-APIC IRQs ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=0 pin2=0 ..MP-BIOS bug: 8254 timer not connected to IO-APIC ...trying to set up timer (IRQ0) through the 8259A ... ..... (found pin 0) ...works. checking TSC synchronization across 4 CPUs: passed. Brought up 4 CPUs migration_cost=84,288 NET: Registered protocol family 16 PCI: PCI BIOS revision 2.10 entry at 0xfdb51, last bus=2 Setting up standard PCI resources ACPI: Subsystem revision 20060127 ACPI: Interpreter disabled. SCSI subsystem initialized PCI: Probing PCI hardware PCI: Ignoring BAR0-3 of IDE controller 0000:00:0f.1 BUG: unable to handle kernel paging request at virtual address 00030000 printing eip: 00030000 *pde = 0067b001 Oops: 0000 [#1] SMP CPU: 1 EIP: 0060:[<00030000>] Not tainted VLI EFLAGS: 00010047 (2.6.17.6 #4) EIP is at 0x30000 eax: 80008004 ebx: 00000080 ecx: 00000000 edx: 00000cfe esi: c05bff01 edi: 00000006 ebp: c2d41e98 esp: c2d41e34 ds: 007b es: 007b ss: 0068 Process swapper (pid: 1, threadinfo=c2d40000 task=c2d39a70) Stack: 00930002 0000b109 c00fdc8a dba0f06c 0046c00f c04010d3 00000060 00000000 00000006 00000246 c2d56e00 000000d0 c0402c9e 00000000 00000000 00000080 00000006 00000002 c2d41e98 c01f6a47 c2d56e00 00000080 00000006 00000002 Call Trace: <c04010d3> pci_bios_read+0x8a/0xbe <c0402c9e> pci_read+0x22/0x26 <c01f6a47> pci_bus_read_config_word+0x39/0x5c <c01f85c9> __pci_bus_find_cap+0x 1f/0x5b <c01f861d> pci_find_capability+0x18/0x1c <c01f7de8> pci_cfg_space_size+0xe/0x6 f <c01f7f7b> pci_scan_device+0x12d/0x186 <c01f805a> pci_scan_single_device+0x10/ 0x28 <c01f8089> pci_scan_slot+0x17/0x4b <c01f80d5> pci_scan_child_bus+0x18/0x79 <c01f832e> pci_scan_bus_parented+0x25/0x2f <c0402e3d> pcibios_scan_root+0x46/0 x50 <c0650f04> pci_legacy_init+0x35/0x4f <c062c801> do_initcalls+0x55/0xe8 <c0180000> proc_task_lookup+0xcc/0xe2 <c0134017> init_irq_proc+0x21/0x2e <c0100348> init+0x4b/0x13f <c01002fd> init+0x0/0x13f <c0100e41> kernel_thread_helper+0x5/0xb Code: Bad EIP value. EIP: [<00030000>] 0x30000 SS:ESP 0068:c2d41e34 <0>Kernel panic - not syncing: Attempted to kill init! BUG: warning at arch/i386/kernel/smp.c:537/smp_call_function() <c010c3c6> smp_call_function+0x54/0x117 <c011c32f> printk+0xe/0x11 <c0111149> bust_spinlocks+0x3d/0x46 <c011bb39> panic+0x36/0xd8 <c010c4c6> smp_send_stop+0x10/0x1b <c010c489> stop_this_cpu+0x0/0x2d <c011bb41> panic+0x3e/0xd8 <c011dfb5> do_exit+0x6c/0x2ef <c01038c8> die+0x1d3/0x1db <c01118c0> do_page_fault+0x535/0x62b <c0129fb3> __rcu_process_callbacks+0x91/0xa7 <c013852d> buffered_rmqueue+0x12d /0x148 <c011138b> do_page_fault+0x0/0x62b <c01031ab> error_code+0x4f/0x54 <c04010d3> pci_bios_read+0x8a/0xbe <c0402c9e> pci_read+0x22/0x26 <c01f6a47> pci_bus_read_config_word+0x39/0x5c <c01f85c9> __pci_bus_find_cap+0x 1f/0x5b <c01f861d> pci_find_capability+0x18/0x1c <c01f7de8> pci_cfg_space_size+0xe/0x6 f <c01f7f7b> pci_scan_device+0x12d/0x186 <c01f805a> pci_scan_single_device+0x10/ 0x28 <c01f8089> pci_scan_slot+0x17/0x4b <c01f80d5> pci_scan_child_bus+0x18/0x79 <c01f832e> pci_scan_bus_parented+0x25/0x2f <c0402e3d> pcibios_scan_root+0x46/0 x50 <c0650f04> pci_legacy_init+0x35/0x4f <c062c801> do_initcalls+0x55/0xe8 <c0180000> proc_task_lookup+0xcc/0xe2 <c0134017> init_irq_proc+0x21/0x2e <c0100348> init+0x4b/0x13f <c01002fd> init+0x0/0x13f <c0100e41> kernel_thread_helper+0x5/0xb
Here an output of 2.6.18-rc1-mm2 with SMP disabled in the kernel. This boots correctly : ----------------------------------------------------------------- Linux version 2.6.18-rc1-mm2-WideXS-0019 (root@install01.dmz.widexs.nl) (gcc ver sion 3.3.2 20031022 (Red Hat Linux 3.3.2-1)) #1 Thu Jul 20 10:31:05 MEST 2006 BIOS-provided physical RAM map: sanitize start sanitize end copy_e820_map() start: 0000000000000000 size: 000000000009f400 end: 000000000009 f400 type: 1 copy_e820_map() type is E820_RAM add_memory_region(0000000000000000, 000000000009f400, 1) copy_e820_map() start: 000000000009f400 size: 0000000000000400 end: 000000000009 f800 type: 2 add_memory_region(000000000009f400, 0000000000000400, 2) copy_e820_map() start: 00000000000e0000 size: 0000000000020000 end: 000000000010 0000 type: 2 add_memory_region(00000000000e0000, 0000000000020000, 2) copy_e820_map() start: 0000000000100000 size: 00000000e0000000 end: 00000000e010 0000 type: 1 copy_e820_map() type is E820_RAM add_memory_region(0000000000100000, 00000000e0000000, 1) copy_e820_map() start: 00000000fec00000 size: 0000000000004000 end: 00000000fec0 4000 type: 2 add_memory_region(00000000fec00000, 0000000000004000, 2) copy_e820_map() start: 00000000fee00000 size: 0000000000001000 end: 00000000fee0 1000 type: 2 add_memory_region(00000000fee00000, 0000000000001000, 2) copy_e820_map() start: 00000000fff80000 size: 0000000000080000 end: 000000010000 0000 type: 2 add_memory_region(00000000fff80000, 0000000000080000, 2) BIOS-e820: 0000000000000000 - 000000000009f400 (usable) BIOS-e820: 000000000009f400 - 000000000009f800 (reserved) BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved) BIOS-e820: 0000000000100000 - 00000000e0100000 (usable) BIOS-e820: 00000000fec00000 - 00000000fec04000 (reserved) BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved) BIOS-e820: 00000000fff80000 - 0000000100000000 (reserved) 2689MB HIGHMEM available. 896MB LOWMEM available. DMI 2.3 present. ACPI: Unable to locate RSDP Allocating PCI resources starting at e2000000 (gap: e0100000:1eb00000) Detected 3056.744 MHz processor. Built 1 zonelists. Total pages: 917760 Kernel command line: auto BOOT_IMAGE=linux-current ro root=801 console=tty0 cons ole=ttyS0,9600n8 Enabling fast FPU save and restore... done. Enabling unmasked SIMD FPU exception support... done. Initializing CPU#0 CPU 0 irqstacks, hard=c0630000 soft=c062f000 PID hash table entries: 4096 (order: 12, 16384 bytes) Console: colour VGA+ 80x25 Dentry cache hash table entries: 131072 (order: 7, 524288 bytes) Inode-cache hash table entries: 65536 (order: 6, 262144 bytes) Memory: 3634624k/3671040k available (3604k kernel code, 35268k reserved, 1434k d ata, 244k init, 2753536k highmem) Checking if this processor honours the WP bit even in supervisor mode... Ok. Calibrating delay using timer specific routine.. 6117.10 BogoMIPS (lpj=12234202) Security Framework v1.0.0 initialized Capability LSM initialized Mount-cache hash table entries: 512 CPU: Trace cache: 12K uops, L1 D cache: 8K CPU: L2 cache: 512K Compat vDSO mapped to ffffe000. CPU: Intel(R) Xeon(TM) CPU 3.06GHz stepping 09 Checking 'hlt' instruction... OK. NET: Registered protocol family 16 PCI: PCI BIOS revision 2.10 entry at 0xfdb51, last bus=2 Setting up standard PCI resources ACPI: Interpreter disabled. SCSI subsystem initialized PCI: Probing PCI hardware PCI: Discovered peer bus 01 PCI: Discovered peer bus 02 NET: Registered protocol family 2 IP route cache hash table entries: 131072 (order: 7, 524288 bytes) TCP established hash table entries: 524288 (order: 9, 2097152 bytes) TCP bind hash table entries: 65536 (order: 6, 262144 bytes) TCP: Hash tables configured (established 524288 bind 65536) TCP reno registered IA-32 Microcode Update Driver: v1.14a <tigran@veritas.com> highmem bounce pool size: 64 pages VFS: Disk quotas dquot_6.5.1 Dquot-cache hash table entries: 1024 (order 0, 4096 bytes) Installing knfsd (copyright (C) 1996 okir@monad.swb.de). Initializing Cryptographic API io scheduler noop registered io scheduler anticipatory registered (default) io scheduler deadline registered io scheduler cfq registered rivafb_setup START atyfb: using auxiliary register aperture atyfb: 3D RAGE XL (Mach64 GR, PCI-33) [0x4752 rev 0x27] atyfb: Mach64 BIOS is located at c0000, mapped at c00c0000. atyfb: BIOS frequency table: atyfb: PCLK_min_freq 984, PCLK_max_freq 23500, ref_freq 1432, ref_divider 31 atyfb: MCLK_pwd 4200, MCLK_max_freq 8000, XCLK_max_freq 6000, SCLK_freq 5000 atyfb: BIOS contains driver information table. atyfb: 8M SDRAM (2:1) (32-bit), 14.31818 MHz XTAL, 235 MHz PLL, 80 Mhz MCLK, 60 MHz XCLK mtrr: type mismatch for fd000000,1000000 old: write-back new: write-combining Console: switching to colour frame buffer device 80x30 atyfb: fb0: ATY Mach64 frame buffer device on PCI cyblafb: CyblaFB version 0.62 initializing tridentfb: Trident framebuffer 0.7.8-NEWAPI initializing hgafb: HGA card not detected. hgafb: probe of hgafb.0 failed with error -22 vga16fb: mapped to 0xc00a0000 fb1: VGA16 VGA frame buffer device fb2: Virtual frame buffer device, using 1024K of video memory Real Time Clock Driver v1.12ac Linux agpgart interface v0.101 (c) Dave Jones intelfb: Framebuffer driver for Intel(R) 830M/845G/852GM/855GM/865G/915G/915GM/9 45G/945GM chipsets intelfb: Version 0.9.4 Serial: 8250/16550 driver $Revision: 1.90 $ 4 ports, IRQ sharing disabled serial8250: ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A serial8250: ttyS1 at I/O 0x2f8 (irq = 3) is a 16550A Floppy drive(s): fd0 is 1.44M FDC 0 is a National Semiconductor PC87306 RAMDISK driver initialized: 1 RAM disks of 32768K size 1024 blocksize loop: loaded (max 8 devices) Compaq SMART2 Driver (v 2.6.0) HP CISS Driver (v 2.6.10) nbd: registered device at major 43 Intel(R) PRO/1000 Network Driver - version 7.1.9-k2-NAPI Copyright (c) 1999-2006 Intel Corporation. Ethernet Channel Bonding Driver: v3.0.3 (March 23, 2006) bonding: Warning: either miimon or arp_interval and arp_ip_target module paramet ers must be specified, otherwise bonding will not detect link failures! see bond ing.txt for details. e100: Intel(R) PRO/100 Network Driver, 3.5.10-k2-NAPI e100: Copyright(c) 1999-2005 Intel Corporation ns83820.c: National Semiconductor DP83820 10/100/1000 driver. tg3.c:v3.62 (June 30, 2006) eth0: Tigon3 [partno(BCM95704A6) rev 2002 PHY(5704)] (PCIX:133MHz:64-bit) 10/100 /1000BaseT Ethernet 00:30:48:53:8c:2a eth0: RXcsums[1] LinkChgREG[0] MIirq[0] ASF[0] Split[0] WireSpeed[1] TSOcap[1] eth0: dma_rwctrl[769f4000] dma_mask[64-bit] eth1: Tigon3 [partno(BCM95704A6) rev 2002 PHY(5704)] (PCIX:133MHz:64-bit) 10/100 /1000BaseT Ethernet 00:30:48:53:8c:2b eth1: RXcsums[1] LinkChgREG[0] MIirq[0] ASF[0] Split[0] WireSpeed[1] TSOcap[1] eth1: dma_rwctrl[769f4000] dma_mask[64-bit] Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2 ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx Loading Adaptec I2O RAID: Version 2.4 Build 5go Detecting Adaptec I2O RAID controllers... Adaptec I2O RAID controller 0 irq=10 BAR0 f9200000 - size= 100000 BAR1 f9380000 - size= 1000000 dpti: If you have a lot of devices this could take a few minutes. dpti0: Reading the hardware resource table. TID 008 Vendor: ADAPTEC Device: AIC-7902 Rev: 00000001 TID 009 Vendor: ADAPTEC Device: AIC-7902 Rev: 00000001 TID 515 Vendor: SUPER G Device: GEM318 Rev: 0 0 TID 518 Vendor: ADAPTEC R Device: RAID-1 Rev: 3B0AD scsi0 : Vendor: Adaptec Model: 2015S FW:3B0A Vendor: ADAPTEC Model: RAID-1 Rev: 3B0A Type: Direct-Access ANSI SCSI revision: 02 Vendor: SUPER Model: GEM318 Rev: 0 Type: Processor ANSI SCSI revision: 02 Adaptec aacraid driver (1.1-5[2409]-mh2) NCR53c406a: no available ports found sym53c416.c: Version 1.0.0-ac QLogic Fibre Channel HBA Driver megaraid cmm: 2.20.2.6 (Release Date: Mon Mar 7 00:01:03 EST 2005) megaraid: 2.20.4.8 (Release Date: Mon Apr 11 12:27:22 EST 2006) GDT-HA: Storage RAID Controller Driver. Version: 3.05 GDT-HA: Found 0 PCI Storage RAID Controllers SCSI device sda: 143665152 512-byte hdwr sectors (73557 MB) sda: Write Protect is off SCSI device sda: drive cache: write back w/ FUA SCSI device sda: 143665152 512-byte hdwr sectors (73557 MB) sda: Write Protect is off SCSI device sda: drive cache: write back w/ FUA sda: sda1 sda2 sda3 sda4 sd 0:0:0:0: Attached scsi disk sda serio: i8042 AUX port at 0x60,0x64 irq 12 serio: i8042 KBD port at 0x60,0x64 irq 1 mice: PS/2 mouse device common for all mice I2O subsystem v1.325 i2o: max drivers = 8 i2o: Checking for PCI I2O controllers... iop0: controller found (0000:01:03.0) iop0: using 64-bit DMA PCI: Unable to reserve mem region #1:100000@fea00000 for device 0000:01:03.0 iop0: device already claimed iop0: DMA / IO allocation for I2O controller failed I2O Configuration OSM v1.323 I2O Bus Adapter OSM v1.317 I2O Block Device OSM v1.325 I2O SCSI Peripheral OSM v1.316 I2O ProcFS OSM v1.316 md: raid0 personality registered for level 0 md: raid1 personality registered for level 1 md: md driver 0.90.3 MAX_MD_DEVS=256, MD_SB_DISKS=27 md: bitmap version 4.39 device-mapper: ioctl: 4.8.0-ioctl (2006-06-24) initialised: dm-devel@redhat.com Netfilter messages via NETLINK v0.30. IPv4 over IPv4 tunneling driver GRE over IPv4 tunneling driver ip_tables: (C) 2000-2006 Netfilter Core Team input: AT Translated Set 2 keyboard as /class/input/input0 arp_tables: (C) 2002 David S. Miller TCP bic registered NET: Registered protocol family 1 NET: Registered protocol family 10 IPv6 over IPv4 tunneling driver ip6_tables: (C) 2000-2006 Netfilter Core Team NET: Registered protocol family 17 Bridge firewalling registered Ebtables v2.0 registered ebt_ulog: not logging via ulog since somebody else already registered for PF_BRI DGE 802.1Q VLAN Support v1.8 Ben Greear <greearb@candelatech.com> All bugs added by David S. Miller <davem@redhat.com> Using IPI Shortcut mode Freeing unused kernel memory: 244k freed Time: tsc clocksource has been installed. argc == 7 md: Autodetecting RAID arrays. argv[0]: "/inimd: autorun ... t" argv[1]: "md: ... autorun DONE. auto" argv[2]: "BOOT_IMAGE=likjournald starting. Commit interval 5 seconds nux-current" EXT3-fs: mounted filesystem with ordered data mode. argv[3]: "ro" argv[4]: "root=801" argv[5]:input: PS/2 Generic Mouse as /class/input/input1 "console=tty0" argv[6]: "console=ttyS0,9600n8" Running ipconfig kinit: do_mounts kinit: name_to_dev_t(801) = sda1(8,1) kinit: root_dev = sda1(8,1) kinit: /dev/root appears to be a ext3 filesystem kinit: trying to mount /dev/root on /root with type ext3 kinit: Mounted root (ext3 filesystem) readonly. Checking for init: /sbin/init INIT: version 2.78 booting
Here a crash of 2.6.18-rc2 : ---------------------------- Linux version 2.6.18-rc2 (root@xxx) (gcc version 3.3.2 20031022 (Red Hat Linux 3.3.2-1)) #3 SMP Thu Jul 20 15:30:34 MEST 2006 BIOS-provided physical RAM map: BIOS-e820: 0000000000000000 - 000000000009f400 (usable) BIOS-e820: 000000000009f400 - 000000000009f800 (reserved) BIOS-e820: 00000000000e0000 - 0000000000100000 (reserved) BIOS-e820: 0000000000100000 - 00000000e0100000 (usable) BIOS-e820: 00000000fec00000 - 00000000fec04000 (reserved) BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved) BIOS-e820: 00000000fff80000 - 0000000100000000 (reserved) 2689MB HIGHMEM available. 896MB LOWMEM available. found SMP MP-table at 000ff780 DMI 2.3 present. ACPI: Unable to locate RSDP Intel MultiProcessor Specification v1.4 Virtual Wire compatibility mode. OEM ID: AMI Product ID: GCHE APIC at: 0xFEE00000 Processor #0 15:2 APIC version 20 Processor #6 15:2 APIC version 20 I/O APIC #8 Version 17 at 0xFEC00000. I/O APIC #9 Version 17 at 0xFEC01000. I/O APIC #10 Version 17 at 0xFEC02000. I/O APIC #11 Version 17 at 0xFEC03000. Enabling APIC mode: Flat. Using 4 I/O APICs Processors: 2 Allocating PCI resources starting at e2000000 (gap: e0100000:1eb00000) Detected 3056.756 MHz processor. Built 1 zonelists. Total pages: 917760 Kernel command line: auto BOOT_IMAGE=linux-new ro root=801 console=tty0 console= ttyS0,9600n8 Enabling fast FPU save and restore... done. Enabling unmasked SIMD FPU exception support... done. Initializing CPU#0 PID hash table entries: 4096 (order: 12, 16384 bytes) Console: colour VGA+ 80x25 Dentry cache hash table entries: 131072 (order: 7, 524288 bytes) Inode-cache hash table entries: 65536 (order: 6, 262144 bytes) Memory: 3635116k/3671040k available (3174k kernel code, 34704k reserved, 1283k d ata, 272k init, 2753536k highmem) Checking if this processor honours the WP bit even in supervisor mode... Ok. Calibrating delay using timer specific routine.. 6117.38 BogoMIPS (lpj=12234768) Security Framework v1.0.0 initialized Capability LSM initialized Mount-cache hash table entries: 512 CPU: Trace cache: 12K uops, L1 D cache: 8K CPU: L2 cache: 512K CPU: Hyper-Threading is disabled Compat vDSO mapped to ffffe000. Checking 'hlt' instruction... OK. Freeing SMP alternatives: 20k freed CPU0: Intel(R) Xeon(TM) CPU 3.06GHz stepping 09 Booting processor 1/6 eip 2000 Initializing CPU#1 Calibrating delay using timer specific routine.. 6113.50 BogoMIPS (lpj=12227003) CPU: Trace cache: 12K uops, L1 D cache: 8K CPU: L2 cache: 512K CPU: Hyper-Threading is disabled CPU1: Intel(R) Xeon(TM) CPU 3.06GHz stepping 09 Total of 2 processors activated (12230.88 BogoMIPS). ExtINT not setup in hardware but reported by MP table ENABLING IO-APIC IRQs ..TIMER: vector=0x31 apic1=0 pin1=2 apic2=0 pin2=0 ..MP-BIOS bug: 8254 timer not connected to IO-APIC ...trying to set up timer (IRQ0) through the 8259A ... ..... (found pin 0) ...works. checking TSC synchronization across 2 CPUs: passed. Brought up 2 CPUs migration_cost=277 NET: Registered protocol family 16 PCI: PCI BIOS revision 2.10 entry at 0xfdb51, last bus=2 Setting up standard PCI resources ACPI: Interpreter disabled. SCSI subsystem initialized PCI: Probing PCI hardware BUG: unable to handle kernel NULL pointer dereference at virtual address 0000000 0 printing eip: 00000000 *pde = 005a8001 Oops: 0000 [#1] SMP CPU: 0 EIP: 0060:[<00000000>] Not tainted VLI EFLAGS: 00010047 (2.6.18-rc2 #3) EIP is at rest_init+0x3feffd68/0x23 eax: 80008004 ebx: 00000080 ecx: 00000000 edx: 00000cfe esi: c04fff01 edi: 00000006 ebp: c2d21e9c esp: c2d21e38 ds: 007b es: 007b ss: 0068 Process swapper (pid: 1, ti=c2d20000 task=c2d10ab0 task.ti=c2d20000) Stack: 00930002 0000b109 c00fdc8a dba0c50c 0046c00f c0377040 00000060 00000000 00000006 00000246 c2da3a00 000000d0 c0378c22 00000000 00000000 00000080 00000006 00000002 c2d21e9c c01fd2e3 c2da3a00 00000080 00000006 00000002 Call Trace: [<c0377040>] pci_bios_read+0x8a/0xbe [<c0378c22>] pci_read+0x22/0x26 [<c01fd2e3>] pci_bus_read_config_word+0x39/0x5c [<c01ff015>] __pci_bus_find_cap+0x1f/0x5b [<c01ff069>] pci_find_capability+0x18/0x1c [<c01fe80e>] pci_cfg_space_size+0xe/0x6f [<c01fe9a1>] pci_scan_device+0x12d/0x18d [<c01fea8f>] pci_scan_single_device+0x10/0x28 [<c01feabe>] pci_scan_slot+0x17/0x4b [<c01feb0a>] pci_scan_child_bus+0x18/0x79 [<c01fed6f>] pci_scan_bus_parented+0x25/0x2e [<c0378dc1>] pcibios_scan_root+0x46/0x50 [<c05825ae>] pci_legacy_init+0x35/0x90 [<c0563826>] do_initcalls+0x55/0xe8 [<c01842dd>] proc_mkdir+0x12/0x16 [<c0137797>] init_irq_proc+0x21/0x2e [<c01002fd>] init+0x0/0x143 [<c010034e>] init+0x51/0x143 [<c0100e61>] kernel_thread_helper+0x5/0xb Code: Bad EIP value. EIP: [<00000000>] rest_init+0x3feffd68/0x23 SS:ESP 0068:c2d21e38 <0>Kernel panic - not syncing: Attempted to kill init! BUG: warning at arch/i386/kernel/smp.c:547/smp_call_function() [<c010b9a8>] smp_call_function+0x5a/0x11d [<c011c15d>] printk+0xe/0x11 [<c0110b39>] bust_spinlocks+0x3d/0x46 [<c010baae>] smp_send_stop+0x10/0x1b [<c010ba6b>] stop_this_cpu+0x0/0x33 [<c011b935>] panic+0x3e/0xd8 [<c011de99>] do_exit+0x73/0x345 [<c0103989>] die+0x1da/0x1e2 [<c0111217>] do_page_fault+0x47c/0x55a [<c013bd42>] buffered_rmqueue+0x104/0x11e [<c0110d9b>] do_page_fault+0x0/0x55a [<c0103281>] error_code+0x39/0x40 [<c0377040>] pci_bios_read+0x8a/0xbe [<c0378c22>] pci_read+0x22/0x26 [<c01fd2e3>] pci_bus_read_config_word+0x39/0x5c [<c01ff015>] __pci_bus_find_cap+0x1f/0x5b [<c01ff069>] pci_find_capability+0x18/0x1c [<c01fe80e>] pci_cfg_space_size+0xe/0x6f [<c01fe9a1>] pci_scan_device+0x12d/0x18d [<c01fea8f>] pci_scan_single_device+0x10/0x28 [<c01feabe>] pci_scan_slot+0x17/0x4b [<c01feb0a>] pci_scan_child_bus+0x18/0x79 [<c01fed6f>] pci_scan_bus_parented+0x25/0x2e [<c0378dc1>] pcibios_scan_root+0x46/0x50 [<c05825ae>] pci_legacy_init+0x35/0x90 [<c0563826>] do_initcalls+0x55/0xe8 [<c01842dd>] proc_mkdir+0x12/0x16 [<c0137797>] init_irq_proc+0x21/0x2e [<c01002fd>] init+0x0/0x143 [<c010034e>] init+0x51/0x143 [<c0100e61>] kernel_thread_helper+0x5/0xb
Seems fixed in 2.6.18-rc3-mm2, but NOT in 2.6.18-rc4
it seems your BIOS is somewhat broken. 2.6.17 changed the ordering in which PCI access methods are probed. so with kernel <= 2.6.16 it would use direct access but with 2.6.17+ it would use the BIOS routines. 2.6.18-rc3-mm2 has a (broken) patch to address the problem. the correct one will show up in next -mm, but i attach it anyway here. you can try it on top of 2.6.18-rc4. should fix it.
Created attachment 8798 [details] pci init: PCBIOS as last fallback
Perfectly fixed, also checked with 2.6.17.8 :) Many thanks!
*** Bug 6920 has been marked as a duplicate of this bug. ***