Bug 5564 - Kernel Panic on PCI-Probe - 450NX - HP Netserver LH4r
Summary: Kernel Panic on PCI-Probe - 450NX - HP Netserver LH4r
Status: CLOSED PATCH_ALREADY_AVAILABLE
Alias: None
Product: Drivers
Classification: Unclassified
Component: PCI (show other bugs)
Hardware: i386 Linux
: P2 high
Assignee: Bartlomiej Zolnierkiewicz
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2005-11-07 13:05 UTC by Frank Overton
Modified: 2005-12-08 03:09 UTC (History)
4 users (show)

See Also:
Kernel Version: 2.6.9-22
Subsystem:
Regression: ---
Bisected commit-id:


Attachments
debug patch (854 bytes, patch)
2005-11-07 17:25 UTC, Shaohua
Details | Diff
2.6.14.1 debug screenshot (388.66 KB, image/jpeg)
2005-11-14 13:01 UTC, Frank Overton
Details
2.6.9-22 (CentOS 4.2) Max resolution Screenshot (361.24 KB, image/jpeg)
2005-11-14 13:02 UTC, Frank Overton
Details

Description Frank Overton 2005-11-07 13:05:43 UTC
Most recent kernel where this bug did not occur:
Distribution: CentOS 4.1, 4.2
Hardware Environment: HP Netserver LH4r latest BIOS & Firmware available from HP
Software Environment: ?? 
Problem Description: 2.6 series kernels panic on boot at PCI probe with 450nx
(2.4.x kernels boot without hitch)  

Steps to reproduce: Boot the kernel

-- top of screen --
>    c316d000  c3126240  00000000 00000000  c01de368  00000000  c31262b8 
> c3126240
> 
> Call Trace: 
> [<co23e390>] class_device_create_file+0x11/0x16
> [<co1ddcaf>] pci_alloc_child_bus+0x74/0xb8
> [<co1ddda5>] pci_scan_bridge+0x90/0x1cd
> [<co1de368>] pci_scan_child_bus+0x51/0x77
> [<co1de4c2>] pci_scan_bus_parented+0x11d/0x133
> [<co29d3ec>] pci_bios_scan_root+0x3d/0x40
> [<co208e00>] acpi_pci_root_add+0x19a/0x1f0
> [<co20d825>] acpi_bus_driver_init+0x2c/0x8c
> [<co20e019>] acpi_bus_find_driver+0x110/0x21a
> [<co20e561>] acpi_bus_add+0x12c/0x152
> [<co20e68b>] acpi_bus_scan+0x104/0x156
> [<co3b7c7a>] acpi_bus_scan_init+0x48/0x5e
> [<co3a86fa>] do_initcalls+0x49/0x8e
> [<co100454>] init+0x0/0x1d8
> [<co1041d5>] kernel_thread_helper+0x5/0xb

Keywords: kernel, i450nx, PCI probe, boot, panic
Kernel versions: 2.6.9-22, 2.6.9-11 by me and (2.6.6, 2.6.7 by Andrew
Feldhacker)For an additional example see: Andrew Feldhacker's June 2004 report
at http://uwsg.ucs.indiana.edu/hypermail/linux/kernel/0407.0/0915.html

Other notes:
    -Someone reported that Mandrake 10.2 beta did not have this problem.
    -I know the legacy megaraid now requires a custom built kernel. I just can't
get past the PCI-Probe to install it.

Environment
Processor information (from /proc/cpuinfo):
processor       : 0
vendor_id       : GenuineIntel
cpu family      : 6
model           : 7
model name      : Pentium III (Katmai)
stepping        : 3
cpu MHz         : 550.045
cache size      : 512 KB
physical id     : 0
siblings        : 1
runqueue        : 0
fdiv_bug        : no
hlt_bug         : no
f00f_bug        : no
coma_bug        : no
fpu             : yes
fpu_exception   : yes
cpuid level     : 2
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
pat pse36 mmx fxsr sse
bogomips        : 1097.72

processor       : 1
vendor_id       : GenuineIntel
cpu family      : 6
model           : 7
model name      : Pentium III (Katmai)
stepping        : 3
cpu MHz         : 550.045
cache size      : 512 KB
physical id     : 0
siblings        : 1
runqueue        : 1
fdiv_bug        : no
hlt_bug         : no
f00f_bug        : no
coma_bug        : no
fpu             : yes
fpu_exception   : yes
cpuid level     : 2
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
pat pse36 mmx fxsr sse
bogomips        : 1097.72

processor       : 2
vendor_id       : GenuineIntel
cpu family      : 6
model           : 7
model name      : Pentium III (Katmai)
stepping        : 3
cpu MHz         : 550.045
cache size      : 512 KB
physical id     : 0
siblings        : 1
runqueue        : 2
fdiv_bug        : no
hlt_bug         : no
f00f_bug        : no
coma_bug        : no
fpu             : yes
fpu_exception   : yes
cpuid level     : 2
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
pat pse36 mmx fxsr sse
bogomips        : 1097.72

processor       : 3
vendor_id       : GenuineIntel
cpu family      : 6
model           : 7
model name      : Pentium III (Katmai)
stepping        : 3
cpu MHz         : 550.045
cache size      : 512 KB
physical id     : 0
siblings        : 1
runqueue        : 3
fdiv_bug        : no
hlt_bug         : no
f00f_bug        : no
coma_bug        : no
fpu             : yes
fpu_exception   : yes
cpuid level     : 2
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov
pat pse36 mmx fxsr sse
bogomips        : 1097.72

[7.3.] Module information (from /proc/modules):
ide-cd                 34016   0 (autoclean)
cdrom                  32896   0 (autoclean) [ide-cd]
qla2100               287804   5
iptable_filter          2412   0 (autoclean) (unused)
ip_tables              16544   1 [iptable_filter]
softdog                 3004   1
usbserial              23964   0 (autoclean) (unused)
parport_pc             18884   1 (autoclean)
lp                      9156   0 (autoclean)
parport                38848   1 (autoclean) [parport_pc lp]
autofs4                16984   0 (autoclean) (unused)
lock_gulm              65712   1
crc32                   3764   0 [lock_gulm]
gfs                   291308   1 (autoclean)
lock_harness            3576   0 (autoclean) [lock_gulm gfs]
pool                   84928   4
natsemi                20000   1
e100                   60048   1
floppy                 57552   0 (autoclean)
sg                     37388   0 (autoclean)
microcode               6912   0 (autoclean)
keybdev                 2976   0 (unused)
mousedev                5688   1
hid                    22308   0 (unused)
input                   6176   0 [keybdev mousedev hid]
usbcore                81152   1 [usbserial hid]
ext3                   89992   2
jbd                    55092   2 [ext3]
megaraid               30636   3
sd_mod                 13936  16
scsi_mod              115240   3 [qla2100 sg megaraid sd_mod]

[7.4.] Loaded driver and hardware information (/proc/ioports, /proc/iomem)
ioports
0000-001f : dma1
0020-003f : pic1
0040-005f : timer
0060-006f : keyboard
0070-007f : rtc
0080-008f : dma page reg
00a0-00bf : pic2
00c0-00df : dma2
00f0-00ff : fpu
01f0-01f7 : ide0
02f8-02ff : serial(auto)
0378-037a : parport0
03c0-03df : vga+
03f6-03f6 : ide0
03f8-03ff : serial(auto)
0cf8-0cff : PCI conf1
1040-105f : Intel Corp. 82371AB/EB/MB PIIX4 ACPI
1400-14ff : LSI Logic / Symbios Logic 53c895
8000-803f : Intel Corp. 82371AB/EB/MB PIIX4 ACPI
9000-901f : Intel Corp. 82371AB/EB/MB PIIX4 USB
9020-902f : Intel Corp. 82371AB/EB/MB PIIX4 IDE
 9020-9027 : ide0
a000-afff : PCI Bus #01
 a000-a0ff : LSI Logic / Symbios Logic 53c895 (#2)
b000-bfff : PCI Bus #02
 b000-b0ff : National Semiconductor Corporation DP83815 (MacPhyter) Ethernet
Controller
   b000-b0ff : eth1
 b400-b4ff : QLogic Corp. QLA2100 64-bit Fibre Channel Adapter
   b400-b4ff : qla2100
 b800-b81f : Intel Corp. 82557/8/9 [Ethernet Pro 100]
   b800-b81f : e100

iomem
00000000-0009fbff : System RAM
0009fc00-0009ffff : reserved
000a0000-000bffff : Video RAM area
000c0000-000c7fff : Video ROM
000c8000-000c87ff : Extension ROM
000c8800-000c97ff : Extension ROM
000c9800-000c9fff : Extension ROM
000f0000-000fffff : System ROM
00100000-e7feffff : System RAM
 00100000-002aba6e : Kernel code
 002aba6f-003efb27 : Kernel data
e7ff0000-e7fffbff : ACPI Tables
e7fffc00-e7ffffff : ACPI Non-volatile Storage
e8000000-e8007fff : Hewlett-Packard Company NetServer Smart IRQ Router
e8008000-e8008fff : Cirrus Logic GD 5446
e8009000-e80090ff : LSI Logic / Symbios Logic 53c895
e800a000-e800afff : LSI Logic / Symbios Logic 53c895
e8100000-e81fffff : PCI Bus #01
 e8100000-e8100fff : LSI Logic / Symbios Logic 53c895 (#2)
 e8101000-e81010ff : LSI Logic / Symbios Logic 53c895 (#2)
e8200000-e83fffff : PCI Bus #02
 e8200000-e82fffff : Intel Corp. 82557/8/9 [Ethernet Pro 100]
   e8200000-e82fffff : e100
 e8300000-e8300fff : National Semiconductor Corporation DP83815 (MacPhyter)
Ethernet Controller
   e8300000-e8300fff : eth1
 e8301000-e8301fff : QLogic Corp. QLA2100 64-bit Fibre Channel Adapter
   e8301000-e8301fff : qla2100
ea000000-ebffffff : Cirrus Logic GD 5446
f0000000-f7ffffff : Intel Corp. 80960RP [i960RP Microprocessor]
 f0000000-f000007f : MegaRAID: LSI Logic Corporation
f8000000-f80fffff : PCI Bus #02
 f8000000-f8000fff : Intel Corp. 82557/8/9 [Ethernet Pro 100]
   f8000000-f8000fff : e100
fec00000-fec0ffff : reserved
fee00000-fee00fff : reserved
fffe8800-ffffffff : reserved

[7.5.] PCI information ('lspci -vvv' as root)

> 00:02.0 PCI bridge: Intel Corp. 80960RP [i960 RP Microprocessor/Bridge] (rev
03) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle-
MemWINV- VGASnoop- ParErr+ Stepping- SERR+ FastB2B- Status: Cap- 66Mhz- UDF-
FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency:
64, cache line size 08 Bus: primary=00, secondary=01, subordinate=01,
sec-latency=248 I/O behind bridge: 0000a000-0000afff Memory behind bridge:
e8100000-e81fffff Prefetchable memory behind bridge: fff00000-000fffff
BridgeCtl: Parity+ SERR+ NoISA+ VGA- MAbort- >Reset- FastB2B- 00:02.1 I2O: Intel
Corp. 80960RP [i960RP Microprocessor] (rev 03) (prog-if 01) Subsystem:
Hewlett-Packard Company MegaRAID T5, Integrated HP NetRAID Control: I/O- Mem+
BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr+ Stepping- SERR+ FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR- Latency: 64, cache line size 08 Interrupt: pin A routed
to IRQ 10 BIST result: 00 Region 0: Memory at f0000000 (32-bit, prefetchable)
[size=128M] Expansion ROM at <unassigned> [disabled] [size=32K] 00:03.0 PCI
bridge: Digital Equipment Corporation DECchip 21152 (rev 02) (prog-if 00 [Normal
decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+
Stepping- SERR+ FastB2B- Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium
>TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 64, cache line size 08 Bus:
primary=00, secondary=02, subordinate=02, sec-latency=234 I/O behind bridge:
0000b000-0000bfff Memory behind bridge: e8200000-e83fffff Prefetchable memory
behind bridge: 00000000f8000000-00000000f8000000 BridgeCtl: Parity+ SERR+ NoISA+
VGA- MAbort- >Reset- FastB2B- 00:06.0 System peripheral: Hewlett-Packard Company
NetServer Smart IRQ Router (rev a0) Subsystem: Hewlett-Packard Company: Unknown
device 0001 Control: I/O- Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr+
Stepping- SERR+ FastB2B- Status: Cap- 66Mhz- UDF- FastB2B- ParErr- DEVSEL=slow
>TAbort- <TAbort- <MAbort- >SERR- <PERR- Region 0: Memory at e8000000 (32-bit,
non-prefetchable) [size=32K] 00:08.0 VGA compatible controller: Cirrus Logic GD
5446 (rev 45) (prog-if 00 [VGA]) Subsystem: Hewlett-Packard Company: Unknown
device 0001 Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B- Status: Cap- 66Mhz- UDF- FastB2B- ParErr- DEVSEL=medium
>TAbort- <TAbort- <MAbort- >SERR- <PERR- Region 0: Memory at ea000000 (32-bit,
prefetchable) [size=32M] Region 1: Memory at e8008000 (32-bit, non-prefetchable)
[size=4K] Expansion ROM at <unassigned> [disabled] [size=32K] 00:0f.0 ISA
bridge: Intel Corp. 82371AB/EB/MB PIIX4 ISA (rev 02) Control: I/O+ Mem+
BusMaster+ SpecCycle+ MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR- Latency: 0 00:0f.1 IDE interface: Intel Corp.
82371AB/EB/MB PIIX4 IDE (rev 01) (prog-if 80 [Master]) Control: I/O+ Mem-
BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR- Latency: 64 Region 4: I/O ports at 9020 [size=16] 00:0f.2
USB Controller: Intel Corp. 82371AB/EB/MB PIIX4 USB (rev 01) (prog-if 00 [UHCI])
Control: I/O+ Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping-
SERR- FastB2B- Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort-
<TAbort- <MAbort- >SERR- <PERR- Interrupt: pin D routed to IRQ 0 Region 4: I/O
ports at 9000 [size=32] 00:0f.3 Bridge: Intel Corp. 82371AB/EB/MB PIIX4 ACPI
(rev 02) Control: I/O+ Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B- Status: Cap- 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium
>TAbort- <TAbort- <MAbort- >SERR- <PERR- Interrupt: pin ? routed to IRQ 9
00:10.0 Host bridge: Intel Corp. 450NX - 82451NX Memory & I/O Controller (rev
03) Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr-
Stepping- SERR- FastB2B- Status: Cap- 66Mhz- UDF- FastB2B- ParErr- DEVSEL=fast
>TAbort- <TAbort- <MAbort- >SERR- <PERR- 00:12.0 Host bridge: Intel Corp. 450NX
- 82454NX/84460GX PCI Expander Bridge (rev 02) Control: I/O- Mem+ BusMaster+
SpecCycle- MemWINV+ VGASnoop- ParErr+ Stepping- SERR+ FastB2B- Status: Cap-
66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort+ >SERR-
<PERR- Latency: 72, cache line size 08 01:07.0 SCSI storage controller: LSI
Logic / Symbios Logic 53c895 (rev 01) Subsystem: Hewlett-Packard Company:
Unknown device 1000 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop-
ParErr+ Stepping- SERR+ FastB2B- Status: Cap- 66Mhz- UDF- FastB2B- ParErr-
DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 247 (7500ns min,
16000ns max), cache line size 08 Interrupt: pin A routed to IRQ 11 Region 0: I/O
ports at 1400 [size=256] Region 1: Memory at e8009000 (32-bit, non-prefetchable)
[size=256] Region 2: Memory at e800a000 (32-bit, non-prefetchable) [size=4K]
01:07.0 SCSI storage controller: LSI Logic / Symbios Logic 53c895 (rev 01)
Subsystem: Hewlett-Packard Company: Unknown device 1000 Control: I/O+ Mem+
BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr+ Stepping- SERR+ FastB2B-
Status: Cap- 66Mhz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort-
<MAbort- >SERR- <PERR- Latency: 247 (7500ns min, 16000ns max), cache line size
08 Interrupt: pin A routed to IRQ 11 Region 0: I/O ports at a000 [size=256]
Region 1: Memory at e8101000 (32-bit, non-prefetchable) [size=256] Region 2:
Memory at e8100000 (32-bit, non-prefetchable) [size=4K] 02:02.0 Ethernet
controller: Intel Corp. 82557/8/9 [Ethernet Pro 100] (rev 05) Subsystem:
Hewlett-Packard Company NetServer 10/100TX Control: I/O+ Mem+ BusMaster+
SpecCycle- MemWINV+ VGASnoop- ParErr+ Stepping- SERR+ FastB2B- Status: Cap+
66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR-
<PERR- Latency: 66 (2000ns min, 14000ns max), cache line size 08 Interrupt: pin
A routed to IRQ 15 Region 0: Memory at f8000000 (32-bit, prefetchable) [size=4K]
Region 1: I/O ports at b800 [size=32] Region 2: Memory at e8200000 (32-bit,
non-prefetchable) [size=1M] Expansion ROM at <unassigned> [disabled] [size=1M]
Capabilities: [dc] Power Management version 1 Flags: PMEClk- DSI+ D1+ D2+
AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold-) Status: D0 PME-Enable- DSel=0
DScale=0 PME- 02:03.0 Ethernet controller: National Semiconductor Corporation
DP83815 (MacPhyter) Ethernet Controller Subsystem: Netgear FA311 / FA312 (FA311
with WoL HW) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+
Stepping- SERR+ FastB2B+ Status: Cap+ 66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium
>TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 90 (2750ns min, 13000ns max)
Interrupt: pin A routed to IRQ 5 Region 0: I/O ports at b000 [size=256] Region
1: Memory at e8300000 (32-bit, non-prefetchable) [size=4K] Expansion ROM at
<unassigned> [disabled] [size=64K] Capabilities: [40] Power Management version 2
Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold-)
Status: D0 PME-Enable- DSel=0 DScale=0 PME+ 02:04.0 SCSI storage controller:
QLogic Corp. QLA2100 64-bit Fibre Channel Adapter (rev 03) Subsystem: QLogic
Corp. QLA2100 64-bit Fibre Channel Adapter Control: I/O+ Mem+ BusMaster+
SpecCycle- MemWINV+ VGASnoop- ParErr+ Stepping- SERR+ FastB2B- Status: Cap-
66Mhz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR-
<PERR- Latency: 64, cache line size 08 Interrupt: pin A routed to IRQ 11 Region
0: I/O ports at b400 [size=256] Region 1: Memory at e8301000 (32-bit,
non-prefetchable) [size=4K] Expansion ROM at <unassigned> [disabled] [size=64K] 




[7.6.] SCSI information (from /proc/scsi/scsi)
Host: scsi0 Channel: 00 Id: 00 Lun: 00
 Vendor: MegaRAID Model: LD0 RAID1  8677R Rev:   D
 Type:   Direct-Access                    ANSI SCSI revision: 02
Host: scsi2 Channel: 00 Id: 00 Lun: 00
 Vendor: HP       Model: FCArray 5_104196 Rev: 5549
 Type:   Direct-Access                    ANSI SCSI revision: 02
Host: scsi2 Channel: 00 Id: 00 Lun: 01
 Vendor: HP       Model: FCArray 5_69464  Rev: 5549
 Type:   Direct-Access                    ANSI SCSI revision: 02
Host: scsi2 Channel: 00 Id: 00 Lun: 02
 Vendor: HP       Model: FCArray 5_104196 Rev: 5549
 Type:   Direct-Access                    ANSI SCSI revision: 02
Comment 1 Greg Kroah-Hartman 2005-11-07 13:49:04 UTC
This is a bug in your bios.  Please contact HP to get an update for it.
Comment 2 Shaohua 2005-11-07 17:25:22 UTC
Created attachment 6492 [details]
debug patch

I somehow suspect it's a kernel bug. pci_fixup_i450nx calls pci_scan_bus, which
appears adding new devices too early to me (root bridge scanning will add all
new devices finally). If it works for you, we might also fix pci_fixup_i450gx.
Comment 3 Shaohua 2005-11-07 17:27:40 UTC
I would mention the patch is against 2.6.14. It can't work for previous kernel 
version (pci_scan_bus_parented changed recently)
Comment 4 Shaohua 2005-11-07 21:18:22 UTC
Comment on attachment 6492 [details]
debug patch

Relooked at it. Sorry, my guess is wrong.
Comment 5 Frank Overton 2005-11-07 21:55:57 UTC
I am not a coder and my C is quite rusty but might the following be an issue?

The following patch appears about the same time as the bug:

===== piix.c 1.20 vs edited =====
--- 1.20/drivers/ide/pci/piix.c Wed Oct 22 09:27:24 2003
+++ edited/piix.c Fri Nov 21 09:28:43 2003
@@ -768,8 +768,8 @@
/* Only on the original revision: IDE DMA can hang */
if(rev == 0x00)
no_piix_dma = 1;
- /* On all revisions PXB bus lock must be disabled for IDE */
- else if(cfg & (1<<14))
+ /* On all revisions below 5 PXB bus lock must be disabled for IDE */
+ else if(cfg & (1<<14) && rev < 5)
no_piix_dma = 2;
}
if(no_piix_dma) 

Could it make a difference that the last line is not...
+ else if(cfg & (1<<14) && rev < 0x05)
Comment 6 Frank Overton 2005-11-10 14:11:22 UTC
Recreated error on kernel 2.6.14.1 with DEBUG defined in arch/i386/pci/pci.h
(BTW: Changing to 5 to 0x05 in piix.c makes no difference.)

This is the OOPS message:
Starting balanced_irq
Using IPI Shortcut mode
md: Autodetecting RAID arrays
md: autorun ...
md: ... autorun DONE.
RAMDISK: Compressed image found at block 0
logips2pp: Detected unknown logitech mouse model 1
input: PS/2 Logitech Mouse on isa0060/serio1
VFS: Mounted root (ext2 filesystem)
RED Hat nash version 3.5.13.6 starting
Mounted /proc filesystem
Loading scsi_mod.ko module
insmod: QM_MODULES:

Error: /bin/insmod exited abnormally!
Loading megaraid.ko module
insmod: QM_MODULES:

Error: /bin/insmod exited abnormally!
Loading sym53c8xx.ko module
insmod: QM_MODULES:

Error: /bin/insmod exited abnormally!
Loading ext3.ko module
insmod: QM_MODULES:

Error: /bin/insmod exited abnormally!
Creating block devices
VFS: CAnnot open root device "LABEL=/" or unknown-block(0,0)
Please append a correct "root=" boot option
Kernel Panic - not syncing: VFS: Unable to mount root fs on unknown-block(0,0)
Comment 7 Len Brown 2005-11-10 21:20:44 UTC
The output at the URL in the initial report shows
the same PCI stack trace with "acpi=off".
punting this report to PCI sub-system.
Comment 8 Andrew Morton 2005-11-11 01:54:48 UTC
It would help if we could get the whole oops trace.

Can you get a 50-row screen?  I can't remember how to do that with grub,
but in lilo you add `vga=extended' to the config file.

That'll get us the whole oops.  A digital photo of the screen will then
save a lot of pain.

Comment 9 Len Brown 2005-11-14 10:13:52 UTC
We've booted RHEL4 (2.6.9-22) on an Intel AD450NX,
so this issue is not as simple as "2.6 vs 450NX".
It may be specific to the HP Netserver LH4r.

Please verify that the system is running an up-to-date BIOS.
Comment 10 Greg Kroah-Hartman 2005-11-14 10:20:12 UTC
Odds are RHEL has a patch that we do not know about :(
Comment 11 Frank Overton 2005-11-14 13:01:19 UTC
Created attachment 6586 [details]
2.6.14.1 debug screenshot

Screen shot at max resolution of 2.6.14.1 debug
Comment 12 Frank Overton 2005-11-14 13:02:55 UTC
Created attachment 6587 [details]
2.6.9-22 (CentOS 4.2) Max resolution Screenshot
Comment 13 Frank Overton 2005-11-14 13:05:17 UTC
I have verified with HP that I am running the latest BIOS files available.
Comment 14 Andrew Morton 2005-11-14 13:16:55 UTC
bugme-daemon@bugzilla.kernel.org wrote:
>
> http://bugzilla.kernel.org/show_bug.cgi?id=5564
> 

Ah, the dreaded "PCI: device disabled (BIOS)" thing.

Is there a fix for that?   Is it a simple matter of enabling the thing in BIOS?

If so, why?   Can't we just ignore what BIOS said and go ahead and do it?

Is this new behaviour?


Comment 15 Frank Overton 2005-11-14 14:28:38 UTC
Behaviour is new to 2.6 kernels.

All 2.4 kernels work fine.
Comment 16 Frank Overton 2005-11-14 14:32:30 UTC
Hmmm...

The LH4 has two internal RAID controllers. I have disabled one of them in the
BIOS and have RAID1 mirror drives configured in the other. I'll can enable both
controllers but I don't expect it to make a difference.
Comment 17 Frank Overton 2005-11-14 14:42:37 UTC
Enabling both RAID controllers doesn't change a thing.
Comment 18 Bartlomiej Zolnierkiewicz 2005-11-16 02:06:51 UTC
On 11/14/05, Andrew Morton <akpm@osdl.org> wrote:
> bugme-daemon@bugzilla.kernel.org wrote:
> >
> > http://bugzilla.kernel.org/show_bug.cgi?id=5564
> >
>
> Ah, the dreaded "PCI: device disabled (BIOS)" thing.
>
> Is there a fix for that?   Is it a simple matter of enabling the thing in BIOS?

For IDE enabling it in BIOS should be OK.

> If so, why?   Can't we just ignore what BIOS said and go ahead and do it?

In general - yes, the only side-effect would be different ordering of
IDE devices.

However it doesn't look like IDE problem et all, Andrew please look at this:

"
This is the OOPS message:
Starting balanced_irq
Using IPI Shortcut mode
md: Autodetecting RAID arrays
md: autorun ...
md: ... autorun DONE.
RAMDISK: Compressed image found at block 0
logips2pp: Detected unknown logitech mouse model 1
input: PS/2 Logitech Mouse on isa0060/serio1
VFS: Mounted root (ext2 filesystem)
RED Hat nash version 3.5.13.6 starting
Mounted /proc filesystem
Loading scsi_mod.ko module
insmod: QM_MODULES:

Error: /bin/insmod exited abnormally!
Loading megaraid.ko module
insmod: QM_MODULES:

Error: /bin/insmod exited abnormally!
Loading sym53c8xx.ko module
insmod: QM_MODULES:

Error: /bin/insmod exited abnormally!
Loading ext3.ko module
insmod: QM_MODULES:

Error: /bin/insmod exited abnormally!
Creating block devices
VFS: CAnnot open root device "LABEL=/" or unknown-block(0,0)
Please append a correct "root=" boot option
Kernel Panic - not syncing: VFS: Unable to mount root fs on unknown-block(0,0)
"

Frank, I don't see any OOPS here.

IMO it is PCI problem in RH 2.6.9-22 kernel which is fixed in 2.6.14.1, plus
RHEL4.2 (CentOS4.2) comes with one year old module-init-tools package.

You should try upgrading module-init-tools to the latest version,
regenerate your initrd and try again with 2.6.14.1.

BTW Documentation/Changes in -git tree lists module-init-tools-0.9.10
as a required version which seems a bit unbelievable (they were released
on 25-Feb-2003).
Comment 19 Bartlomiej Zolnierkiewicz 2005-11-16 02:10:53 UTC
On 11/14/05, Andrew Morton <akpm@osdl.org> wrote:
> bugme-daemon@bugzilla.kernel.org wrote:
> >
> > http://bugzilla.kernel.org/show_bug.cgi?id=5564
> >
>
> Ah, the dreaded "PCI: device disabled (BIOS)" thing.
>
> Is there a fix for that?   Is it a simple matter of enabling the thing in BIOS?

For IDE enabling it in BIOS should be OK.

> If so, why?   Can't we just ignore what BIOS said and go ahead and do it?

In general - yes, the only side-effect would be different ordering of
IDE devices.

However it doesn't look like IDE problem et all, Andrew please look at this:

"
This is the OOPS message:
Starting balanced_irq
Using IPI Shortcut mode
md: Autodetecting RAID arrays
md: autorun ...
md: ... autorun DONE.
RAMDISK: Compressed image found at block 0
logips2pp: Detected unknown logitech mouse model 1
input: PS/2 Logitech Mouse on isa0060/serio1
VFS: Mounted root (ext2 filesystem)
RED Hat nash version 3.5.13.6 starting
Mounted /proc filesystem
Loading scsi_mod.ko module
insmod: QM_MODULES:

Error: /bin/insmod exited abnormally!
Loading megaraid.ko module
insmod: QM_MODULES:

Error: /bin/insmod exited abnormally!
Loading sym53c8xx.ko module
insmod: QM_MODULES:

Error: /bin/insmod exited abnormally!
Loading ext3.ko module
insmod: QM_MODULES:

Error: /bin/insmod exited abnormally!
Creating block devices
VFS: CAnnot open root device "LABEL=/" or unknown-block(0,0)
Please append a correct "root=" boot option
Kernel Panic - not syncing: VFS: Unable to mount root fs on unknown-block(0,0)
"

Frank, I don't see any OOPS here.

IMO it is PCI problem in RH 2.6.9-22 kernel which is fixed in 2.6.14.1, plus
RHEL4.2 (CentOS4.2) comes with one year old module-init-tools package.

You should try upgrading module-init-tools to the latest version,
regenerate your initrd and try again with 2.6.14.1.

BTW Documentation/Changes in -git tree lists module-init-tools-0.9.10
as a required version which seems a bit unbelievable (they were released
on 25-Feb-2003).
Comment 20 Frank Overton 2005-11-16 11:55:35 UTC
Upgrading module-init-tools let 2.6.14.1 boot! Thanks!!!

Now being a noob, I'll ask a noob question...
Can I run CentOS 4.2 on top of the newer kernel or is too much of the release
dependent on a specific kernel version?

Thanks again!

Comment 21 Bartlomiej Zolnierkiewicz 2005-11-17 06:05:33 UTC
Generally you should stay with distribution provided kernel, but if your
user-space matches newer kernel versions (see Documentation/Changes)
you can give it a try.  However you are probably better off reporting the
problem (+ pointing to this bugzilla entry) to CentOS developers.

PS I would like to close this bug as it is fixed in the mainstream kernel.

Note You need to log in before you can comment on or make changes to this bug.