Bug 11404

Summary: BUG: in 2.6.23-rc3-git7 in do_cciss_intr
Product: SCSI Drivers
Reporter: Rafael J. Wysocki (rjw)
Component: Other
Assignee: scsi_drivers-other
Status: CLOSED INVALID
Severity: normal
CC: mike.miller, randy.dunlap
Priority: P1
Hardware: All
OS: Linux
Kernel Version: 2.6.23-rc3-git7
Subsystem:
Regression: Yes
Bisected commit-id:
Bug Depends on:
Bug Blocks: 11167

Description Rafael J. Wysocki 2008-08-22 14:51:14 UTC
Subject    : BUG: in 2.6.23-rc3-git7 in do_cciss_intr
Submitter  : rdunlap <randy.dunlap@oracle.com>
Date       : 2008-08-21 5:52
References : http://marc.info/?l=linux-kernel&m=121929819616273&w=4
References : http://marc.info/?l=linux-kernel&m=121932889105368&w=4
Handled-By : "Miller, Mike (OS Dev)" <Mike.Miller@hp.com>
Handled-By : James Bottomley <James.Bottomley@hansenpartnership.com>

This entry is being used for tracking a regression from 2.6.26.  Please don't
close it until the problem is fixed in the mainline.

Comment 1 Rafael J. Wysocki 2008-09-04 12:42:35 UTC
On Thursday, 4 of September 2008, Randy Dunlap wrote:
> Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a report
> > of recent regressions.
> > 
> > The following bug entry is on the current list of known regressions
> > from 2.6.26.  Please verify if it still should be listed and let me know
> > (either way).
> > 
> > 
> > Bug-Entry   : http://bugzilla.kernel.org/show_bug.cgi?id=11404
> > Subject             : BUG: in 2.6.23-rc3-git7 in do_cciss_intr
> > Submitter   : rdunlap <randy.dunlap@oracle.com>
> > Date                : 2008-08-21 5:52 (10 days old)
> > References  : http://marc.info/?l=linux-kernel&m=121929819616273&w=4
> >               http://marc.info/?l=linux-kernel&m=121932889105368&w=4
> > Handled-By  : Miller, Mike (OS Dev) <Mike.Miller@hp.com>
> >               James Bottomley <James.Bottomley@hansenpartnership.com>
> 
> Yes, just got this same bug in 2.6.27-rc5-git3.

Comment 2 Mike Miller 2008-10-23 13:26:30 UTC
Randy, 
Can you provide your configuration info? System type, controller type, etc. I am not able to reproduce this in my lab. -- mikem

Comment 3 Randy Dunlap 2008-10-23 13:51:20 UTC
System is an HP Blade, dual x86_64 (AMD) with hyperthreading, 8 GB RAM,
nVidia CK804 chipset.  The storage controller is this:

42:08.0 RAID bus controller: Hewlett-Packard Company Unknown device 3238
        Subsystem: Hewlett-Packard Company Unknown device 3211
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr+ Stepping- SERR+ FastB2B-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR-
        Latency: 120
        Interrupt: pin A routed to IRQ 87
        Region 0: Memory at fdf80000 (64-bit, non-prefetchable) [size=512K]
        Region 2: I/O ports at 4000 [size=256]
        Region 3: Memory at fdf70000 (32-bit, non-prefetchable) [size=32K]
        [virtual] Expansion ROM at 84000000 [disabled] [size=16K]
        Capabilities: [c0] Power Management version 2 
                Flags: PMEClk- DSI- D1+ D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
                Status: D0 PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [cc] Message Signalled Interrupts: 64bit+ Queue=0/1 Enable+
                Address: 00000000fee0200c  Data: 41b9
        Capabilities: [dc] PCI-X non-bridge device
                Command: DPERE- ERO- RBC=512 OST=1
                Status: Dev=42:08.0 64bit+ 133MHz+ SCD- USC- DC=simple DMMRBC=4096 DMOST=1 DMCRS=8 RSCEM- 266MHz- 533MHz-

The problem just happened again on 2.6.27-git10.
The kernel boot log & config file for this are at:
http://oss.oracle.com/~rdunlap/kerneltest/logs/cciss/
along with lots of /proc files & lspci etc. from a good boot.

Comment 4 Rafael J. Wysocki 2008-10-26 03:58:04 UTC
On Sunday, 26 of October 2008, Randy Dunlap wrote:
> On Sat, 25 Oct 2008 23:07:47 +0200 (CEST) Rafael J. Wysocki wrote:
> 
> > This message has been generated automatically as a part of a report
> > of regressions introduced between 2.6.26 and 2.6.27.
> > 
> > The following bug entry is on the current list of known regressions
> > introduced between 2.6.26 and 2.6.27.  Please verify if it still should
> > be listed and let me know (either way).
> 
> Yes, it should still be listed.
> 
> > Bug-Entry   : http://bugzilla.kernel.org/show_bug.cgi?id=11404
> > Subject             : BUG: in 2.6.23-rc3-git7 in do_cciss_intr
> > Submitter   : rdunlap <randy.dunlap@oracle.com>
> > Date                : 2008-08-21 5:52 (66 days old)
> > References  : http://marc.info/?l=linux-kernel&m=121929819616273&w=4
> >               http://marc.info/?l=linux-kernel&m=121932889105368&w=4
> > Handled-By  : Miller, Mike (OS Dev) <Mike.Miller@hp.com>
> >               James Bottomley <James.Bottomley@hansenpartnership.com>

Comment 5 Mike Miller 2008-10-27 08:09:01 UTC
Yes, please continue to track. So far I cannot reproduce the failure, so I may just have to take a stab in the dark and post possible fixes.

Comment 6 Rafael J. Wysocki 2008-11-09 15:24:28 UTC
On Sunday, 9 of November 2008, Randy Dunlap wrote:
> Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a report
> > of regressions introduced between 2.6.26 and 2.6.27.
> > 
> > The following bug entry is on the current list of known regressions
> > introduced between 2.6.26 and 2.6.27.  Please verify if it still should
> > be listed and let me know (either way).
> 
> Yes, it should still be listed.
> 
> > Bug-Entry   : http://bugzilla.kernel.org/show_bug.cgi?id=11404
> > Subject             : BUG: in 2.6.23-rc3-git7 in do_cciss_intr
> > Submitter   : rdunlap <randy.dunlap@oracle.com>
> > Date                : 2008-08-21 5:52 (81 days old)
> > References  : http://marc.info/?l=linux-kernel&m=121929819616273&w=4
> >               http://marc.info/?l=linux-kernel&m=121932889105368&w=4
> > Handled-By  : Miller, Mike (OS Dev) <Mike.Miller@hp.com>
> >               James Bottomley <James.Bottomley@hansenpartnership.com>

Comment 7 Rafael J. Wysocki 2008-11-17 13:29:38 UTC
On Monday, 17 of November 2008, Randy Dunlap wrote:
> Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a report
> > of regressions introduced between 2.6.26 and 2.6.27.
> > 
> > The following bug entry is on the current list of known regressions
> > introduced between 2.6.26 and 2.6.27.  Please verify if it still should
> > be listed and let me know (either way).
> > 
> 
> Nothing has changed.  IMO that means leave the bug as is (alive).
> 
> > 
> > Bug-Entry   : http://bugzilla.kernel.org/show_bug.cgi?id=11404
> > Subject             : BUG: in 2.6.23-rc3-git7 in do_cciss_intr
> > Submitter   : rdunlap <randy.dunlap@oracle.com>
> > Date                : 2008-08-21 5:52 (88 days old)
> > References  : http://marc.info/?l=linux-kernel&m=121929819616273&w=4
> >               http://marc.info/?l=linux-kernel&m=121932889105368&w=4
> > Handled-By  : Miller, Mike (OS Dev) <Mike.Miller@hp.com>
> >               James Bottomley <James.Bottomley@hansenpartnership.com>

Comment 8 Randy Dunlap 2008-11-19 11:39:25 UTC
This currently happens only when the new kernel is loaded via kexec
and the first (loading) kernel is "old" (e.g. 2.6.18); using 2.6.25
to load the kernel-under-test works OK.