Bug 7292 - irq 7: nobody cared - 945GM/ICH7/T1300
Summary: irq 7: nobody cared - 945GM/ICH7/T1300
Status: REJECTED INVALID
Alias: None
Product: ACPI
Classification: Unclassified
Component: Config-Interrupts
Hardware: i386 Linux
Importance: P2 normal
Assignee: Len Brown
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2006-10-09 10:48 UTC by Boris B. Zhmurov
Modified: 2006-10-12 20:37 UTC
CC List: 2 users

See Also:
Kernel Version: 2.6.18
Subsystem:
Regression: ---
Bisected commit-id:


Attachments
dmesg (14.72 KB, text/plain)
2006-10-09 10:55 UTC, Boris B. Zhmurov
.config (53.13 KB, text/plain)
2006-10-09 10:58 UTC, Boris B. Zhmurov
dmesg of 2.6.16 (16.40 KB, text/plain)
2006-10-11 22:53 UTC, Boris B. Zhmurov
acpidump of 2.6.16 (149.12 KB, application/octet-stream)
2006-10-11 22:54 UTC, Boris B. Zhmurov
interrupts of 2.6.26 (553 bytes, text/plain)
2006-10-11 22:54 UTC, Boris B. Zhmurov
lspci -vv of 2.6.16 (18.14 KB, text/plain)
2006-10-11 23:02 UTC, Boris B. Zhmurov

Description Boris B. Zhmurov 2006-10-09 10:48:28 UTC
Most recent kernel where this bug did not occur: 2.6.18 (vanilla)

Distribution: RHEL 4 based distro


Hardware Environment:
00:00.0 Host bridge: Intel Corporation Mobile 945GM/PM/GMS/940GML and 945GT
Express Memory Controller Hub (rev 03)
00:01.0 PCI bridge: Intel Corporation Mobile 945GM/PM/GMS/940GML and 945GT
Express PCI Express Root Port (rev 03)
00:1b.0 Audio device: Intel Corporation 82801G (ICH7 Family) High Definition
Audio Controller (rev 02)
00:1c.0 PCI bridge: Intel Corporation 82801G (ICH7 Family) PCI Express Port 1
(rev 02)
00:1c.3 PCI bridge: Intel Corporation 82801G (ICH7 Family) PCI Express Port 4
(rev 02)
00:1d.0 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI #1 (rev 02)
00:1d.1 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI #2 (rev 02)
00:1d.2 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI #3 (rev 02)
00:1d.3 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI #4 (rev 02)
00:1d.7 USB Controller: Intel Corporation 82801G (ICH7 Family) USB2 EHCI
Controller (rev 02)
00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev e2)
00:1f.0 ISA bridge: Intel Corporation 82801GBM (ICH7-M) LPC Interface Bridge
(rev 02)
00:1f.1 IDE interface: Intel Corporation 82801G (ICH7 Family) IDE Controller
(rev 02)
01:00.0 VGA compatible controller: nVidia Corporation Quadro NVS 110M / GeForce
Go 7300 (rev a1)
02:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI
Express Gigabit Ethernet controller (rev 01)
03:00.0 Network controller: Intel Corporation PRO/Wireless 3945ABG Network
Connection (rev 02)
04:01.0 CardBus bridge: Ricoh Co Ltd RL5c476 II (rev b3)
04:01.1 FireWire (IEEE 1394): Ricoh Co Ltd R5C552 IEEE 1394 Controller (rev 08)
04:01.2 Class 0805: Ricoh Co Ltd R5C822 SD/SDIO/MMC/MS/MSPro Host Adapter (rev 17)
04:01.3 System peripheral: Ricoh Co Ltd R5C592 Memory Stick Bus Host Adapter
(rev 08)



Software Environment:
linux-2.6.18
glibc-2.3.4-2.25
gcc-3.4.6-3
udev-039-10.15.EL4
binutils-2.15.92.0.2-21
coreutils-5.2.1-31.4


Problem Description:
spurious 8259A interrupt: IRQ15.
irq 7: nobody cared (try booting with the "irqpoll" option)
 [<c0131c2a>] __report_bad_irq+0x2b/0x68
 [<c0131cf3>] note_interrupt+0x73/0xa4
 [<c01316a6>] __do_IRQ+0x70/0xa2
 [<c01040a1>] do_IRQ+0x5f/0x7a
 =======================
 [<c0102bd2>] common_interrupt+0x1a/0x20
 [<c024007b>] bitmap_unplug+0x104/0x13a
 [<c0118831>] __do_softirq+0x2c/0x73
 [<c0104186>] do_softirq+0x3a/0x7f
 =======================
 [<c01040b3>] do_IRQ+0x71/0x7a
 [<c0102bd2>] common_interrupt+0x1a/0x20
 [<c01dfdaa>] acpi_processor_idle+0x1af/0x2ad
 [<c01010c4>] cpu_idle+0x3f/0x5b
 [<c034962b>] start_kernel+0x19c/0x19e
handlers:
[<c021f7e9>] (usb_hcd_irq+0x0/0x4e)
[<c021f7e9>] (usb_hcd_irq+0x0/0x4e)
Disabling IRQ #7
spurious 8259A interrupt: IRQ7.



Steps to reproduce:
Just boot the kernel and wait a little while.
Comment 1 Boris B. Zhmurov 2006-10-09 10:55:28 UTC
Created attachment 9192
dmesg

dmesg
Comment 2 Boris B. Zhmurov 2006-10-09 10:58:08 UTC
Created attachment 9193
.config

kernel's config
Comment 3 Andrew Morton 2006-10-09 11:58:13 UTC
On Mon, 9 Oct 2006 10:59:57 -0700
bugme-daemon@bugzilla.kernel.org wrote:

> Most recent kernel where this bug did not occur: 2.6.18 (vanilla)

You say it occurred in 2.6.18, and that it didn't occur in 2.6.18.

Please update the bug to tell us the most recent kernel version which did
_not_ have this bug, thanks.

Comment 4 Boris B. Zhmurov 2006-10-09 14:46:33 UTC
Most recent kernel where this bug did not occur: 2.6._16_ (vanilla)

My mistake, sorry. The kernel that did not have this bug is 2.6.16!
I tried 2.6.18 without the nvidia driver and the bug still exists. It did not
occur on 2.6.16 with the nvidia driver loaded.
Comment 5 Len Brown 2006-10-11 18:43:26 UTC
Please attach the dmesg and /proc/interrupts from the successful
2.6.16 boot.

Please test that the problem goes away with "acpi=off".

Please attach the output from lspci -vv
and the output from acpidump, available in the latest pmtools here:

http://ftp.kernel.org/pub/linux/kernel/people/lenb/acpi/utils/

Academic question:

> # CONFIG_X86_UP_APIC is not set
Curious why you have the IOAPIC disabled, the machine seems to have one:
> ACPI: MADT (v001 A M I  OEMAPIC  0x01000627 MSFT 0x00000097) @ 0x3ffd0390

Comment 6 Boris B. Zhmurov 2006-10-11 22:53:17 UTC
Created attachment 9228
dmesg of 2.6.16

2.6.16's dmesg
Comment 7 Boris B. Zhmurov 2006-10-11 22:54:13 UTC
Created attachment 9229
acpidump of 2.6.16

acpidump of 2.6.16
Comment 8 Boris B. Zhmurov 2006-10-11 22:54:59 UTC
Created attachment 9230
interrupts of 2.6.26

interrupts of 2.6.26
Comment 9 Boris B. Zhmurov 2006-10-11 23:01:11 UTC
It's strange, but on 2.6.16.29 now I see the same problem, even though it had
not happened for three or four months. :( And now I see it on 2.6.16. What
should I try? Should I enable some debug options in .config?
 
>> # CONFIG_X86_UP_APIC is not set
> Curious why you have the IOAPIC disabled, the machine seems to have one:

On one of my HP ProLiant DL320 servers, the machine froze completely at boot
time with that option enabled on a 2.6.12 kernel. Because the server is located
about 1000 kilometers away from me, I'm afraid to turn it on again :)
Comment 10 Boris B. Zhmurov 2006-10-11 23:02:12 UTC
Created attachment 9231
lspci -vv of 2.6.16

lspci -vv of 2.6.16
Comment 11 Boris B. Zhmurov 2006-10-12 06:55:35 UTC
When I boot with the "irqpoll" command-line option, the notebook hangs
completely after 2-3 hours. Only powering off helps... Now trying with acpi=off.
Comment 12 Boris B. Zhmurov 2006-10-12 12:47:57 UTC
same errors in dmesg with acpi=off
Comment 13 Len Brown 2006-10-12 20:37:04 UTC
> on 2.6.16.29 now I see the same problem 
 
Good to know this isn't a recent Linux regression. 
 
> IOAPIC  
  
It is safe to venture that enabling the IOAPIC on your laptop  
will not have any bad effect on your 2.6.12 proliant server 1000 km away:-) 
  
> ERR:     137813  
  
This is not good news, it means your 8259 is getting  
lots of spurious interrupts.  Curious that you don't  
see the following line in your dmesg to identify  
which IRQ is taking the spurious interrupts:  
  
mask_and_ack_8259A(unsigned int irq)  
...  
if (!(spurious_irq_mask & irqmask)) {  
  printk(KERN_DEBUG "spurious 8259A interrupt: IRQ%d.\n", irq);  
  spurious_irq_mask |= irqmask;  
}  
atomic_inc(&irq_err_count);  
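
A minimal userspace sketch of the bookkeeping in the snippet above, to show why
the ERR count can climb into the hundreds of thousands while dmesg carries at
most one "spurious 8259A interrupt" line per IRQ. This is only a model, not
kernel code: the function name note_spurious_8259a() and its return value are
hypothetical, and a plain counter stands in for the kernel's atomic_t.

```c
#include <assert.h>
#include <stdio.h>

/* Userspace model of the quoted logic; these are not the kernel's variables. */
static unsigned int spurious_irq_mask; /* one bit per IRQ already reported */
static unsigned long irq_err_count;    /* counts every spurious interrupt */

/*
 * Hypothetical helper: record one spurious interrupt on a legacy IRQ line.
 * Returns 1 if a message was printed, 0 if it was suppressed because this
 * IRQ had already been reported once.
 */
static int note_spurious_8259a(unsigned int irq)
{
    unsigned int irqmask;
    int logged = 0;

    assert(irq < 16); /* two cascaded 8259A PICs provide IRQ0..IRQ15 */
    irqmask = 1u << irq;

    if (!(spurious_irq_mask & irqmask)) {
        /* First spurious interrupt seen on this line: report it once. */
        printf("spurious 8259A interrupt: IRQ%u.\n", irq);
        spurious_irq_mask |= irqmask;
        logged = 1;
    }
    irq_err_count++; /* incremented whether or not a message was printed */
    return logged;
}
```

Calling note_spurious_8259a(7) repeatedly prints the message only on the first
call, yet every call increments irq_err_count, which is the quantity modelled
here for the ERR line of /proc/interrupts.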
  
> same errors in dmesg with acpi=off   
   
Good to know -- then by definition, this issue   
does not belong in the ACPI bugzilla category.   
   
> nvidia: module license 'NVIDIA' taints kernel    
    
Please re-open when you can reproduce the issue    
and supply debug output without this binary module loaded.    
    
