Bug 9994

Summary: atiixp ide timeouts
Product: IO/Storage
Reporter: Honza Fikar (jan.fikar)
Component: IDE
Assignee: io_ide (io_ide)
Status: CLOSED CODE_FIX
Severity: normal
CC: kernel
Priority: P1
Hardware: All
OS: Linux
Kernel Version: 2.6.24
Subsystem:
Regression: Yes
Bisected commit-id:

Description Honza Fikar 2008-02-14 16:17:11 UTC
Latest working kernel version:2.6.23
Earliest failing kernel version:2.6.24
Distribution:gentoo
Hardware Environment:Acer Aspire 5021
Software Environment:
Problem Description:
After some hours of uptime I find the following in dmesg, and DMA on my hard
drive has been switched off:
...
hda: irq timeout: status=0xd0 { Busy }
hda: irq timeout: status=0xd0 { Busy }
hda: status timeout: status=0xd0 { Busy }
hda: DMA disabled
...

I can switch it back on with hdparm -d1 /dev/hda, but after some hours the same
thing happens.
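
For reference, the status byte in these messages can be decoded by hand. The following is a minimal sketch, assuming the standard ATA status register bit layout; the function name decode_ata_status is made up for illustration, and only the names "Busy" and "SeekComplete" are taken from the log itself. Note that while the Busy bit is set the other bits are not meaningful, which is presumably why the kernel prints only { Busy } for 0xd0.

```shell
# Decode an ATA status byte into flag names.
# Assumption: standard ATA status register layout (bit 7 = BSY ... bit 0 = ERR).
# Flag names other than Busy and SeekComplete are illustrative.
decode_ata_status() {
    status=$(( $1 ))
    out=""
    [ $(( status & 0x80 )) -ne 0 ] && out="$out Busy"
    [ $(( status & 0x40 )) -ne 0 ] && out="$out DriveReady"
    [ $(( status & 0x20 )) -ne 0 ] && out="$out DeviceFault"
    [ $(( status & 0x10 )) -ne 0 ] && out="$out SeekComplete"
    [ $(( status & 0x08 )) -ne 0 ] && out="$out DataRequest"
    [ $(( status & 0x04 )) -ne 0 ] && out="$out CorrectedError"
    [ $(( status & 0x02 )) -ne 0 ] && out="$out Index"
    [ $(( status & 0x01 )) -ne 0 ] && out="$out Error"
    # Strip the leading separator before printing.
    echo "${out# }"
}

decode_ata_status 0xd0
```

Here 0xd0 has bits 0x80, 0x40 and 0x10 set, so the raw byte also carries the ready and seek-complete bits alongside Busy.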

Steps to reproduce: Just wait. The attached dmesg contains 26 ide0 resets during 17 hours of uptime.
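
The reset count can be pulled out of a saved log with grep. A minimal sketch, assuming the dmesg output was saved to a file first; the file name dmesg.txt and the function name count_ide_resets are placeholders.

```shell
# Count ide0 reset attempts (successful, failed or timed out) in a saved dmesg log.
# Usage: dmesg > dmesg.txt; count_ide_resets dmesg.txt
count_ide_resets() {
    # grep -c prints the number of matching lines.
    grep -c 'ide0: reset' "$1"
}
```

The pattern is deliberately not anchored to the start of the line, so it also matches logs that carry printk timestamps.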

lspci -v:
00:00.0 Host bridge: ATI Technologies Inc RS480 Host Bridge (rev 01)
        Subsystem: Acer Incorporated [ALI] Aspire 5024WLMMi
        Flags: bus master, 66MHz, medium devsel, latency 64

00:02.0 PCI bridge: ATI Technologies Inc RS480 PCI-X Root Port (prog-if 00 [Normal decode])
        Flags: bus master, fast devsel, latency 0
        Bus: primary=00, secondary=01, subordinate=01, sec-latency=0
        I/O behind bridge: 00009000-00009fff
        Memory behind bridge: c0100000-c01fffff
        Prefetchable memory behind bridge: c8000000-cfffffff
        Capabilities: [50] Power Management version 3
        Capabilities: [58] Express Root Port (Slot-), MSI 00
        Capabilities: [80] Message Signalled Interrupts: Mask- 64bit- Queue=0/0 Enable-
        Capabilities: [b0] Subsystem: ATI Technologies Inc Unknown device 5950
        Capabilities: [b8] HyperTransport: MSI Mapping Enable+ Fixed+

00:06.0 PCI bridge: ATI Technologies Inc RS480 PCI Bridge (prog-if 00 [Normal decode])
        Flags: bus master, fast devsel, latency 0
        Bus: primary=00, secondary=09, subordinate=09, sec-latency=0
        Capabilities: [50] Power Management version 3
        Capabilities: [58] Express Root Port (Slot-), MSI 00
        Capabilities: [80] Message Signalled Interrupts: Mask- 64bit- Queue=0/0 Enable-
        Capabilities: [b0] Subsystem: ATI Technologies Inc Unknown device 5950
        Capabilities: [b8] HyperTransport: MSI Mapping Enable+ Fixed+

00:07.0 PCI bridge: ATI Technologies Inc RS480 PCI Bridge (prog-if 00 [Normal decode])
        Flags: bus master, fast devsel, latency 0
        Bus: primary=00, secondary=04, subordinate=05, sec-latency=0
        Capabilities: [50] Power Management version 3
        Capabilities: [58] Express Root Port (Slot-), MSI 00
        Capabilities: [80] Message Signalled Interrupts: Mask- 64bit- Queue=0/0 Enable-
        Capabilities: [b0] Subsystem: ATI Technologies Inc Unknown device 5950
        Capabilities: [b8] HyperTransport: MSI Mapping Enable+ Fixed+

00:13.0 USB Controller: ATI Technologies Inc IXP SB400 USB Host Controller (prog-if 10 [OHCI])
        Subsystem: Acer Incorporated [ALI] Unknown device 0080
        Flags: bus master, 66MHz, medium devsel, latency 64, IRQ 11
        Memory at c0000000 (32-bit, non-prefetchable) [size=4K]
        Capabilities: [d0] Message Signalled Interrupts: Mask- 64bit- Queue=0/0 Enable-

00:13.1 USB Controller: ATI Technologies Inc IXP SB400 USB Host Controller (prog-if 10 [OHCI])
        Subsystem: Acer Incorporated [ALI] Aspire 5024WLMMi
        Flags: bus master, 66MHz, medium devsel, latency 64, IRQ 11
        Memory at c0001000 (32-bit, non-prefetchable) [size=4K]
        Capabilities: [d0] Message Signalled Interrupts: Mask- 64bit- Queue=0/0 Enable-

00:13.2 USB Controller: ATI Technologies Inc IXP SB400 USB2 Host Controller (prog-if 20 [EHCI])
        Subsystem: Acer Incorporated [ALI] Aspire 5024WLMMi
        Flags: 66MHz, medium devsel, IRQ 11
        Memory at c0002000 (32-bit, non-prefetchable) [size=4K]
        Capabilities: [dc] Power Management version 2
        Capabilities: [d0] Message Signalled Interrupts: Mask- 64bit- Queue=0/0 Enable-

00:14.0 SMBus: ATI Technologies Inc IXP SB400 SMBus Controller (rev 11)
        Subsystem: Acer Incorporated [ALI] Aspire 5024WLMMi
        Flags: 66MHz, medium devsel
        I/O ports at 8400 [size=16]
        Memory at c0003000 (32-bit, non-prefetchable) [size=1K]
        Capabilities: [b0] HyperTransport: MSI Mapping Enable- Fixed+

00:14.1 IDE interface: ATI Technologies Inc Standard Dual Channel PCI IDE Controller (prog-if 8a [Master SecP PriP])
        Subsystem: Acer Incorporated [ALI] Aspire 5024WLMMi
        Flags: bus master, 66MHz, medium devsel, latency 0, IRQ 16
        I/O ports at 01f0 [size=8]
        I/O ports at 03f4 [size=1]
        I/O ports at 0170 [size=8]
        I/O ports at 0374 [size=1]
        I/O ports at 8410 [size=16]
        Capabilities: [70] Message Signalled Interrupts: Mask- 64bit- Queue=0/0 Enable-

00:14.3 ISA bridge: ATI Technologies Inc IXP SB400 PCI-ISA Bridge
        Subsystem: Acer Incorporated [ALI] Aspire 5024WLMi
        Flags: bus master, 66MHz, medium devsel, latency 0

00:14.4 PCI bridge: ATI Technologies Inc IXP SB400 PCI-PCI Bridge (prog-if 01 [Subtractive decode])
        Flags: bus master, 66MHz, medium devsel, latency 64
        Bus: primary=00, secondary=06, subordinate=06, sec-latency=64
        I/O behind bridge: 0000a000-0000afff
        Memory behind bridge: c0200000-c02fffff

00:14.5 Multimedia audio controller: ATI Technologies Inc IXP SB400 AC'97 Audio Controller (rev 02)
        Subsystem: Acer Incorporated [ALI] Aspire 5024WLMMi
        Flags: bus master, 66MHz, slow devsel, latency 64, IRQ 17
        Memory at c0003400 (32-bit, non-prefetchable) [size=256]
        Capabilities: [40] Message Signalled Interrupts: Mask- 64bit- Queue=0/0 Enable-

00:14.6 Modem: ATI Technologies Inc SB400 AC'97 Modem Controller (rev 02) (prog-if 00 [Generic])
        Subsystem: Acer Incorporated [ALI] Aspire 5024WLMMi
        Flags: bus master, 66MHz, slow devsel, latency 64, IRQ 11
        Memory at c0003800 (32-bit, non-prefetchable) [size=256]
        Capabilities: [40] Message Signalled Interrupts: Mask- 64bit- Queue=0/0 Enable-

00:18.0 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration
        Flags: fast devsel
        Capabilities: [80] HyperTransport: Host or Secondary Interface

00:18.1 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map
        Flags: fast devsel

00:18.2 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller
        Flags: fast devsel

00:18.3 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control
        Flags: fast devsel

01:00.0 VGA compatible controller: ATI Technologies Inc Radeon Mobility X700 (PCIE) (prog-if 00 [VGA])
        Subsystem: Acer Incorporated [ALI] Aspire 5024WLMi
        Flags: bus master, fast devsel, latency 0, IRQ 18
        Memory at c8000000 (32-bit, prefetchable) [size=128M]
        I/O ports at 9000 [size=256]
        Memory at c0100000 (32-bit, non-prefetchable) [size=64K]
        [virtual] Expansion ROM at c0120000 [disabled] [size=128K]
        Capabilities: [50] Power Management version 2
        Capabilities: [58] Express Endpoint, MSI 00
        Capabilities: [80] Message Signalled Interrupts: Mask- 64bit+ Queue=0/0 Enable-

06:05.0 Network controller: Broadcom Corporation BCM4318 [AirForce One 54g] 802.11g Wireless LAN Controller (rev 02)
        Subsystem: AMBIT Microsystem Corp. Aspire 3022WLMi, 5024WLMi, 5020
        Flags: bus master, fast devsel, latency 64, IRQ 21
        Memory at c0204000 (32-bit, non-prefetchable) [size=8K]

06:06.0 CardBus bridge: Texas Instruments PCIxx21/x515 Cardbus Controller
        Subsystem: Acer Incorporated [ALI] Aspire 5024WLMi
        Flags: bus master, medium devsel, latency 64, IRQ 20
        Memory at c020a000 (32-bit, non-prefetchable) [size=4K]
        Bus: primary=06, secondary=07, subordinate=08, sec-latency=176
        Memory window 0: 60000000-63fff000 (prefetchable)
        Memory window 1: 64000000-67fff000 (prefetchable)
        I/O window 0: 0000a400-0000a4ff
        I/O window 1: 0000a800-0000a8ff
        16-bit legacy interface ports at 0001

06:06.2 FireWire (IEEE 1394): Texas Instruments OHCI Compliant IEEE 1394 Host Controller (prog-if 10 [OHCI])
        Subsystem: Acer Incorporated [ALI] Aspire 5024WLMi
        Flags: bus master, medium devsel, latency 64, IRQ 22
        Memory at c0208000 (32-bit, non-prefetchable) [size=2K]
        Memory at c0200000 (32-bit, non-prefetchable) [size=16K]
        Capabilities: [44] Power Management version 2

06:06.3 Mass storage controller: Texas Instruments PCIxx21 Integrated FlashMedia Controller
        Subsystem: Acer Incorporated [ALI] Aspire 5024WLMi
        Flags: bus master, medium devsel, latency 64, IRQ 11
        Memory at c0206000 (32-bit, non-prefetchable) [size=8K]
        Capabilities: [44] Power Management version 2

06:06.4 Generic system peripheral [0805]: Texas Instruments PCI6411/6421/6611/6621/7411/7421/7611/7621 Secure Digital Controller
        Subsystem: Acer Incorporated [ALI] Aspire 5024WLMi
        Flags: bus master, medium devsel, latency 64, IRQ 11
        Memory at c0209000 (32-bit, non-prefetchable) [size=256]
        Memory at c0208c00 (32-bit, non-prefetchable) [size=256]
        Memory at c0208800 (32-bit, non-prefetchable) [size=256]
        Capabilities: [80] Power Management version 2

06:07.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8169 Gigabit Ethernet (rev 10)
        Subsystem: Acer Incorporated [ALI] Aspire 5024WLMi
        Flags: bus master, 66MHz, medium devsel, latency 64, IRQ 11
        I/O ports at a000 [size=256]
        Memory at c0209400 (32-bit, non-prefetchable) [size=256]
        [virtual] Expansion ROM at c0220000 [disabled] [size=128K]
        Capabilities: [dc] Power Management version 2

dmesg:
Linux version 2.6.24-gentoo-r2 (root@fuf) (gcc version 4.2.2 (Gentoo 4.2.2 p1.0)) #2 PREEMPT Mon Feb 11 18:58:21 CET 2008
Command line: root=/dev/hda5 udev powernow-k8.vcore_list=1125,925  usbcore.autosuspend=1 hpet=force
BIOS-provided physical RAM map:
 BIOS-e820: 0000000000000000 - 000000000009b800 (usable)
 BIOS-e820: 000000000009b800 - 00000000000a0000 (reserved)
 BIOS-e820: 00000000000d0000 - 0000000000100000 (reserved)
 BIOS-e820: 0000000000100000 - 000000004fea0000 (usable)
 BIOS-e820: 000000004fea0000 - 000000004feae000 (ACPI data)
 BIOS-e820: 000000004feae000 - 000000004ff00000 (ACPI NVS)
 BIOS-e820: 000000004ff00000 - 0000000050000000 (reserved)
 BIOS-e820: 00000000fec00000 - 00000000fec10000 (reserved)
 BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
 BIOS-e820: 00000000fff80000 - 0000000100000000 (reserved)
Entering add_active_range(0, 0, 155) 0 entries of 256 used
Entering add_active_range(0, 256, 327328) 1 entries of 256 used
end_pfn_map = 1048576
DMI 2.3 present.
ACPI: RSDP 000F6DA0, 0014 (r0 PTLTD )
ACPI: RSDT 4FEA81E3, 0034 (r1 PTLTD    RSDT    6040000  LTP        0)
ACPI: FACP 4FEADE41, 0074 (r1 ATI    Piranha   6040000 ATI     F4240)
ACPI: DSDT 4FEA8217, 5C2A (r1    ATI    SB400  6040000 MSFT  100000E)
ACPI: FACS 4FEAEFC0, 0040
ACPI: SSDT 4FEADEB5, 00B5 (r1 PTLTD  POWERNOW  6040000  LTP        1)
ACPI: APIC 4FEADF6A, 005A (r1 PTLTD      APIC    6040000  LTP        0)
ACPI: MCFG 4FEADFC4, 003C (r1 PTLTD    MCFG    6040000  LTP        0)
ACPI: DMI detected: Acer
Entering add_active_range(0, 0, 155) 0 entries of 256 used
Entering add_active_range(0, 256, 327328) 1 entries of 256 used
Zone PFN ranges:
  DMA             0 ->     4096
  DMA32        4096 ->  1048576
  Normal    1048576 ->  1048576
Movable zone start PFN for each node
early_node_map[2] active PFN ranges
    0:        0 ->      155
    0:      256 ->   327328
On node 0 totalpages: 327227
  DMA zone: 56 pages used for memmap
  DMA zone: 1371 pages reserved
  DMA zone: 2568 pages, LIFO batch:0
  DMA32 zone: 4419 pages used for memmap
  DMA32 zone: 318813 pages, LIFO batch:31
  Normal zone: 0 pages used for memmap
  Movable zone: 0 pages used for memmap
ATI board detected. Disabling timer routing over 8254.
ACPI: PM-Timer IO Port: 0x8008
ACPI: Local APIC address 0xfee00000
ACPI: LAPIC (acpi_id[0x00] lapic_id[0x00] enabled)
Processor #0 (Bootup-CPU)
ACPI: LAPIC_NMI (acpi_id[0x00] high edge lint[0x1])
ACPI: IOAPIC (id[0x01] address[0xfec00000] gsi_base[0])
IOAPIC[0]: apic_id 1, address 0xfec00000, GSI 0-23
ACPI: INT_SRC_OVR (bus 0 bus_irq 0 global_irq 2 high edge)
ACPI: INT_SRC_OVR (bus 0 bus_irq 9 global_irq 21 low level)
ACPI: IRQ0 used by override.
ACPI: IRQ2 used by override.
Setting APIC routing to flat
Using ACPI (MADT) for SMP configuration information
Allocating PCI resources starting at 60000000 (gap: 50000000:aec00000)
Built 1 zonelists in Zone order, mobility grouping on.  Total pages: 321381
Kernel command line: root=/dev/hda5 udev powernow-k8.vcore_list=1125,925  usbcore.autosuspend=1 hpet=force
Initializing CPU#0
PID hash table entries: 4096 (order: 12, 32768 bytes)
TSC calibrated against PM_TIMER
time.c: Detected 1599.999 MHz processor.
Console: colour VGA+ 80x25
console [tty0] enabled
Dentry cache hash table entries: 262144 (order: 9, 2097152 bytes)
Inode-cache hash table entries: 131072 (order: 8, 1048576 bytes)
Checking aperture...
CPU 0: aperture @ 180000000 size 32 MB
Aperture too small (32 MB)
No AGP bridge found
Memory: 1282044k/1309312k available (3486k kernel code, 26556k reserved, 1281k data, 216k init)
Calibrating delay using timer specific routine.. 3202.05 BogoMIPS (lpj=1601028)
Mount-cache hash table entries: 256
CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
CPU: L2 Cache: 512K (64 bytes/line)
CPU: AMD Turion(tm) 64 Mobile Technology ML-28 stepping 02
ACPI: Core revision 20070126
ACPI: Table DSDT replaced by host OS
ACPI: DSDT 00000000, 5624 (r1    ATI    SB400  6040000 INTL 20061109)
ACPI: DSDT override uses original SSDTs unless "acpi_no_auto_ssdt"<3>..MP-BIOS bug: 8254 timer not connected to IO-APIC
Using local APIC timer interrupts.
APIC timer calibration result 12499988
Detected 12.499 MHz APIC timer.
net_namespace: 120 bytes
NET: Registered protocol family 16
ACPI: bus type pci registered
PCI: BIOS Bug: MCFG area at e0000000 is not E820-reserved
PCI: Not using MMCONFIG.
PCI: Using configuration type 1
ACPI: EC: Look up EC in DSDT
ACPI: Interpreter enabled
ACPI: (supports S0 S3 S5)
ACPI: Using IOAPIC for interrupt routing
ACPI: EC: non-query interrupt received, switching to interrupt mode
ACPI: EC: GPE = 0x3, I/O: command/status = 0x66, data = 0x62
ACPI: EC: driver started in interrupt mode
ACPI: PCI Root Bridge [PCI0] (0000:00)
PCI: Transparent bridge - 0000:00:14.4
ACPI: PCI Interrupt Routing Table [\_SB_.PCI0._PRT]
ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.PB6_._PRT]
ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.PB7_._PRT]
ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.EXP1._PRT]
ACPI: PCI Interrupt Routing Table [\_SB_.PCI0.P2P_._PRT]
ACPI: PCI Interrupt Link [LNKA] (IRQs 10 11) *0, disabled.
ACPI: PCI Interrupt Link [LNKB] (IRQs 10 11) *0, disabled.
ACPI: PCI Interrupt Link [LNKC] (IRQs 10 11) *0, disabled.
ACPI: PCI Interrupt Link [LNKD] (IRQs 10 11) *0, disabled.
ACPI: PCI Interrupt Link [LNKE] (IRQs 10 11) *0, disabled.
ACPI: PCI Interrupt Link [LNKF] (IRQs 10 11) *0, disabled.
ACPI: PCI Interrupt Link [LNKG] (IRQs 10 11) *0, disabled.
ACPI: PCI Interrupt Link [LNKH] (IRQs 10 11) *0, disabled.
Linux Plug and Play Support v0.97 (c) Adam Belay
pnp: PnP ACPI init
ACPI: bus type pnp registered
pnp: PnP ACPI: found 10 devices
ACPI: ACPI bus type pnp unregistered
SCSI subsystem initialized
usbcore: registered new interface driver usbfs
usbcore: registered new interface driver hub
usbcore: registered new device driver usb
PCI: Using ACPI for IRQ routing
PCI: If a device doesn't work, try "pci=routeirq".  If it helps, post a report
PCI: Cannot allocate resource region 7 of bridge 0000:00:06.0
PCI: Cannot allocate resource region 8 of bridge 0000:00:06.0
PCI: Cannot allocate resource region 7 of bridge 0000:00:07.0
PCI: Cannot allocate resource region 8 of bridge 0000:00:07.0
ACPI: RTC can wake from S4
Time: tsc clocksource has been installed.
system 00:01: iomem range 0xe0000000-0xefffffff has been reserved
system 00:01: iomem range 0xfec00000-0xfec00fff could not be reserved
system 00:01: iomem range 0xfee00000-0xfee00fff could not be reserved
system 00:08: ioport range 0x1080-0x1080 has been reserved
system 00:08: ioport range 0x1200-0x120f has been reserved
system 00:08: ioport range 0x500-0x51f has been reserved
system 00:08: ioport range 0x40b-0x40b has been reserved
system 00:08: ioport range 0x4d0-0x4d1 has been reserved
system 00:08: ioport range 0x4d6-0x4d6 has been reserved
system 00:08: ioport range 0xc00-0xc01 has been reserved
system 00:08: ioport range 0xc14-0xc14 has been reserved
system 00:08: ioport range 0xc50-0xc52 has been reserved
system 00:08: ioport range 0xc6c-0xc6c has been reserved
system 00:08: ioport range 0xc6f-0xc6f has been reserved
system 00:08: ioport range 0xcd4-0xcd5 has been reserved
system 00:08: ioport range 0xcd6-0xcd7 has been reserved
system 00:08: ioport range 0xcd8-0xcdf has been reserved
system 00:08: ioport range 0x8000-0x805f has been reserved
system 00:08: ioport range 0xf40-0xf47 has been reserved
system 00:08: ioport range 0x87f-0x87f has been reserved
system 00:09: iomem range 0xe0000-0xfffff could not be reserved
system 00:09: iomem range 0xfff80000-0xffffffff could not be reserved
system 00:09: iomem range 0x0-0xfff could not be reserved
PCI: Bridge: 0000:00:02.0
  IO window: 9000-9fff
  MEM window: c0100000-c01fffff
  PREFETCH window: c8000000-cfffffff
PCI: Bridge: 0000:00:06.0
  IO window: disabled.
  MEM window: disabled.
  PREFETCH window: disabled.
PCI: Bridge: 0000:00:07.0
  IO window: disabled.
  MEM window: disabled.
  PREFETCH window: disabled.
PCI: Bus 7, cardbus bridge: 0000:06:06.0
  IO window: 0000a400-0000a4ff
  IO window: 0000a800-0000a8ff
  PREFETCH window: 60000000-63ffffff
  MEM window: 64000000-67ffffff
PCI: Bridge: 0000:00:14.4
  IO window: a000-afff
  MEM window: c0200000-c02fffff
  PREFETCH window: disabled.
PCI: Setting latency timer of device 0000:00:02.0 to 64
PCI: Setting latency timer of device 0000:00:06.0 to 64
PCI: Setting latency timer of device 0000:00:07.0 to 64
ACPI: PCI Interrupt 0000:06:06.0[A] -> GSI 20 (level, low) -> IRQ 20
NET: Registered protocol family 2
IP route cache hash table entries: 65536 (order: 7, 524288 bytes)
TCP established hash table entries: 262144 (order: 10, 4194304 bytes)
TCP bind hash table entries: 65536 (order: 7, 524288 bytes)
TCP: Hash tables configured (established 262144 bind 65536)
TCP reno registered
SGI XFS with large block/inode numbers, no debug enabled
io scheduler noop registered
io scheduler anticipatory registered
io scheduler deadline registered
io scheduler cfq registered (default)
PCI: MSI quirk detected. MSI deactivated.
Boot video device is 0000:01:00.0
PCI: Setting latency timer of device 0000:00:02.0 to 64
assign_interrupt_mode Found MSI capability
Allocate Port Service[0000:00:02.0:pcie00]
PCI: Setting latency timer of device 0000:00:06.0 to 64
assign_interrupt_mode Found MSI capability
Allocate Port Service[0000:00:06.0:pcie00]
PCI: Setting latency timer of device 0000:00:07.0 to 64
assign_interrupt_mode Found MSI capability
Allocate Port Service[0000:00:07.0:pcie00]
Generic RTC Driver v1.07
Linux agpgart interface v0.102
ACPI: AC Adapter [ADP1] (on-line)
ACPI: Battery Slot [BAT0] (battery absent)
input: Power Button (FF) as /devices/virtual/input/input0
ACPI: Power Button (FF) [PWRF]
input: Lid Switch as /devices/virtual/input/input1
ACPI: Lid Switch [LID0]
input: Sleep Button (CM) as /devices/virtual/input/input2
ACPI: Sleep Button (CM) [SLPB]
ACPI: CPU0 (power states: C1[C1] C2[C2] C3[C3])
ACPI: Thermal Zone [TZS0] (57 C)
ACPI: Thermal Zone [TZS1] (53 C)
ACPI: Thermal Zone [TZSV] (65 C)
loop: module loaded
Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
ATIIXP: IDE controller (0x1002:0x4376 rev 0x00) at  PCI slot 0000:00:14.1
ACPI: PCI Interrupt 0000:00:14.1[A] -> GSI 16 (level, low) -> IRQ 16
ATIIXP: not 100% native mode: will probe irqs later
    ide0: BM-DMA at 0x8410-0x8417, BIOS settings: hda:DMA, hdb:pio
    ide1: BM-DMA at 0x8418-0x841f, BIOS settings: hdc:DMA, hdd:pio
Probing IDE interface ide0...
Switched to high resolution mode on CPU 0
hda: ST9160821A, ATA DISK drive
hda: host max PIO4 wanted PIO255(auto-tune) selected PIO4
hda: UDMA/100 mode selected
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
Probing IDE interface ide1...
hdc: MATSHITAUJ-845D, ATAPI CD/DVD-ROM drive
hdc: host max PIO4 wanted PIO255(auto-tune) selected PIO4
hdc: UDMA/33 mode selected
ide1 at 0x170-0x177,0x376 on irq 15
hda: max request size: 512KiB
hda: 312581808 sectors (160041 MB) w/8192KiB Cache, CHS=19457/255/63
hda: cache flushes supported
 hda: hda1 hda2 hda3 < hda5 hda6 >
hdc: ATAPI 24X DVD-ROM DVD-R CD-R/RW drive, 2048kB Cache
Uniform CD-ROM driver Revision: 3.20
Driver 'sd' needs updating - please use bus_type methods
ACPI: PCI Interrupt 0000:06:06.2[C] -> GSI 22 (level, low) -> IRQ 22
firewire_ohci: Added fw-ohci device 0000:06:06.2, OHCI version 1.10
PNP: PS/2 Controller [PNP0303:PS2K,PNP0f13:PS2M] at 0x60,0x64 irq 1,12
i8042.c: Detected active multiplexing controller, rev 1.1.
serio: i8042 KBD port at 0x60,0x64 irq 1
serio: i8042 AUX0 port at 0x60,0x64 irq 12
serio: i8042 AUX1 port at 0x60,0x64 irq 12
serio: i8042 AUX2 port at 0x60,0x64 irq 12
serio: i8042 AUX3 port at 0x60,0x64 irq 12
mice: PS/2 mouse device common for all mice
input: AT Translated Set 2 keyboard as /devices/platform/i8042/serio0/input/input3
rtc_cmos 00:04: rtc core: registered rtc_cmos as rtc0
rtc0: alarms up to one month
i2c /dev entries driver
device-mapper: ioctl: 4.12.0-ioctl (2007-10-02) initialised: dm-devel@redhat.com
cpuidle: using governor ladder
firewire_core: created new fw device fw0 (0 config rom retries, S400)
Synaptics Touchpad, model: 1, fw: 6.2, id: 0x12a0b1, caps: 0xa04713/0x204000
input: SynPS/2 Synaptics TouchPad as /devices/platform/i8042/serio4/input/input4
cpuidle: using governor menu
Advanced Linux Sound Architecture Driver Version 1.0.15 (Tue Nov 20 19:16:42 2007 UTC).
ALSA device list:
  No soundcards found.
TCP cubic registered
NET: Registered protocol family 1
NET: Registered protocol family 17
RPC: Registered udp transport module.
RPC: Registered tcp transport module.
powernow-k8: Found 1 AMD Turion(tm) 64 Mobile Technology ML-28 processors (1 cpu cores) (version 2.20.00)
powernow-k8: Requested Voltage for id0 is 1125 mV default voltage is 1450
powernow-k8: Performing required adjustment of 325 mV  or 13 25mV steps
powernow-k8: Requested Voltage for id1 is 925 mV default voltage is 1000
powernow-k8: Performing required adjustment of 75 mV  or 3 25mV steps
powernow-k8:    0 : fid 0x8 (1600 MHz), vid 0x11
powernow-k8:    1 : fid 0x0 (800 MHz), vid 0x19
rtc_cmos 00:04: setting system clock to 2008-02-14 08:47:44 UTC (1202978864)
Marking TSC unstable due to TSC halts in idle
Time: acpi_pm clocksource has been installed.
XFS mounting filesystem hda5
Starting XFS recovery on filesystem: hda5 (logdev: internal)
Ending XFS recovery on filesystem: hda5 (logdev: internal)
VFS: Mounted root (xfs filesystem) readonly.
Freeing unused kernel memory: 216k freed
Clocksource tsc unstable (delta = -91005859 ns)
ACPI: WMI-Acer: Mapper loaded
acer_acpi: Acer Laptop ACPI Extras version 0.10.0
acer_acpi: Detected Acer AMW0 interface
Registered led device: acer_acpi:mail
fuse init (API version 7.9)
ndiswrapper version 1.52 loaded (smp=no, preempt=yes)
ndiswrapper (link_pe_images:576): fixing KI_USER_SHARED_DATA address in the driver
ndiswrapper: driver bcmwl5 (Broadcom,02/11/2005, 3.100.64.0) loaded
ACPI: PCI Interrupt 0000:06:05.0[A] -> GSI 21 (level, low) -> IRQ 21
ndiswrapper: using IRQ 21
wlan0: ethernet device 00:0e:9b:bf:b0:b8 using NDIS driver: bcmwl5, version: 0x364400a, NDIS version: 0x501, vendor: 'NDIS Network Adapter', 14E4:4318.5.conf
wlan0: encryption modes supported: WEP; TKIP with WPA, WPA2, WPA2PSK; AES/CCMP with WPA, WPA2, WPA2PSK
usbcore: registered new interface driver ndiswrapper
Adding 2449872k swap on /dev/mapper/crypt-swap.  Priority:-1 extents:1 across:2449872k
ACPI: PCI Interrupt 0000:00:14.5[B] -> GSI 17 (level, low) -> IRQ 17
ndiswrapper: device wlan0 removed
ACPI: PCI interrupt for device 0000:06:05.0 disabled
usbcore: deregistering interface driver ndiswrapper
ndiswrapper version 1.52 loaded (smp=no, preempt=yes)
ndiswrapper (link_pe_images:576): fixing KI_USER_SHARED_DATA address in the driver
ndiswrapper: driver bcmwl5 (Broadcom,02/11/2005, 3.100.64.0) loaded
ACPI: PCI Interrupt 0000:06:05.0[A] -> GSI 21 (level, low) -> IRQ 21
ndiswrapper: using IRQ 21
wlan0: ethernet device 00:0e:9b:bf:b0:b8 using NDIS driver: bcmwl5, version: 0x364400a, NDIS version: 0x501, vendor: 'NDIS Network Adapter', 14E4:4318.5.conf
wlan0: encryption modes supported: WEP; TKIP with WPA, WPA2, WPA2PSK; AES/CCMP with WPA, WPA2, WPA2PSK
usbcore: registered new interface driver ndiswrapper
[drm] Initialized drm 1.1.0 20060810
ACPI: PCI Interrupt 0000:01:00.0[A] -> GSI 18 (level, low) -> IRQ 18
PCI: Setting latency timer of device 0000:01:00.0 to 64
[drm] Initialized radeon 1.28.0 20060524 on minor 0
[drm] Setting GART location based on new memory map
[drm] Loading R300 Microcode
[drm] writeback test succeeded in 1 usecs
XFS mounting filesystem hda6
Starting XFS recovery on filesystem: hda6 (logdev: internal)
Ending XFS recovery on filesystem: hda6 (logdev: internal)
APIC error on CPU0: 00(40)
hda: irq timeout: status=0xd0 { Busy }
ide: failed opcode was: 0xea
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: DMA disabled
hda: drive not ready for command
ide0: reset: master: error (0x00?)
hda: status error: status=0x00 { }
ide: failed opcode was: unknown
hda: drive not ready for command
hda: status error: status=0x00 { }
ide: failed opcode was: unknown
hda: drive not ready for command
hda: status error: status=0x10 { SeekComplete }
ide: failed opcode was: unknown
hda: drive not ready for command
hda: status error: status=0x10 { SeekComplete }
ide: failed opcode was: unknown
hda: drive not ready for command
ide0: reset: success
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: irq timeout: status=0xd0 { Busy }
ide: failed opcode was: 0xea
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: drive not ready for command
ide0: reset: success
APIC error on CPU0: 40(40)
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: irq timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
ide0: reset timed-out, status=0x80
hda: status timeout: status=0x80 { Busy }
ide: failed opcode was: unknown
hda: drive not ready for command
ide0: reset: success
hda: irq timeout: status=0xd0 { Busy }
ide: failed opcode was: 0xea
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: drive not ready for command
ide0: reset timed-out, status=0x80
hda: status timeout: status=0x80 { Busy }
ide: failed opcode was: unknown
hda: drive not ready for command
ide0: reset: success
hda: irq timeout: status=0xd0 { Busy }
ide: failed opcode was: 0xea
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: drive not ready for command
ide0: reset timed-out, status=0x80
hda: status timeout: status=0x80 { Busy }
ide: failed opcode was: unknown
hda: drive not ready for command
ide0: reset: success
hda: irq timeout: status=0xd0 { Busy }
ide: failed opcode was: 0xea
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
psmouse.c: TouchPad at isa0060/serio4/input0 lost synchronization, throwing 2 bytes away.
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
hda: status timeout: status=0xd0 { Busy }
ide: failed opcode was: unknown
hda: no DRQ after issuing MULTWRITE_EXT
ide0: reset: success
Comment 1 Bartlomiej Zolnierkiewicz 2008-02-16 10:21:22 UTC
Please try using git-bisect to narrow down the problem to the specific commit.
Install git package, get git tree from kernel.org and do:

git bisect start
git bisect good v2.6.23
git bisect bad v2.6.24

It will select the kernel to test. Compile and boot it to see if the problem is still there. If so, do "git bisect bad", which will give you a new kernel to test. If the kernel works fine, do "git bisect good" instead. After a few iterations you should find the exact commit which introduced the bug.

Thanks.
Comment 2 Honza Fikar 2008-03-09 12:38:19 UTC
I tried the bisection and got a result that does not seem right to me, but maybe it is.

The problem is that the bug does not show up immediately, only after a couple of hours. So I waited at least 6 hours for the bug before I did "git bisect good". That made the whole bisection slow, and maybe 6 hours was still not enough...

I will try once again and I will wait longer. Anyway the result is here:

2421ba5b57ddbc3a972b9d6fb884817c39d2fff7 is first bad commit
commit 2421ba5b57ddbc3a972b9d6fb884817c39d2fff7
Author: Kyle McMartin <kyle@mako.i.cabal.ca>
Date:   Wed Nov 28 02:17:53 2007 -0500

    [PARISC] timer interrupt should not be IRQ_DISABLED

    The timer interrupt had accidentally been marked IRQ_DISABLED since
    IRQ_PER_CPU had been OR-ed in, instead of set. This had been working
    by accident for quite a while.

    Commit c642b8391cf8efc3622cc97329a0f46e7cbb70b8 changed the behaviour of
    IRQ_PER_CPU interrupts, which previously weren't checked for IRQ_DISABLED.

    Signed-off-by: Kyle McMartin <kyle@mcmartin.ca>

:040000 040000 e77168c15d33749d5072ee1f6db59d05c128c5c6 426eb6ed185908219e7469fcfedf53d405b720d6 M      arch
Comment 3 Bartlomiej Zolnierkiewicz 2008-03-09 12:50:54 UTC
Unfortunately this commit is not the one we are looking for, as it only affects the PARISC architecture :/

PS I assume that 2.6.25-rc4 still has this problem?
Comment 4 Honza Fikar 2008-03-09 12:55:35 UTC
I am afraid I missed the bug at least once, so the result is wrong :(
I'll check 2.6.25-rc4 first and then I'll try to redo the bisection.
Comment 5 Daniel Drake 2008-03-11 10:16:51 UTC
original downstream report: http://bugs.gentoo.org/show_bug.cgi?id=209786
Comment 6 Honza Fikar 2008-04-11 02:40:41 UTC
The latest results are not very good...
Bisecting and waiting for at least 24 hours leads to a different wrong result:

f435a91e66e7776f0c73fca5af3cb87c61130ed6 is first bad commit
commit f435a91e66e7776f0c73fca5af3cb87c61130ed6
Author: Ralf Baechle <ralf@linux-mips.org>
Date:   Thu Dec 6 17:15:57 2007 +0000

    [MIPS] BCM1480: Fix interrupt routing, take 2.

    Signed-off-by: Ralf Baechle <ralf@linux-mips.org>

:040000 040000 73dbf425d802ab4d4a1c6fd4776df2b9d0086043 0fa229550fa638511ff9f9edde20f0329d650174 M      arch

Then I tried kernel 2.6.25-rc6 and unfortunately I still see the error, although it seems to be triggered even less often. The first time I saw a single DMA timeout in 4 days, without DMA being disabled. The second time it took three days, and the last time I saw the real problem: a lot of timeouts, resets and DMA being disabled on hda within two days.


Is it possible to test only the atiixp-related patches instead of bisecting everything? If I need to wait, say, 5 days per kernel, bisecting would take two months in the worst case :(
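
The two-month figure matches a back-of-the-envelope estimate: bisection needs roughly ceil(log2(N)) test kernels for N candidate commits. A minimal sketch; the commit count of 10000 is an assumed round number for the 2.6.23 to 2.6.24 range, not a measured one (running "git rev-list --count v2.6.23..v2.6.24" in the kernel tree would give the real figure), and bisect_steps is a made-up helper name.

```shell
# Number of bisect steps for N commits: the smallest k with 2^k >= N.
bisect_steps() {
    n=$1
    steps=0
    span=1
    while [ "$span" -lt "$n" ]; do
        span=$(( span * 2 ))
        steps=$(( steps + 1 ))
    done
    echo "$steps"
}

commits=10000      # assumption: rough size of the v2.6.23..v2.6.24 range
days_per_test=5    # waiting time per kernel, as estimated above
steps=$(bisect_steps "$commits")
echo "$steps steps, about $(( steps * days_per_test )) days"
```

With these assumed numbers the script prints "14 steps, about 70 days", which is a bit over two months.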
Comment 7 Bartlomiej Zolnierkiewicz 2008-04-11 03:34:45 UTC
You can do this by appending " -- drivers/ide/pci/atiixp.c" to the end of the "git bisect start" command, but I worry that it may be fruitless since there were very few atiixp changes.

Hmmm, it is also possible that these timeouts are caused by IRQ routing problems and not by IDE changes:

> APIC error on CPU0: 00(40)

Does this error always show up just before IDE timeout?
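
For reference, a path-limited bisect passes the paths after a "--" separator to "git bisect start". A minimal sketch, assuming a kernel git tree that carries the usual v2.6.23 and v2.6.24 release tags; bisect_paths is a made-up wrapper name, not a git command.

```shell
# Start a bisect that only considers commits touching the given paths.
# Usage: bisect_paths <bad-rev> <good-rev> <path>...
bisect_paths() {
    bad=$1
    good=$2
    shift 2
    git bisect start "$bad" "$good" -- "$@"
}

# Example (run inside the kernel tree; tags are assumed to exist):
#   bisect_paths v2.6.24 v2.6.23 drivers/ide/pci/atiixp.c
#   ...build, boot and wait, then mark the result:
#   git bisect good     # or: git bisect bad
#   git bisect reset    # when finished, to return to the original branch
```

Limiting the paths cuts the number of kernels to test sharply, but it finds the culprit only if the regression really came from a change to that file.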
Comment 8 Honza Fikar 2008-04-11 04:32:47 UTC
I'll try...

I'm not sure about the APIC error: in earlier kernels (up to 2.6.22 or 2.6.23, if I remember correctly) the log was full of those APIC errors. They appeared at a rate of two or three per hour, and I thought they were harmless. Anyway, at that time there were no problems with atiixp.

Then, with the upgrade to 2.6.22 or 2.6.23, they no longer came so often, maybe once per day or so, and I still ignored them. In fact I'm not sure whether the problem was corrected or only silenced. There were still no atiixp problems. The atiixp timeouts started in 2.6.24.

I think I can check your hypothesis, maybe you are right. I remember that at least in the 2.6.25-rc6 case with the single DMA timeout the APIC error was in the log just before. However I'm not sure how long before. I think I can switch on the timestamps.

And if it really is an IRQ routing problem, shouldn't I try to disable the APIC with the noapic kernel parameter or something like that?
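
With timestamps switched on (for example via the printk.time=1 boot parameter), the check could be scripted: for every IDE irq timeout, print how long before it the last APIC error was logged. A minimal sketch using awk, assuming the usual "[seconds.micros] message" line prefix in a saved log; the function name apic_before_timeout and the file name are placeholders.

```shell
# For each "irq timeout" line, report the time since the most recent "APIC error".
# Assumption: lines carry printk timestamps, e.g.
#   [ 1234.567890] hda: irq timeout: status=0xd0 { Busy }
# Usage: dmesg > dmesg.txt; apic_before_timeout dmesg.txt
apic_before_timeout() {
    awk '
        {
            # Extract the timestamp between "[" and the first "]".
            ts = $0
            sub(/^\[ */, "", ts)
            sub(/\].*$/, "", ts)
        }
        /APIC error/  { last_apic = ts; seen = 1 }
        /irq timeout/ {
            if (seen)
                printf "timeout at %.3f, last APIC error %.3f s earlier\n", ts, ts - last_apic
            else
                printf "timeout at %.3f, no APIC error seen before\n", ts
        }
    ' "$1"
}
```

If the reported gaps are consistently small, that would support the IRQ routing hypothesis; large or missing gaps would point elsewhere.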
Comment 9 Daniel Drake 2008-12-04 06:03:01 UTC
Any updates here? Is this still a problem on the latest development release, currently v2.6.28-rc7?
Comment 10 Honza Fikar 2008-12-04 06:20:03 UTC
Sorry for the silence, the bug has meanwhile gone! It disappeared during 2.6.26, I guess. I'm now on 2.6.27 and haven't seen the bug for a long time.

So it is solved, but I don't know by what exactly.