Bug 2864 - Abnormal IOwait on 2.6 kernels causing machine to freeze
Summary: Abnormal IOwait on 2.6 kernels causing machine to freeze
Status: REJECTED UNREPRODUCIBLE
Alias: None
Product: IO/Storage
Classification: Unclassified
Component: Block Layer (show other bugs)
Hardware: i386 Linux
: P2 blocking
Assignee: Jens Axboe
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2004-06-11 00:07 UTC by Guy Van Sanden
Modified: 2009-12-12 17:15 UTC (History)
7 users (show)

See Also:
Kernel Version: 2.6.6
Subsystem:
Regression: Yes
Bisected commit-id:


Attachments

Description Guy Van Sanden 2004-06-11 00:07:19 UTC
Distribution: Gentoo Linux 1.4
Hardware Environment: 
1. Abit KD7A (Via KT400) with Athlon XP-2500 (IBM disk)
2. ECS K7S5A (Sis735) with Athlon XP-1700 (Maxtor disk)
Software Environment:
Running udev
Problem Description:
when running an io-operation such as a copy or bonnie, 2.6 kernels produce a
rising level of iowait, freezing the machine when iowait reaches 99%.
See iostat dumps below.
Any 2.4 kernel on the same machine works fine (0% iowait) under the same load

Steps to reproduce:
1. Reproduced on two of my machines
1a. Rerproducable on 2.6.5, 2.6.6, Andrew Morton patchset , ...
1b. A friend running the same distro and 2.6.6 vanilla on another chipset tested
this too, his machine did *not* generate so much iowait (3%, still higher then 2.4).
2. Start an IO process such as bonnie while running iostat or top
3. the level of iowait rises, reaching 99%, then the machine stops responding to
anything.

-- IOSTAT (iostat -c 1) --
avg-cpu:  %user   %nice    %sys %iowait   %idle
          26.73    0.00   10.89   62.38    0.00

avg-cpu:  %user   %nice    %sys %iowait   %idle
           0.00    0.00   13.00   87.00    0.00

avg-cpu:  %user   %nice    %sys %iowait   %idle
           1.01    0.00   16.16   82.83    0.00

avg-cpu:  %user   %nice    %sys %iowait   %idle
           0.00    0.00   11.88   88.12    0.00

avg-cpu:  %user   %nice    %sys %iowait   %idle
           0.00    0.00    1.01   98.99    0.00

avg-cpu:  %user   %nice    %sys %iowait   %idle
           0.99    0.00    2.97   96.04    0.00

avg-cpu:  %user   %nice    %sys %iowait   %idle
           0.00    0.00   37.00   63.00    0.00
Comment 1 Bartlomiej Zolnierkiewicz 2004-06-11 06:16:31 UTC
dmesg output please
Comment 2 Guy Van Sanden 2004-06-14 01:32:18 UTC
Linux version 2.6.6 (root@nocturnus) (gcc version 3.3.2 20031218 (Gentoo Linux
3.3.2-r5, propolice-3.3-7)) #2 Thu Jun 10 16:11:57 CEST 2004
BIOS-provided physical RAM map:
 BIOS-e820: 0000000000000000 - 00000000000a0000 (usable)
 BIOS-e820: 00000000000f0000 - 0000000000100000 (reserved)
 BIOS-e820: 0000000000100000 - 000000000fef0000 (usable)
 BIOS-e820: 000000000fef0000 - 000000000fef3000 (ACPI NVS)
 BIOS-e820: 000000000fef3000 - 000000000ff00000 (ACPI data)
 BIOS-e820: 00000000fec00000 - 00000000fec01000 (reserved)
 BIOS-e820: 00000000fee00000 - 00000000fee01000 (reserved)
 BIOS-e820: 00000000ffff0000 - 0000000100000000 (reserved)
254MB LOWMEM available.
found SMP MP-table at 000f5ec0
On node 0 totalpages: 65264
  DMA zone: 4096 pages, LIFO batch:1
  Normal zone: 61168 pages, LIFO batch:14
  HighMem zone: 0 pages, LIFO batch:1
DMI 2.2 present.
Intel MultiProcessor Specification v1.4
    Virtual Wire compatibility mode.
OEM ID: OEM00000 Product ID: PROD00000000 APIC at: 0xFEE00000
Processor #0 6:10 APIC version 17
I/O APIC #2 Version 17 at 0xFEC00000.
Enabling APIC mode:  Flat.  Using 1 I/O APICs
Processors: 1
Built 1 zonelists
Kernel command line: BOOT_IMAGE=dv-sources ro root=302 nodevfs acpi=off
Initializing CPU#0
PID hash table entries: 1024 (order 10: 8192 bytes)
Detected 1852.392 MHz processor.
Using tsc for high-res timesource
Console: colour VGA+ 80x25
Memory: 255220k/261056k available (1925k kernel code, 5108k reserved, 768k data,
152k init, 0k highmem)
Checking if this processor honours the WP bit even in supervisor mode... Ok.
Calibrating delay loop... 3645.44 BogoMIPS
Dentry cache hash table entries: 32768 (order: 5, 131072 bytes)
Inode-cache hash table entries: 16384 (order: 4, 65536 bytes)
Mount-cache hash table entries: 512 (order: 0, 4096 bytes)
CPU:     After generic identify, caps: 0383fbff c1c3fbff 00000000 00000000
CPU:     After vendor identify, caps: 0383fbff c1c3fbff 00000000 00000000
CPU: L1 I Cache: 64K (64 bytes/line), D cache 64K (64 bytes/line)
CPU: L2 Cache: 512K (64 bytes/line)
CPU:     After all inits, caps: 0383fbff c1c3fbff 00000000 00000020
Intel machine check architecture supported.
Intel machine check reporting enabled on CPU#0.
CPU: AMD Athlon(tm) XP 2500+ stepping 00
Enabling fast FPU save and restore... done.
Enabling unmasked SIMD FPU exception support... done.
Checking 'hlt' instruction... OK.
POSIX conformance testing by UNIFIX
enabled ExtINT on CPU#0
ESR value before enabling vector: 00000000
ESR value after enabling vector: 00000000
Using local APIC timer interrupts.
calibrating APIC timer ...
..... CPU clock speed is 1851.0702 MHz.
..... host bus clock speed is 336.0673 MHz.
NET: Registered protocol family 16
PCI: PCI BIOS revision 2.10 entry at 0xfaf40, last bus=1
PCI: Using configuration type 1
mtrr: v2.0 (20020519)
PCI: Probing PCI hardware
PCI: Probing PCI hardware (bus 00)
PCI: Using IRQ router VIA [1106/3177] at 0000:00:11.0
spurious 8259A interrupt: IRQ7.
Machine check exception polling timer started.
apm: BIOS not found.
Installing knfsd (copyright (C) 1996 okir@monad.swb.de).
udf: registering filesystem
Initializing Cryptographic API
Real Time Clock Driver v1.12
Non-volatile memory driver v1.2
Serial: 8250/16550 driver $Revision: 1.90 $ 8 ports, IRQ sharing disabled
ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
Using anticipatory io scheduler
Floppy drive(s): fd0 is 1.44M
FDC 0 is a post-1991 82077
RAMDISK driver initialized: 16 RAM disks of 8192K size 1024 blocksize
loop: loaded (max 8 devices)
Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
VP_IDE: IDE controller at PCI slot 0000:00:11.1
VP_IDE: chipset revision 6
VP_IDE: not 100% native mode: will probe irqs later
VP_IDE: VIA vt8235 (rev 00) IDE UDMA133 controller on pci0000:00:11.1
    ide0: BM-DMA at 0xe400-0xe407, BIOS settings: hda:DMA, hdb:pio
    ide1: BM-DMA at 0xe408-0xe40f, BIOS settings: hdc:DMA, hdd:pio
hda: IC35L060AVV207-0, ATA DISK drive
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
hdc: CD-ROM Drive/F5E, ATAPI CD/DVD-ROM drive
ide1 at 0x170-0x177,0x376 on irq 15
hda: max request size: 1024KiB
hda: 120103200 sectors (61492 MB) w/1821KiB Cache, CHS=16383/255/63, UDMA(100)
 hda: hda1 hda2 hda3 hda4 < hda5 >
hdc: ATAPI 52X CD-ROM drive, 128kB Cache, UDMA(33)
Uniform CD-ROM driver Revision: 3.20
mice: PS/2 mouse device common for all mice
serio: i8042 AUX port at 0x60,0x64 irq 12
serio: i8042 KBD port at 0x60,0x64 irq 1
input: AT Translated Set 2 keyboard on isa0060/serio0
NET: Registered protocol family 2
IP: routing cache hash table of 2048 buckets, 16Kbytes
TCP: Hash tables configured (established 16384 bind 32768)
NET: Registered protocol family 1
NET: Registered protocol family 17
found reiserfs format "3.6" with standard journal
reiserfs: using ordered data mode
Reiserfs journal params: device hda2, size 8192, journal first block 18, max
trans len 1024, max batch 900, max commit age 30, max trans age 30
reiserfs: checking transaction log (hda2) for (hda2)
reiserfs: replayed 52 transactions in 4 seconds
Using r5 hash to sort names
VFS: Mounted root (reiserfs filesystem) readonly.
Freeing unused kernel memory: 152k freed
Adding 747012k swap on /dev/hda3.  Priority:-1 extents:1
found reiserfs format "3.6" with standard journal
reiserfs: using ordered data mode
Reiserfs journal params: device hda5, size 8192, journal first block 18, max
trans len 1024, max batch 900, max commit age 30, max trans age 30
reiserfs: checking transaction log (hda5) for (hda5)
Using r5 hash to sort names
usbcore: registered new driver usbfs
usbcore: registered new driver hub
8139cp: 10/100 PCI Ethernet driver v1.2 (Mar 22, 2004)
8139cp: pci dev 0000:00:0a.0 (id 10ec:8139 rev 10) is not an 8139C+ compatible chip
8139cp: Try the "8139too" driver instead.
USB Universal Host Controller Interface driver v2.2
PCI: Found IRQ 11 for device 0000:00:10.0
PCI: Sharing IRQ 11 with 0000:00:08.0
PCI: Sharing IRQ 11 with 0000:00:12.0
uhci_hcd 0000:00:10.0: VIA Technologies, Inc. USB
uhci_hcd 0000:00:10.0: irq 11, io base 0000d800
uhci_hcd 0000:00:10.0: new USB bus registered, assigned bus number 1
uhci_hcd 0000:00:10.0: detected 2 ports
uhci_hcd 0000:00:10.0: root hub device address 1
usb usb1: new device strings: Mfr=3, Product=2, SerialNumber=1
usb usb1: default language 0x0409
usb usb1: Product: VIA Technologies, Inc. USB
usb usb1: Manufacturer: Linux 2.6.6 uhci_hcd
usb usb1: SerialNumber: 0000:00:10.0
usb usb1: hotplug
usb usb1: adding 1-0:1.0 (config #1, interface 0)
usb 1-0:1.0: hotplug
hub 1-0:1.0: usb_probe_interface
hub 1-0:1.0: usb_probe_interface - got id
hub 1-0:1.0: USB hub found
hub 1-0:1.0: 2 ports detected
hub 1-0:1.0: standalone hub
hub 1-0:1.0: no power switching (usb 1.0)
hub 1-0:1.0: individual port over-current protection
hub 1-0:1.0: power on to power good time: 2ms
hub 1-0:1.0: hub controller current requirement: 0mA
hub 1-0:1.0: local power source is good
hub 1-0:1.0: enabling power on all ports
PCI: Found IRQ 5 for device 0000:00:10.1
uhci_hcd 0000:00:10.1: VIA Technologies, Inc. USB (#2)
uhci_hcd 0000:00:10.1: irq 5, io base 0000dc00
uhci_hcd 0000:00:10.1: new USB bus registered, assigned bus number 2
uhci_hcd 0000:00:10.1: detected 2 ports
uhci_hcd 0000:00:10.1: root hub device address 1
usb usb2: new device strings: Mfr=3, Product=2, SerialNumber=1
usb usb2: default language 0x0409
usb usb2: Product: VIA Technologies, Inc. USB (#2)
usb usb2: Manufacturer: Linux 2.6.6 uhci_hcd
usb usb2: SerialNumber: 0000:00:10.1
usb usb2: hotplug
usb usb2: adding 2-0:1.0 (config #1, interface 0)
usb 2-0:1.0: hotplug
hub 2-0:1.0: usb_probe_interface
hub 2-0:1.0: usb_probe_interface - got id
hub 2-0:1.0: USB hub found
hub 2-0:1.0: 2 ports detected
hub 2-0:1.0: standalone hub
hub 2-0:1.0: no power switching (usb 1.0)
hub 2-0:1.0: individual port over-current protection
hub 2-0:1.0: power on to power good time: 2ms
hub 2-0:1.0: hub controller current requirement: 0mA
hub 2-0:1.0: local power source is good
hub 2-0:1.0: enabling power on all ports
PCI: Found IRQ 10 for device 0000:00:10.2
PCI: Sharing IRQ 10 with 0000:00:0a.0
PCI: Sharing IRQ 10 with 0000:00:11.5
uhci_hcd 0000:00:10.2: VIA Technologies, Inc. USB (#3)
uhci_hcd 0000:00:10.0: port 1 portsc 008a
hub 1-0:1.0: port 1, status 100, change 3, 12 Mb/s
uhci_hcd 0000:00:10.0: port 2 portsc 008a
hub 1-0:1.0: port 2, status 100, change 3, 12 Mb/s
uhci_hcd 0000:00:10.2: irq 10, io base 0000e000
uhci_hcd 0000:00:10.1: port 1 portsc 008a
hub 2-0:1.0: port 1, status 100, change 3, 12 Mb/s
uhci_hcd 0000:00:10.1: port 2 portsc 008a
hub 2-0:1.0: port 2, status 100, change 3, 12 Mb/s
uhci_hcd 0000:00:10.2: new USB bus registered, assigned bus number 3
uhci_hcd 0000:00:10.2: detected 2 ports
uhci_hcd 0000:00:10.2: root hub device address 1
usb usb3: new device strings: Mfr=3, Product=2, SerialNumber=1
usb usb3: default language 0x0409
usb usb3: Product: VIA Technologies, Inc. USB (#3)
usb usb3: Manufacturer: Linux 2.6.6 uhci_hcd
usb usb3: SerialNumber: 0000:00:10.2
usb usb3: hotplug
uhci_hcd 0000:00:10.0: port 1 portsc 0088
hub 1-0:1.0: port 1 enable change, status 100
uhci_hcd 0000:00:10.0: port 2 portsc 0088
hub 1-0:1.0: port 2 enable change, status 100
usb usb3: adding 3-0:1.0 (config #1, interface 0)
usb 3-0:1.0: hotplug
hub 3-0:1.0: usb_probe_interface
hub 3-0:1.0: usb_probe_interface - got id
hub 3-0:1.0: USB hub found
hub 3-0:1.0: 2 ports detected
hub 3-0:1.0: standalone hub
hub 3-0:1.0: no power switching (usb 1.0)
hub 3-0:1.0: individual port over-current protection
hub 3-0:1.0: power on to power good time: 2ms
hub 3-0:1.0: hub controller current requirement: 0mA
hub 3-0:1.0: local power source is good
hub 3-0:1.0: enabling power on all ports
uhci_hcd 0000:00:10.1: port 1 portsc 0088
hub 2-0:1.0: port 1 enable change, status 100
uhci_hcd 0000:00:10.1: port 2 portsc 0088
hub 2-0:1.0: port 2 enable change, status 100
uhci_hcd 0000:00:10.2: port 1 portsc 018a
hub 3-0:1.0: port 1, status 300, change 3, 1.5 Mb/s
uhci_hcd 0000:00:10.2: port 2 portsc 008a
hub 3-0:1.0: port 2, status 100, change 3, 12 Mb/s
ehci_hcd: block sizes: qh 128 qtd 96 itd 192 sitd 96
PCI: Found IRQ 12 for device 0000:00:10.3
ehci_hcd 0000:00:10.3: VIA Technologies, Inc. USB 2.0
ehci_hcd 0000:00:10.3: reset hcs_params 0x3206 dbg=0 cc=3 pcc=2 ordered !ppc ports=6
ehci_hcd 0000:00:10.3: reset hcc_params 6872 thresh 7 uframes 256/512/1024
uhci_hcd 0000:00:10.2: port 1 portsc 0188
hub 3-0:1.0: port 1 enable change, status 300
uhci_hcd 0000:00:10.2: port 2 portsc 0088
hub 3-0:1.0: port 2 enable change, status 100
ehci_hcd 0000:00:10.3: capability 0001 at 68
ehci_hcd 0000:00:10.3: irq 12, pci mem d09ce000
ehci_hcd 0000:00:10.3: new USB bus registered, assigned bus number 4
ehci_hcd 0000:00:10.3: reset command 080002 (park)=0 ithresh=8 period=1024 Reset
HALT
ehci_hcd 0000:00:10.3: init command 010009 (park)=0 ithresh=1 period=256 RUN
ehci_hcd 0000:00:10.3: USB 2.0 enabled, EHCI 1.00, driver 2003-Dec-29
ehci_hcd 0000:00:10.3: root hub device address 1
usb usb4: new device strings: Mfr=3, Product=2, SerialNumber=1
usb usb4: default language 0x0409
usb usb4: Product: VIA Technologies, Inc. USB 2.0
usb usb4: Manufacturer: Linux 2.6.6 ehci_hcd
usb usb4: SerialNumber: 0000:00:10.3
usb usb4: hotplug
usb usb4: adding 4-0:1.0 (config #1, interface 0)
usb 4-0:1.0: hotplug
hub 4-0:1.0: usb_probe_interface
hub 4-0:1.0: usb_probe_interface - got id
hub 4-0:1.0: USB hub found
hub 4-0:1.0: 6 ports detected
hub 4-0:1.0: standalone hub
hub 4-0:1.0: ganged power switching
hub 4-0:1.0: individual port over-current protection
hub 4-0:1.0: Single TT
hub 4-0:1.0: TT requires at most 8 FS bit times
hub 4-0:1.0: power on to power good time: 20ms
hub 4-0:1.0: hub controller current requirement: 0mA
hub 4-0:1.0: local power source is good
hub 4-0:1.0: enabling power on all ports
via-rhine.c:v1.10-LK1.1.19-2.5  July-12-2003  Written by Donald Becker
  http://www.scyld.com/network/via-rhine.html
PCI: Found IRQ 11 for device 0000:00:12.0
PCI: Sharing IRQ 11 with 0000:00:08.0
PCI: Sharing IRQ 11 with 0000:00:10.0
eth0: VIA VT6102 Rhine-II at 0xec00, 00:50:8d:59:6f:c9, IRQ 11.
eth0: MII PHY found at address 1, status 0x786d advertising 05e1 Link 45e1.
uhci_hcd 0000:00:10.0: suspend_hc
ohci_hcd: 2004 Feb 02 USB 1.1 'Open' Host Controller (OHCI) Driver (PCI)
ohci_hcd: block sizes: ed 64 td 64
uhci_hcd 0000:00:10.1: suspend_hc
uhci_hcd 0000:00:10.2: suspend_hc
eth0: Setting full-duplex based on MII #1 link partner capability of 45e1.
blk: queue cfdfd600, I/O limit 4095Mb (mask 0xffffffff)
Comment 3 Vladi Belperchinov-Shabanski 2004-11-29 14:08:28 UTC
'find / > /dev/null' or any other disk access as file copy etc. produces 
100%wa (iowait).
during disk access machine freezes for few seconds with short pauses between.

I can reproduce the effect on this machine:

Asus K8N, Socket 754, AMD Athlon 64, NVIDIA nForce3 250Gb
IDE hdd: Seagate 120GB 7200.7 8MB ST3120026A

with the following kernel versions:

linux-2.6.9
linux-2.6.9-ac12
linux-2.6.10-rc2
linux-2.6.10-rc2-mm3

and other 2.6.x releases as well. with 2.4.28 on the same machine works fine.

I'm pretty sure that this bug is the same as:

http://bugzilla.kernel.org/show_bug.cgi?id=3094
http://bugzilla.kernel.org/show_bug.cgi?id=3420

-------------------------------------------------------------------- hdparm:

/dev/hda:
 multcount    =  8 (on)
 I/O support  =  1 (32-bit)
 unmaskirq    =  1 (on)
 using_dma    =  1 (on)
 keepsettings =  0 (off)
 nowerr       =  0 (off)
 readonly     =  0 (off)
 readahead    =  8 (on)

but all are set by kernel ide autotune.

--------------------------------------------------------------------- dmesg:

testing michael_mic
failed to load transform for michael_mic
lp: driver loaded but no devices found
Real Time Clock Driver v1.12
Non-volatile memory driver v1.2
Linux agpgart interface v0.100 (c) Dave Jones
serio: i8042 AUX port at 0x60,0x64 irq 12
serio: i8042 KBD port at 0x60,0x64 irq 1
Serial: 8250/16550 driver $Revision: 1.90 $ 8 ports, IRQ sharing disabled
ttyS0 at I/O 0x3f8 (irq = 4) is a 16550A
parport0: PC-style at 0x378 (0x778) [PCSPP(,...)]
parport0: irq 7 detected
lp0: using parport0 (polling).
loop: loaded (max 8 devices)
8139too Fast Ethernet driver 0.9.27
eth0: RealTek RTL8139 at 0xffffff0000054c00, 00:02:44:6e:cf:cc, IRQ 11
eth0:  Identified 8139 chip type 'RTL-8100B/8139D'
Uniform Multi-Platform E-IDE driver Revision: 7.00alpha2
ide: Assuming 33MHz system bus speed for PIO modes; override with idebus=xx
NFORCE3-250: IDE controller at PCI slot 0000:00:08.0
NFORCE3-250: chipset revision 162
NFORCE3-250: not 100% native mode: will probe irqs later
NFORCE3-250: BIOS didn't set cable bits correctly. Enabling workaround.
NFORCE3-250: 0000:00:08.0 (rev a2) UDMA133 controller
    ide0: BM-DMA at 0xffa0-0xffa7, BIOS settings: hda:DMA, hdb:DMA
    ide1: BM-DMA at 0xffa8-0xffaf, BIOS settings: hdc:DMA, hdd:pio
Probing IDE interface ide0...
hda: ST3120026A, ATA DISK drive
hdb: ST3120026A, ATA DISK drive
Using anticipatory io scheduler
ide0 at 0x1f0-0x1f7,0x3f6 on irq 14
Probing IDE interface ide1...
hdc: HL-DT-ST GCE-8526B, ATAPI CD/DVD-ROM drive
ide1 at 0x170-0x177,0x376 on irq 15 (serialized with ide0)
hda: max request size: 1024KiB
hda: 234441648 sectors (120034 MB) w/8192KiB Cache, CHS=16383/255/63
hda: cache flushes supported
 hda: hda1 hda2 hda3
hdb: max request size: 1024KiB
hdb: 234441648 sectors (120034 MB) w/8192KiB Cache, CHS=16383/255/63
hdb: cache flushes supported
 hdb: hdb1 hdb2
hdc: ATAPI 52X CD-ROM CD-R/RW drive, 2048kB Cache
Uniform CD-ROM driver Revision: 3.20
libata version 1.02 loaded.
ehci_hcd: block sizes: qh 160 qtd 96 itd 192 sitd 96
ehci_hcd 0000:00:02.2: PCI device 10de:00e8 (nVidia Corporation)
ehci_hcd 0000:00:02.2: reset hcs_params 0x102488 dbg=1 cc=2 pcc=4 !ppc ports=8
ehci_hcd 0000:00:02.2: reset portroute 0 0 1 1 1 0 1 0 
ehci_hcd 0000:00:02.2: reset hcc_params a082 caching frame 256/512/1024
ehci_hcd 0000:00:02.2: capability 0001 at a0
PCI: Setting latency timer of device 0000:00:02.2 to 64
ehci_hcd 0000:00:02.2: irq 10, pci mem ffffff0000056c00
ehci_hcd 0000:00:02.2: new USB bus registered, assigned bus number 1
ehci_hcd 0000:00:02.2: reset command 08001a (park)=0 ithresh=8 Periodic
period=256 Reset HALT
PCI: cache line size of 64 is not supported by device 0000:00:02.2
ehci_hcd 0000:00:02.2: init command 010009 (park)=0 ithresh=1 period=256 RUN
ehci_hcd 0000:00:02.2: USB 2.0 enabled, EHCI 1.00, driver 2004-May-10
ehci_hcd 0000:00:02.2: supports USB remote wakeup
usb usb1: new device strings: Mfr=3, Product=2, SerialNumber=1
usb usb1: default language 0x0409
usb usb1: Product: PCI device 10de:00e8 (nVidia Corporation)
usb usb1: Manufacturer: Linux 2.6.9 ehci_hcd
usb usb1: SerialNumber: 0000:00:02.2
usb usb1: hotplug
usb usb1: adding 1-0:1.0 (config #1, interface 0)
usb 1-0:1.0: hotplug
hub 1-0:1.0: usb_probe_interface
hub 1-0:1.0: usb_probe_interface - got id
hub 1-0:1.0: USB hub found
hub 1-0:1.0: 8 ports detected
hub 1-0:1.0: standalone hub
hub 1-0:1.0: ganged power switching
hub 1-0:1.0: individual port over-current protection
hub 1-0:1.0: Single TT
hub 1-0:1.0: TT requires at most 8 FS bit times
hub 1-0:1.0: power on to power good time: 20ms
hub 1-0:1.0: local power source is good
hub 1-0:1.0: enabling power on all ports
ohci_hcd: 2004 Feb 02 USB 1.1 'Open' Host Controller (OHCI) Driver (PCI)
ohci_hcd: block sizes: ed 80 td 96
ohci_hcd 0000:00:02.0: PCI device 10de:00e7 (nVidia Corporation)
ohci_hcd 0000:00:02.0: USB HC TakeOver from BIOS/SMM
ohci_hcd 0000:00:02.0: reset, control = 0x600
PCI: Setting latency timer of device 0000:00:02.0 to 64
ohci_hcd 0000:00:02.0: irq 3, pci mem ffffff0000058000
ohci_hcd 0000:00:02.0: new USB bus registered, assigned bus number 2
ohci_hcd 0000:00:02.0: supports USB remote wakeup
usb usb2: new device strings: Mfr=3, Product=2, SerialNumber=1
usb usb2: default language 0x0409
usb usb2: Product: PCI device 10de:00e7 (nVidia Corporation)
usb usb2: Manufacturer: Linux 2.6.9 ohci_hcd
usb usb2: SerialNumber: 0000:00:02.0
usb usb2: hotplug
usb usb2: adding 2-0:1.0 (config #1, interface 0)
usb 2-0:1.0: hotplug
hub 2-0:1.0: usb_probe_interface
hub 2-0:1.0: usb_probe_interface - got id
hub 2-0:1.0: USB hub found
hub 2-0:1.0: 4 ports detected
hub 2-0:1.0: standalone hub
hub 2-0:1.0: no power switching (usb 1.0)
hub 2-0:1.0: no over-current protection
hub 2-0:1.0: power on to power good time: 2ms
hub 2-0:1.0: local power source is good
ohci_hcd 0000:00:02.0: created debug files
ohci_hcd 0000:00:02.0: OHCI controller state
ohci_hcd 0000:00:02.0: OHCI 1.0, with legacy support registers
ohci_hcd 0000:00:02.0: control 0x683 RWE RWC HCFS=operational CBSR=3
ohci_hcd 0000:00:02.0: cmdstatus 0x00000 SOC=0
ohci_hcd 0000:00:02.0: intrstatus 0x00000004 SF
ohci_hcd 0000:00:02.0: intrenable 0x8000000a MIE RD WDH
ohci_hcd 0000:00:02.0: hcca frame #0004
ohci_hcd 0000:00:02.0: roothub.a 01001204 POTPGT=1 NOCP NPS NDP=4
ohci_hcd 0000:00:02.0: roothub.b 00000000 PPCM=0000 DR=0000
ohci_hcd 0000:00:02.0: roothub.status 00008000 DRWE
ohci_hcd 0000:00:02.0: roothub.portstatus [0] 0x00000100 PPS
ohci_hcd 0000:00:02.0: roothub.portstatus [1] 0x00000100 PPS
ohci_hcd 0000:00:02.0: roothub.portstatus [2] 0x00000100 PPS
ohci_hcd 0000:00:02.0: roothub.portstatus [3] 0x00000100 PPS
ohci_hcd 0000:00:02.1: PCI device 10de:00e7 (nVidia Corporation)
ohci_hcd 0000:00:02.1: USB HC TakeOver from BIOS/SMM
ohci_hcd 0000:00:02.1: reset, control = 0x600
PCI: Setting latency timer of device 0000:00:02.1 to 64
ohci_hcd 0000:00:02.1: irq 9, pci mem ffffff000005a000
ohci_hcd 0000:00:02.1: new USB bus registered, assigned bus number 3
ohci_hcd 0000:00:02.1: supports USB remote wakeup
usb usb3: new device strings: Mfr=3, Product=2, SerialNumber=1
usb usb3: default language 0x0409
usb usb3: Product: PCI device 10de:00e7 (nVidia Corporation)
usb usb3: Manufacturer: Linux 2.6.9 ohci_hcd
usb usb3: SerialNumber: 0000:00:02.1
usb usb3: hotplug
usb usb3: adding 3-0:1.0 (config #1, interface 0)
usb 3-0:1.0: hotplug
hub 3-0:1.0: usb_probe_interface
hub 3-0:1.0: usb_probe_interface - got id
hub 3-0:1.0: USB hub found
hub 3-0:1.0: 4 ports detected
hub 3-0:1.0: standalone hub
hub 3-0:1.0: no power switching (usb 1.0)
hub 3-0:1.0: no over-current protection
hub 3-0:1.0: power on to power good time: 2ms
hub 3-0:1.0: local power source is good
ohci_hcd 0000:00:02.1: created debug files
ohci_hcd 0000:00:02.1: OHCI controller state
ohci_hcd 0000:00:02.1: OHCI 1.0, with legacy support registers
ohci_hcd 0000:00:02.1: control 0x683 RWE RWC HCFS=operational CBSR=3
ohci_hcd 0000:00:02.1: cmdstatus 0x00000 SOC=0
ohci_hcd 0000:00:02.1: intrstatus 0x00000004 SF
ohci_hcd 0000:00:02.1: intrenable 0x8000000a MIE RD WDH
ohci_hcd 0000:00:02.1: hcca frame #0004
ohci_hcd 0000:00:02.1: roothub.a 01001204 POTPGT=1 NOCP NPS NDP=4
ohci_hcd 0000:00:02.1: roothub.b 00000000 PPCM=0000 DR=0000
ohci_hcd 0000:00:02.1: roothub.status 00008000 DRWE
ohci_hcd 0000:00:02.1: roothub.portstatus [0] 0x00000100 PPS
ohci_hcd 0000:00:02.1: roothub.portstatus [1] 0x00000100 PPS
ohci_hcd 0000:00:02.1: roothub.portstatus [2] 0x00000100 PPS
ohci_hcd 0000:00:02.1: roothub.portstatus [3] 0x00000100 PPS
USB Universal Host Controller Interface driver v2.2
Initializing USB Mass Storage driver...
usbcore: registered new driver usb-storage
USB Mass Storage support registered.
usbcore: registered new driver hiddev
usbcore: registered new driver usbhid
drivers/usb/input/hid-core.c: v2.0:USB HID core driver
mice: PS/2 mouse device common for all mice
ehci_hcd 0000:00:02.2: GetStatus port 6 status 001403 POWER sig=k  CSC CONNECT
hub 1-0:1.0: port 6, status 0501, change 0001, 480 Mb/s
hub 1-0:1.0: debounce: port 6: total 100ms stable 100ms status 0x501
ehci_hcd 0000:00:02.2: port 6 low speed --> companion
input: AT Translated Set 2 keyboard on isa0060/serio0
ehci_hcd 0000:00:02.2: GetStatus port 6 status 003402 POWER OWNER sig=k  CSC
ohci_hcd 0000:00:02.1: GetStatus roothub.portstatus [2] = 0x00010301 CSC LSDA
PPS CCS
hub 3-0:1.0: port 3, status 0301, change 0001, 1.5 Mb/s
input: PC Speaker
i2c /dev entries driver
Advanced Linux Sound Architecture Driver Version 1.0.6 (Sun Aug 15 07:17:53 2004
UTC).
no UART detected at 0x1
MTVAP port 0x378 is busy
specify port
ALSA device list:
  #0: Dummy 1
  #1: Virtual MIDI Card 1
  #2: C-Media PCI CMI8738 (model 37) at 0xd800, irq 9
NET: Registered protocol family 2
IP: routing cache hash table of 8192 buckets, 64Kbytes
TCP: Hash tables configured (established 262144 bind 65536)
NET: Registered protocol family 1
NET: Registered protocol family 17
ReiserFS: hda3: found reiserfs format "3.6" with standard journal
hub 3-0:1.0: debounce: port 3: total 100ms stable 100ms status 0x301
ohci_hcd 0000:00:02.1: GetStatus roothub.portstatus [2] = 0x00100303 PRSC LSDA
PPS PES CCS
usb 3-3: new low speed USB device using address 2
usb 3-3: skipped 1 descriptor after interface
usb 3-3: new device strings: Mfr=1, Product=2, SerialNumber=0
usb 3-3: default language 0x0409
usb 3-3: Product: USB-PS/2 Optical Mouse
usb 3-3: Manufacturer: Logitech
usb 3-3: hotplug
usb 3-3: adding 3-3:1.0 (config #1, interface 0)
usb 3-3:1.0: hotplug
usbhid 3-3:1.0: usb_probe_interface
usbhid 3-3:1.0: usb_probe_interface - got id
input: USB HID v1.10 Mouse [Logitech USB-PS/2 Optical Mouse] on usb-0000:00:02.1-3
ohci_hcd 0000:00:02.1: GetStatus roothub.portstatus [1] = 0x00010100 CSC PPS
hub 3-0:1.0: port 2, status 0100, change 0001, 12 Mb/s
hub 3-0:1.0: debounce: port 2: total 100ms stable 100ms status 0x100
ReiserFS: hda3: using ordered data mode
ReiserFS: hda3: journal params: device hda3, size 8192, journal first block 18,
max trans len 1024, max batch 900, max commit age 30, max trans age 30
ReiserFS: hda3: checking transaction log (hda3)
ReiserFS: hda3: Using r5 hash to sort names
VFS: Mounted root (reiserfs filesystem) readonly.
Freeing unused kernel memory: 144k freed
Adding 530136k swap on /dev/hda2.  Priority:-1 extents:1
ReiserFS: hdb2: found reiserfs format "3.6" with standard journal
ReiserFS: hdb2: using ordered data mode
ReiserFS: hdb2: journal params: device hdb2, size 8192, journal first block 18,
max trans len 1024, max batch 900, max commit age 30, max trans age 30
ReiserFS: hdb2: checking transaction log (hdb2)
ReiserFS: hdb2: Using r5 hash to sort names
nvidia: module license 'NVIDIA' taints kernel.
NVRM: loading NVIDIA Linux x86_64 NVIDIA Kernel Module  1.0-6111  Tue Jul 27
07:58:17 PDT 2004
eth0: link up, 100Mbps, full-duplex, lpa 0x45E1
nfs warning: mount version older than kernel
nfs warning: mount version older than kernel
NVRM: loading NVIDIA Linux x86_64 NVIDIA Kernel Module  1.0-6111  Tue Jul 27
07:58:17 PDT 2004
agpgart: Found an AGP 3.0 compliant device at 0000:00:00.0.
agpgart: Putting AGP V3 device at 0000:00:00.0 into 8x mode
agpgart: Putting AGP V3 device at 0000:01:00.0 into 8x mode
agpgart: Found an AGP 3.0 compliant device at 0000:00:00.0.
agpgart: Putting AGP V3 device at 0000:00:00.0 into 8x mode
agpgart: Putting AGP V3 device at 0000:01:00.0 into 8x mode
spurious 8259A interrupt: IRQ7.
Comment 4 Paul A Sumner 2004-12-16 14:00:51 UTC
I've encountered this same issue as well on dissimilar hardware:

Arima HDAMA mobo
2x250 AMD Opteron cpus
12GB ram
6x15K Seagate drives
2200s Adaptec raid card

Kernel:

Linux taz 2.6.9-gentoo-r9 #1 SMP Thu Dec 16 12:47:42 PST 2004 x86_64 AMD
Opteron(tm) Processor 250 AuthenticAMD GNU/Linux

Sets to reproduce:

1. Run bonnie++ w/ defaults (2xram or 24GB in my case).
2. While the iowait fluctuates it rises overall steadly as bonnie++'s test file
size rises.
3. When iowait reaches 99-100% my scsi array goes offline and/or machine freezes... 

Also, I noticed that if I kill the process and rm the test files that the iowait
remains high, even for a bit after a 'sync' I am unsure if this is expected
behavior. 'dd' also produces high iowait for 24 & 32GB files; although, it seems
to just miss "Lockup:"

Device:            tps   Blk_read/s   Blk_wrtn/s   Blk_read   Blk_wrtn
sda             379.00         0.00     84512.00          0      84512

avg-cpu:  %user   %nice    %sys %iowait   %idle
           0.00    0.00    1.01   98.99    0.00
Comment 5 Vladi Belperchinov-Shabanski 2005-02-09 14:20:29 UTC
same effect with linux 2.6.10 on asus a8n-sli mb with sata disks (nvsata device).

I can provide dmesg or other info if needed...
Comment 6 Anton 2005-03-02 21:50:13 UTC
Same bug in "bugfree kernel" 2.6.11 :-) and SiS645 chipset.
Comment 7 Adrian Bunk 2005-07-03 13:13:54 UTC
Jens, can you look at this bug?
Comment 8 Vesa-Pekka Palmu 2005-07-10 23:05:17 UTC
I have been having similiar problems running debian sarge (2.6.8 kernel) no 
packages outside sarge installed.

The machine is Pentium 3 running lvm over software raid-5 and the %wa in top 
kept going up during heavy disk-io, but booting with noapic and apic=off boot 
parameters seems to have fixed it (fingers crossed).

Comment 9 Athanasius 2006-11-03 08:10:42 UTC
This might possibly be what I've just seen twice now (once with nvidia.ko
loaded, but the 2nd time was a fresh boot without that module file even being
inside /lib/modules/`uname -r` and I confirmed it had NOT loaded at all).

I was copying a 5.7GB DVD image from my 2.6.19-rc4 (vanilla) desktop to the
2.6.16.1 (also vanilla) server over NFS.  The first time I got to 900297728
bytes, and then the desktop totally froze up (the server was fine, and still
is).  The 2nd try, with the NVIDIA module never loaded (I had to edit xorg.conf
to only use one monitor due to using the 'nv' driver) it got as far as
3962806272 and then the exact same problem again, a total freezeup of the desktop

By total freeze I mean the mouse and keyboard were totally unresponsive.  Not
even a Capslock light operating.

Desktop:
Linux 2.6.19-rc4 (from vanilla sources)
Asus A-8V Deluxe motherboard (using the Ethernet controller: Marvell Technology
Group Ltd. 88E8001 Gigabit Ethernet Controller for network access)
Debian Etch (testing), uptodate before the problem earlier today and not change
since.

Server:
Linux 2.6.16.1 (from vanilla sources)
Gigabyte GA-686 BX motherboard
NIC in question is an "SMC 10/100 8139 based card" using 8139too module.
Debian Sarge (stable), with nfs-kernel-server 1.0.6-3.1 package exporting the
mount on the desktop that was the target of the copy (actually a 'mv').

If I get chance I'll reboot again without the nvidia module and log some iowait
stats to a file on another attempt.
Comment 10 Natalie Protasevich 2007-09-03 23:00:30 UTC
Any updates on this problem please.
Thanks.
Comment 11 Natalie Protasevich 2008-03-03 20:39:33 UTC
Closing the bug. Please reopen if confirmed with latest kernel.
Thanks.
Comment 12 Vladimir 2009-12-12 01:11:29 UTC
  Still high %iowait's (60%-70%) on 2.6.32 kernel during any kinds of disk operations (a one file copying for example).
Comment 13 Jens Axboe 2009-12-12 12:11:19 UTC
When you do IO, it's expected to have high io wait. It's not a bug. It should not freeze, of course.
Comment 14 Vladimir 2009-12-12 17:15:13 UTC
(In reply to comment #13)
> When you do IO, it's expected to have high io wait. It's not a bug. It should
> not freeze, of course.

  Yes, I think so too, but with the last kernel version I cannot even listen to music! My Mplayer makes gaps in the sound every several seconds and each time I observe high (up to 70% or even more) iowait burtst by a "top" command.
  Some office programs, for example, also "think" too long when I open a SaveFile dialog trying to save my document.
  I must note that I haven't noticed such behavior before. I cannot tell for sure, but I think it emerged somewhere between 2.6.30 and 2.6.32 kernels. At least, I have been using OpenOffice for quite a long time (that's why I don't blame it), and it began to put me into long waiting states only after the most recent system update.

Note You need to log in before you can comment on or make changes to this bug.