Ethernet stops working, or wery slow on nvidia "MCP78S [GeForce 8200] Ethernet (rev a2)" after enough data copied over nfs. On some tests the kernel panicked. (driver forcedeth) The machine is an AMD64x4, 2.5 GHz on a ASUS M3N-H/HDMI motherboard. Problem found on SLAMD64 12.2, kernel 2.6.27.7-SMP (64 bit), slackware 12.1 with 2.6.24.5 and 2.6.28.7 kernels (32 bit). Machine is connected to a 1 Gbit netgear switch. On that switch is among other machines two servers, one is an AMD 64x2 2.6 GHz running slackware 12.0, (first kernel 2.6.21.5 and later 2.6.28.7),(forcedeth on CK804) the other is a dual celeron 400 MHz running slackware 8.0, kernel 2.4.32-SMP. (RTL-8139 ethernet) Problem only seen with NFS so far, small rsize, wsize makes problem worse, SCP and SAMBA tested ok. Copy from slow server better than fast. After copying from 50 to 100 GB over NFS from the AMD server the network stops working, or sometimes is very slow. Doing an ifconfig eth0 down followed by an ifconfig eth0 up restores communication for a while. I have also tested to change the line "static int max_interrupt_work = 5;" to "static int max_interrupt_work = 15;" in forcedeth.c but could not see any difference. The following NFS tests are with mount options: rsize=8192,wsize=8192 (Note, most of the tests below are only performced once due to the time it it takes to run them.) SLAMD64, kernel 2.6.27.7 SMP ==> hangs / slow after 50-100 GB. Added a script doing "ifconfig down/up" every 5 minutes ==> kernel panic Removed config option "CONFIG_FORCEDETH_NAPI" ==> kernel panic, picture of screen can be found on: http://w1.874.telia.com/~u87421281/panic.jpg Slackware 12.1 kernel 2.6.24.5 SMP ==> hangs / slow after 50-100 GB. Slackware 12.1 kernel 2.6.28.7 SMP ==> hangs / slow after 50-100 GB. used ethtool to disable sg, tco, gso ==> panic Skackware 12.1 kernel 2.6.28.7 NOSMP ==> hangs / slow after 50-100 GB, also hung switch so no machine on it was accesible. WinXP, SP3 (32bit), copied from celeron server with samba ==> OK. SLAMD64, kernel 2.6.27.7 SMP, on RTL-8169 nic ==> OK. SLAMD64, kernel 2.6.27.7 SMP, MCP78S nic with removed config option "CONFIG_FORCEDETH_NAPI", copying using scp ==> OK. Copying using NFS from celeron server ==> OK. The following NFS tests are with mount options: rsize=1024,wsize=1024 Copying using NFS from AMD server ==> hangs after 7 GB. used ethtool to disable sg, tco, gso ==> hangs after 3 GB. Copying using NFS from celeron server ==> have copied 183 GB in 12 hours. used ethtool to turn off all options (could not turn off tx) ==> copied about 48 GB in 11 hours. # Drivef info # ethtool -i eth0 driver: forcedeth version: 0.61 firmware-version: bus-info: 0000:00:0a.0 # Last tested options: # ethtool -k eth0 Offload parameters for eth0: rx-checksumming: off tx-checksumming: on scatter-gather: off tcp segmentation offload: off udp fragmentation offload: off generic segmentation offload: off # Default options: # ethtool -k eth0 Offload parameters for eth0: rx-checksumming: on tx-checksumming: on scatter-gather: on tcp segmentation offload: on udp fragmentation offload: off generic segmentation offload: on # cat /proc/modules forcedeth 61072 0 - Live 0xffffffffa005c000 ipv6 286280 20 - Live 0xffffffffa092c000 pcmcia 40984 0 - Live 0xffffffffa0920000 pcmcia_core 39844 1 pcmcia, Live 0xffffffffa0915000 lp 16132 0 - Live 0xffffffffa0910000 ppdev 12552 0 - Live 0xffffffffa090b000 parport_pc 32296 0 - Live 0xffffffffa0902000 parport 41136 3 lp,ppdev,parport_pc, Live 0xffffffffa08f6000 fuse 60992 3 - Live 0xffffffffa08e6000 sg 35936 0 - Live 0xffffffffa08dc000 nvidia 8118520 0 - Live 0xffffffffa011c000 (P) snd_hda_intel 473420 0 - Live 0xffffffffa00a7000 snd_pcm 83336 1 snd_hda_intel, Live 0xffffffffa0091000 snd_timer 26256 1 snd_pcm, Live 0xffffffffa0089000 snd_page_alloc 12816 2 snd_hda_intel,snd_pcm, Live 0xffffffffa0084000 fan 9480 0 - Live 0xffffffffa0080000 snd_hwdep 12552 1 snd_hda_intel, Live 0xffffffffa007b000 evdev 15648 3 - Live 0xffffffffa0074000 thermal 23200 0 - Live 0xffffffffa006d000 rtc_cmos 15160 0 - Live 0xffffffffa0057000 processor 42548 1 thermal, Live 0xffffffffa004b000 button 11680 0 - Live 0xffffffffa0045000 shpchp 36892 0 - Live 0xffffffffa003a000 wmi 10944 0 - Live 0xffffffffa0036000 r8169 35204 0 - Live 0xffffffffa002c000 mii 9344 1 r8169, Live 0xffffffffa0028000 snd 64456 4 snd_hda_intel,snd_pcm,snd_timer,snd_hwdep, Live 0xffffffffa0017000 soundcore 11296 1 snd, Live 0xffffffffa0013000 rtc_core 22596 1 rtc_cmos, Live 0xffffffffa000c000 rtc_lib 7168 1 rtc_core, Live 0xffffffffa0009000 thermal_sys 17216 3 fan,thermal,processor, Live 0xffffffffa0003000 hwmon 6984 1 thermal_sys, Live 0xffffffffa0000000 # lspci -vvv 00:00.0 RAM memory: nVidia Corporation MCP78S [GeForce 8200] Memory Controller (rev a2) Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Capabilities: [94] HyperTransport: #1a Capabilities: [60] HyperTransport: Retry Mode Capabilities: [44] HyperTransport: Slave or Primary Interface Command: BaseUnitID=0 UnitCnt=0 MastHost- DefDir- DUL- Link Control 0: CFlE- CST- CFE- <LkFail- Init+ EOC- TXO- <CRCErr=0 IsocEn- LSEn- ExtCTL- 64b- Link Config 0: MLWI=16bit DwFcIn- MLWO=16bit DwFcOut- LWI=16bit DwFcInEn- LWO=16bit DwFcOutEn- Link Control 1: CFlE- CST- CFE- <LkFail+ Init- EOC+ TXO+ <CRCErr=0 IsocEn- LSEn- ExtCTL- 64b- Link Config 1: MLWI=8bit DwFcIn- MLWO=8bit DwFcOut- LWI=8bit DwFcInEn- LWO=8bit DwFcOutEn- Revision ID: 3.00 Link Frequency 0: [b] Link Error 0: <Prot- <Ovfl- <EOC- CTLTm- Link Frequency Capability 0: 200MHz+ 300MHz- 400MHz+ 500MHz- 600MHz+ 800MHz+ 1.0GHz+ 1.2GHz+ 1.4GHz- 1.6GHz- Vend- Feature Capability: IsocFC+ LDTSTOP+ CRCTM- ECTLT- 64bA- UIDRD+ Link Frequency 1: 200MHz Link Error 1: <Prot- <Ovfl- <EOC- CTLTm- Link Frequency Capability 1: 200MHz- 300MHz- 400MHz- 500MHz- 600MHz- 800MHz- 1.0GHz- 1.2GHz- 1.4GHz- 1.6GHz- Vend- Error Handling: PFlE- OFlE- PFE- OFE- EOCFE- RFE- CRCFE- SERRFE- CF- RE- PNFE- ONFE- EOCNFE- RNFE- CRCNFE- SERRNFE- Prefetchable memory behind bridge Upper: 00-00 Bus Number: 00 Capabilities: [d0] HyperTransport: #1c 00:01.0 ISA bridge: nVidia Corporation MCP78S [GeForce 8200] LPC Bridge (rev a2) Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap- 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 00:01.1 SMBus: nVidia Corporation MCP78S [GeForce 8200] SMBus (rev a1) Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Control: I/O+ Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Interrupt: pin A routed to IRQ 5 Region 0: I/O ports at fc00 [size=64] Region 4: I/O ports at 1c00 [size=64] Region 5: I/O ports at 1c40 [size=64] Capabilities: [44] Power Management version 2 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=0 PME- 00:01.2 RAM memory: nVidia Corporation MCP78S [GeForce 8200] Memory Controller (rev a1) Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap- 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- 00:01.3 Co-processor: nVidia Corporation MCP78S [GeForce 8200] Co-Processor (rev a2) Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap- 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 (750ns min, 250ns max) Interrupt: pin B routed to IRQ 11 Region 0: Memory at fdf80000 (32-bit, non-prefetchable) [size=512K] 00:01.4 RAM memory: nVidia Corporation MCP78S [GeForce 8200] Memory Controller (rev a1) Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap- 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- 00:02.0 USB Controller: nVidia Corporation MCP78S [GeForce 8200] OHCI USB 1.1 Controller (rev a1) (prog-if 10 [OHCI]) Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 (750ns min, 250ns max) Interrupt: pin A routed to IRQ 20 Region 0: Memory at fe02f000 (32-bit, non-prefetchable) [size=4K] Capabilities: [44] Power Management version 2 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Kernel driver in use: ohci_hcd Kernel modules: ohci-hcd 00:02.1 USB Controller: nVidia Corporation MCP78S [GeForce 8200] EHCI USB 2.0 Controller (rev a1) (prog-if 20 [EHCI]) Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 (750ns min, 250ns max) Interrupt: pin B routed to IRQ 22 Region 0: Memory at fe02e000 (32-bit, non-prefetchable) [size=256] Capabilities: [44] Debug port: BAR=1 offset=00a0 Capabilities: [80] Power Management version 2 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Kernel driver in use: ehci_hcd Kernel modules: ehci-hcd 00:04.0 USB Controller: nVidia Corporation MCP78S [GeForce 8200] OHCI USB 1.1 Controller (rev a1) (prog-if 10 [OHCI]) Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 (750ns min, 250ns max) Interrupt: pin A routed to IRQ 23 Region 0: Memory at fe02d000 (32-bit, non-prefetchable) [size=4K] Capabilities: [44] Power Management version 2 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Kernel driver in use: ohci_hcd Kernel modules: ohci-hcd 00:04.1 USB Controller: nVidia Corporation MCP78S [GeForce 8200] EHCI USB 2.0 Controller (rev a1) (prog-if 20 [EHCI]) Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 (750ns min, 250ns max) Interrupt: pin B routed to IRQ 21 Region 0: Memory at fe02c000 (32-bit, non-prefetchable) [size=256] Capabilities: [44] Debug port: BAR=1 offset=00a0 Capabilities: [80] Power Management version 2 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Kernel driver in use: ehci_hcd Kernel modules: ehci-hcd 00:06.0 IDE interface: nVidia Corporation MCP78S [GeForce 8200] IDE (rev a1) (prog-if 8a [Master SecP PriP]) Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 (750ns min, 250ns max) Region 0: [virtual] Memory at 000001f0 (32-bit, non-prefetchable) [disabled] [size=8] Region 1: [virtual] Memory at 000003f0 (type 3, non-prefetchable) [disabled] [size=1] Region 2: [virtual] Memory at 00000170 (32-bit, non-prefetchable) [disabled] [size=8] Region 3: [virtual] Memory at 00000370 (type 3, non-prefetchable) [disabled] [size=1] Region 4: I/O ports at f000 [size=16] Capabilities: [44] Power Management version 2 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Kernel driver in use: AMD_IDE Kernel modules: pata_amd 00:07.0 Audio device: nVidia Corporation MCP78S [GeForce 8200] High Definition Audio (rev a1) Subsystem: ASUSTeK Computer Inc. Unknown device 82fe Control: I/O- Mem+ BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Interrupt: pin A routed to IRQ 21 Region 0: Memory at fe020000 (32-bit, non-prefetchable) [size=16K] Capabilities: [44] Power Management version 2 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Capabilities: [50] Message Signalled Interrupts: Mask+ 64bit+ Queue=0/0 Enable- Address: 0000000000000000 Data: 0000 Masking: 00000000 Pending: 00000000 Capabilities: [6c] HyperTransport: MSI Mapping Enable- Fixed+ Kernel modules: snd-hda-intel 00:08.0 PCI bridge: nVidia Corporation MCP78S [GeForce 8200] PCI Bridge (rev a1) (prog-if 01 [Subtractive decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Bus: primary=00, secondary=01, subordinate=01, sec-latency=32 I/O behind bridge: 0000c000-0000cfff Memory behind bridge: fde00000-fdefffff Prefetchable memory behind bridge: fdd00000-fddfffff Secondary status: 66MHz- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort+ <SERR- <PERR- BridgeCtl: Parity- SERR- NoISA+ VGA- MAbort- >Reset- FastB2B- PriDiscTmr- SecDiscTmr+ DiscTmrStat+ DiscTmrSERREn- Capabilities: [b8] Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Capabilities: [8c] HyperTransport: MSI Mapping Enable- Fixed- Mapping Address Base: 00000000fee00000 00:09.0 IDE interface: nVidia Corporation MCP78S [GeForce 8200] SATA Controller (non-AHCI mode) (rev a2) (prog-if 85 [Master SecO PriO]) Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 (750ns min, 250ns max) Interrupt: pin A routed to IRQ 23 Region 0: I/O ports at 09f0 [size=8] Region 1: I/O ports at 0bf0 [size=4] Region 2: I/O ports at 0970 [size=8] Region 3: I/O ports at 0b70 [size=4] Region 4: I/O ports at dc00 [size=16] Region 5: Memory at fe026000 (32-bit, non-prefetchable) [size=8K] Capabilities: [44] Power Management version 2 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Capabilities: [8c] SATA HBA <?> Capabilities: [b0] Message Signalled Interrupts: Mask- 64bit+ Queue=0/3 Enable- Address: 0000000000000000 Data: 0000 Capabilities: [ec] HyperTransport: MSI Mapping Enable- Fixed+ Kernel driver in use: ahci 00:0a.0 Ethernet controller: nVidia Corporation MCP78S [GeForce 8200] Ethernet (rev a2) Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 (250ns min, 5000ns max) Interrupt: pin A routed to IRQ 22 Region 0: Memory at fe02b000 (32-bit, non-prefetchable) [size=4K] Region 1: I/O ports at d800 [size=8] Region 2: Memory at fe02a000 (32-bit, non-prefetchable) [size=256] Region 3: Memory at fe029000 (32-bit, non-prefetchable) [size=16] Capabilities: [44] Power Management version 2 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold+) Status: D0 PME-Enable+ DSel=0 DScale=0 PME- Capabilities: [50] Message Signalled Interrupts: Mask+ 64bit+ Queue=0/4 Enable- Address: 0000000000000000 Data: 0000 Masking: 00000000 Pending: 00000000 Capabilities: [6c] HyperTransport: MSI Mapping Enable- Fixed+ Kernel driver in use: forcedeth Kernel modules: forcedeth-old, forcedeth 00:0b.0 PCI bridge: nVidia Corporation MCP78S [GeForce 8200] PCI Express Bridge (rev a1) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0, Cache Line Size: 32 bytes Bus: primary=00, secondary=02, subordinate=02, sec-latency=0 I/O behind bridge: 0000b000-0000bfff Memory behind bridge: fb000000-fcffffff Prefetchable memory behind bridge: 00000000d8000000-00000000e7ffffff Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- <SERR- <PERR- BridgeCtl: Parity- SERR- NoISA+ VGA+ MAbort- >Reset- FastB2B- PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn- Capabilities: [40] Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Capabilities: [48] Power Management version 2 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Capabilities: [60] HyperTransport: MSI Mapping Enable- Fixed- Mapping Address Base: 00000000fee00000 Kernel modules: shpchp 00:10.0 PCI bridge: nVidia Corporation MCP78S [GeForce 8200] PCI Express Bridge (rev a1) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0, Cache Line Size: 32 bytes Bus: primary=00, secondary=03, subordinate=03, sec-latency=0 I/O behind bridge: 0000a000-0000afff Memory behind bridge: fdc00000-fdcfffff Prefetchable memory behind bridge: 00000000fdb00000-00000000fdbfffff Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- <SERR- <PERR- BridgeCtl: Parity- SERR- NoISA+ VGA- MAbort- >Reset- FastB2B- PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn- Capabilities: [40] Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Capabilities: [48] Power Management version 3 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Capabilities: [50] Message Signalled Interrupts: Mask- 64bit+ Queue=0/1 Enable- Address: 0000000000000000 Data: 0000 Capabilities: [60] HyperTransport: MSI Mapping Enable+ Fixed- Mapping Address Base: 00000000fee00000 Capabilities: [80] Express (v2) Root Port (Slot+), MSI 00 DevCap: MaxPayload 256 bytes, PhantFunc 0, Latency L0s <64ns, L1 <1us ExtTag+ RBE+ FLReset- DevCtl: Report errors: Correctable+ Non-Fatal+ Fatal+ Unsupported+ RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+ MaxPayload 256 bytes, MaxReadReq 512 bytes DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend- LnkCap: Port #0, Speed 2.5GT/s, Width x16, ASPM L0s L1, Latency L0 <512ns, L1 <4us ClockPM- Suprise- LLActRep+ BwNot+ LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- Retrain- CommClk- ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt- LnkSta: Speed 2.5GT/s, Width x16, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt- SltCap: AttnBtn- PwrCtrl- MRL- AttnInd- PwrInd- HotPlug- Surpise- Slot # 1, PowerLimit 75.000000; Interlock- NoCompl- SltCtl: Enable: AttnBtn- PwrFlt- MRL- PresDet- CmdCplt- HPIrq- LinkChg- Control: AttnInd Off, PwrInd On, Power- Interlock- SltSta: Status: AttnBtn- PowerFlt- MRL- CmdCplt- PresDet- Interlock- Changed: MRL- PresDet- LinkState- RootCtl: ErrCorrectable- ErrNon-Fatal- ErrFatal- PMEIntEna- CRSVisible- RootCap: CRSVisible- RootSta: PME ReqID 0000, PMEStatus- PMEPending- Kernel driver in use: pcieport-driver Kernel modules: shpchp 00:12.0 PCI bridge: nVidia Corporation MCP78S [GeForce 8200] PCI Express Bridge (rev a1) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0, Cache Line Size: 32 bytes Bus: primary=00, secondary=04, subordinate=04, sec-latency=0 I/O behind bridge: 00009000-00009fff Memory behind bridge: fda00000-fdafffff Prefetchable memory behind bridge: 00000000fd900000-00000000fd9fffff Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- <SERR- <PERR- BridgeCtl: Parity- SERR- NoISA+ VGA- MAbort- >Reset- FastB2B- PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn- Capabilities: [40] Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Capabilities: [48] Power Management version 3 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Capabilities: [50] Message Signalled Interrupts: Mask- 64bit+ Queue=0/1 Enable- Address: 0000000000000000 Data: 0000 Capabilities: [60] HyperTransport: MSI Mapping Enable+ Fixed- Mapping Address Base: 00000000fee00000 Capabilities: [80] Express (v1) Root Port (Slot+), MSI 00 DevCap: MaxPayload 256 bytes, PhantFunc 0, Latency L0s <64ns, L1 <1us ExtTag+ RBE+ FLReset- DevCtl: Report errors: Correctable+ Non-Fatal+ Fatal+ Unsupported+ RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+ MaxPayload 256 bytes, MaxReadReq 512 bytes DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend- LnkCap: Port #2, Speed 2.5GT/s, Width x1, ASPM L0s L1, Latency L0 <512ns, L1 <4us ClockPM- Suprise- LLActRep+ BwNot- LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- Retrain- CommClk- ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt- LnkSta: Speed 2.5GT/s, Width x4, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt- SltCap: AttnBtn- PwrCtrl- MRL- AttnInd- PwrInd- HotPlug- Surpise- Slot # 3, PowerLimit 10.000000; Interlock- NoCompl- SltCtl: Enable: AttnBtn- PwrFlt- MRL- PresDet- CmdCplt- HPIrq- LinkChg- Control: AttnInd Off, PwrInd On, Power- Interlock- SltSta: Status: AttnBtn- PowerFlt- MRL- CmdCplt- PresDet- Interlock- Changed: MRL- PresDet- LinkState- RootCtl: ErrCorrectable- ErrNon-Fatal- ErrFatal- PMEIntEna- CRSVisible- RootCap: CRSVisible- RootSta: PME ReqID 0000, PMEStatus- PMEPending- Kernel driver in use: pcieport-driver Kernel modules: shpchp 00:13.0 PCI bridge: nVidia Corporation MCP78S [GeForce 8200] PCI Bridge (rev a1) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0, Cache Line Size: 32 bytes Bus: primary=00, secondary=05, subordinate=05, sec-latency=0 I/O behind bridge: 00008000-00008fff Memory behind bridge: fd800000-fd8fffff Prefetchable memory behind bridge: 00000000fd700000-00000000fd7fffff Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- <SERR- <PERR- BridgeCtl: Parity- SERR- NoISA+ VGA- MAbort- >Reset- FastB2B- PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn- Capabilities: [40] Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Capabilities: [48] Power Management version 3 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Capabilities: [50] Message Signalled Interrupts: Mask- 64bit+ Queue=0/1 Enable- Address: 0000000000000000 Data: 0000 Capabilities: [60] HyperTransport: MSI Mapping Enable+ Fixed- Mapping Address Base: 00000000fee00000 Capabilities: [80] Express (v1) Root Port (Slot+), MSI 00 DevCap: MaxPayload 256 bytes, PhantFunc 0, Latency L0s <64ns, L1 <1us ExtTag+ RBE+ FLReset- DevCtl: Report errors: Correctable+ Non-Fatal+ Fatal+ Unsupported+ RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+ MaxPayload 256 bytes, MaxReadReq 512 bytes DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend- LnkCap: Port #3, Speed 2.5GT/s, Width x1, ASPM L0s L1, Latency L0 <512ns, L1 <4us ClockPM- Suprise- LLActRep+ BwNot- LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- Retrain- CommClk- ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt- LnkSta: Speed 2.5GT/s, Width x1, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt- SltCap: AttnBtn- PwrCtrl- MRL- AttnInd- PwrInd- HotPlug- Surpise- Slot # 4, PowerLimit 10.000000; Interlock- NoCompl- SltCtl: Enable: AttnBtn- PwrFlt- MRL- PresDet- CmdCplt- HPIrq- LinkChg- Control: AttnInd Off, PwrInd On, Power- Interlock- SltSta: Status: AttnBtn- PowerFlt- MRL- CmdCplt- PresDet- Interlock- Changed: MRL- PresDet- LinkState- RootCtl: ErrCorrectable- ErrNon-Fatal- ErrFatal- PMEIntEna- CRSVisible- RootCap: CRSVisible- RootSta: PME ReqID 0000, PMEStatus- PMEPending- Kernel driver in use: pcieport-driver Kernel modules: shpchp 00:14.0 PCI bridge: nVidia Corporation MCP78S [GeForce 8200] PCI Bridge (rev a1) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0, Cache Line Size: 32 bytes Bus: primary=00, secondary=06, subordinate=06, sec-latency=0 I/O behind bridge: 00007000-00007fff Memory behind bridge: fd600000-fd6fffff Prefetchable memory behind bridge: 00000000fd500000-00000000fd5fffff Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- <SERR- <PERR- BridgeCtl: Parity- SERR- NoISA+ VGA- MAbort- >Reset- FastB2B- PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn- Capabilities: [40] Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Capabilities: [48] Power Management version 3 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Capabilities: [50] Message Signalled Interrupts: Mask- 64bit+ Queue=0/1 Enable- Address: 0000000000000000 Data: 0000 Capabilities: [60] HyperTransport: MSI Mapping Enable+ Fixed- Mapping Address Base: 00000000fee00000 Capabilities: [80] Express (v1) Root Port (Slot+), MSI 00 DevCap: MaxPayload 256 bytes, PhantFunc 0, Latency L0s <64ns, L1 <1us ExtTag+ RBE+ FLReset- DevCtl: Report errors: Correctable+ Non-Fatal+ Fatal+ Unsupported+ RlxdOrd+ ExtTag- PhantFunc- AuxPwr- NoSnoop+ MaxPayload 128 bytes, MaxReadReq 512 bytes DevSta: CorrErr- UncorrErr- FatalErr- UnsuppReq- AuxPwr- TransPend- LnkCap: Port #4, Speed 2.5GT/s, Width x1, ASPM L0s L1, Latency L0 <512ns, L1 <4us ClockPM- Suprise- LLActRep+ BwNot- LnkCtl: ASPM Disabled; RCB 64 bytes Disabled- Retrain- CommClk- ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt- LnkSta: Speed 2.5GT/s, Width x1, TrErr- Train- SlotClk+ DLActive- BWMgmt- ABWMgmt- SltCap: AttnBtn- PwrCtrl- MRL- AttnInd- PwrInd- HotPlug- Surpise- Slot # 5, PowerLimit 10.000000; Interlock- NoCompl- SltCtl: Enable: AttnBtn- PwrFlt- MRL- PresDet- CmdCplt- HPIrq- LinkChg- Control: AttnInd Off, PwrInd On, Power- Interlock- SltSta: Status: AttnBtn- PowerFlt- MRL- CmdCplt- PresDet- Interlock- Changed: MRL- PresDet- LinkState- RootCtl: ErrCorrectable- ErrNon-Fatal- ErrFatal- PMEIntEna- CRSVisible- RootCap: CRSVisible- RootSta: PME ReqID 0000, PMEStatus- PMEPending- Kernel driver in use: pcieport-driver Kernel modules: shpchp 00:18.0 Host bridge: Advanced Micro Devices [AMD] Family 10h [Opteron, Athlon64, Sempron] HyperTransport Configuration Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Capabilities: [80] HyperTransport: Host or Secondary Interface Command: WarmRst+ DblEnd- DevNum=0 ChainSide- HostHide+ Slave- <EOCErr- DUL- Link Control: CFlE- CST- CFE- <LkFail- Init+ EOC- TXO- <CRCErr=0 IsocEn- LSEn- ExtCTL- 64b- Link Config: MLWI=16bit DwFcIn- MLWO=16bit DwFcOut- LWI=16bit DwFcInEn- LWO=16bit DwFcOutEn- Revision ID: 3.00 Link Frequency: [b] Link Error: <Prot- <Ovfl- <EOC- CTLTm- Link Frequency Capability: 200MHz+ 300MHz- 400MHz+ 500MHz- 600MHz+ 800MHz+ 1.0GHz+ 1.2GHz+ 1.4GHz- 1.6GHz- Vend- Feature Capability: IsocFC+ LDTSTOP+ CRCTM- ECTLT- 64bA+ UIDRD- ExtRS- UCnfE- 00:18.1 Host bridge: Advanced Micro Devices [AMD] Family 10h [Opteron, Athlon64, Sempron] Address Map Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- 00:18.2 Host bridge: Advanced Micro Devices [AMD] Family 10h [Opteron, Athlon64, Sempron] DRAM Controller Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- 00:18.3 Host bridge: Advanced Micro Devices [AMD] Family 10h [Opteron, Athlon64, Sempron] Miscellaneous Control Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Capabilities: [f0] Secure device <?> 00:18.4 Host bridge: Advanced Micro Devices [AMD] Family 10h [Opteron, Athlon64, Sempron] Link Control Control: I/O- Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- 01:0a.0 FireWire (IEEE 1394): Agere Systems FW323 (rev 70) (prog-if 10 [OHCI]) Subsystem: ASUSTeK Computer Inc. Unknown device 8294 Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 32 (3000ns min, 6000ns max), Cache Line Size: 32 bytes Interrupt: pin A routed to IRQ 19 Region 0: Memory at fdeff000 (32-bit, non-prefetchable) [size=4K] Capabilities: [44] Power Management version 2 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME+ Kernel driver in use: ohci1394 Kernel modules: ohci1394 01:0b.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8169 Gigabit Ethernet (rev 10) Subsystem: Realtek Semiconductor Co., Ltd. RTL-8169 Gigabit Ethernet Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 64 (8000ns min, 16000ns max), Cache Line Size: 32 bytes Interrupt: pin A routed to IRQ 18 Region 0: I/O ports at cc00 [size=256] Region 1: Memory at fdefe000 (32-bit, non-prefetchable) [size=256] Expansion ROM at fdd00000 [disabled] [size=64K] Capabilities: [dc] Power Management version 2 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=375mA PME(D0-,D1+,D2+,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Kernel driver in use: r8169 Kernel modules: r8169 02:00.0 VGA compatible controller: nVidia Corporation GeForce 8300 (rev a2) (prog-if 00 [VGA controller]) Subsystem: ASUSTeK Computer Inc. Unknown device 82e2 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx- Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Interrupt: pin A routed to IRQ 20 Region 0: Memory at fb000000 (32-bit, non-prefetchable) [size=16M] Region 1: Memory at d8000000 (64-bit, prefetchable) [size=128M] Region 3: Memory at e6000000 (64-bit, prefetchable) [size=32M] Region 5: I/O ports at bc00 [size=128] [virtual] Expansion ROM at e0000000 [disabled] [size=128K] Capabilities: [60] Power Management version 2 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Capabilities: [68] Message Signalled Interrupts: Mask- 64bit+ Queue=0/0 Enable- Address: 0000000000000000 Data: 0000 Kernel driver in use: nvidia Kernel modules: nvidiafb, nvidia
Seems to be fixed for other reporters in 2.6.29
Problem is still there, maybe a little more reproducible and takes a little longer, now about 70GB at small rsize/wsize instead of less than 10GB. I tested with kernel 2.6.29.1, took .config from slamd64-12.2 2.6.27.7 kernel, made oldconfig, gave default anwers for all new options. Tested with the following NSF mount options: rsize=1024,wsize=1024 After about 70 GB copied from NFS server to local disk the network hangs. Doing an ifconfig eth0 down followed by an ifconfig eth0 up started communication again but after a while the kernel paniced. See http://web.telia.com/~u87421281/panic1.jpg for a screen image. Reconfigured the screen to try to get more info from panic. Repeated the test, hung again after about 70 GB copied network hangs again. Took some printouts from ifconfig and ethtool (shown below) did ifconfig down/up, comm retarts and hangs after a little while, repeated ifconfig down/up, and a while later the kernel panics. See: http://web.telia.com/~u87421281/panic2.jpg Status when network is hung: #ifconfig eth0 eth0 Link encap:Ethernet HWaddr 00:22:15:00:4b:31 inet addr:192.168.12.56 Bcast:192.168.12.255 Mask:255.255.255.0 inet6 addr: fe80::222:15ff:fe00:4b31/64 Scope:Link UP BROADCAST NOTRAILERS RUNNING MULTICAST MTU:1500 Metric:1 RX packets:67334342 errors:2 dropped:0 overruns:0 frame:2 TX packets:67401699 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:1000 RX bytes:82007943400 (76.3 GiB) TX bytes:15458771279 (14.3 GiB) Interrupt:21 Base address:0xe000 #ethtool -i eth0 driver: forcedeth version: 0.62 firmware-version: bus-info: 0000:00:0a.0 #ethtool -g eth0 Ring parameters for eth0: Pre-set maximums: RX: 16384 RX Mini: 0 RX Jumbo: 0 TX: 16384 Current hardware settings: RX: 128 RX Mini: 0 RX Jumbo: 0 TX: 256 #ethtool -S eth0 NIC statistics: tx_bytes: 15458771745 tx_zero_rexmt: 67401721 tx_one_rexmt: 0 tx_many_rexmt: 0 tx_late_collision: 0 tx_fifo_errors: 0 tx_carrier_errors: 0 tx_excess_deferral: 0 tx_retry_error: 0 rx_frame_error: 0 rx_extra_byte: 0 rx_late_collision: 0 rx_runt: 0 rx_frame_too_long: 0 rx_over_errors: 0 rx_crc_errors: 2 rx_frame_align_error: 0 rx_length_error: 0 rx_unicast: 67333558 rx_multicast: 0 rx_broadcast: 789 rx_packets: 67334347 rx_errors_total: 2 tx_errors_total: 0 tx_deferral: 0 tx_packets: 0 rx_bytes: 0 tx_pause: 0 rx_pause: 0 rx_drop_frame: 0 tx_unicast: 67401652 tx_multicast: 6 tx_broadcast: 63 #ethtool -d eth0 Offset Values -------- ----- 000: 00 00 00 00 ff 00 00 00 03 00 00 00 ca 03 47 03 010: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 020: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 030: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 040: 0e e2 20 04 55 a8 00 00 20 2e 00 00 00 00 00 00 050: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 060: 00 00 00 00 00 00 00 00 00 00 00 00 ff ff 00 00 070: ff ff 00 00 ff ff 00 00 ff ff 00 00 00 00 00 00 080: 3d 0f 3b 00 01 00 00 40 00 00 04 00 28 00 7f 00 090: 1c 06 00 00 01 00 00 00 00 00 10 00 00 ff 03 00 0a0: 0f 05 14 00 16 00 00 00 00 22 15 00 4b 31 00 00 0b0: 01 00 00 00 00 01 00 00 cd cc 00 ff b4 cf 00 00 0c0: 02 00 00 1c ce b3 18 7f 01 00 00 00 01 00 00 00 0d0: 01 00 00 00 01 00 00 00 01 00 00 00 01 00 00 00 0e0: 01 00 00 00 01 00 00 00 01 00 00 00 01 00 00 00 0f0: 01 00 00 00 01 00 00 00 01 00 00 00 00 00 00 00 100: 00 48 c1 3c 00 40 c1 3c ff 00 7f 00 00 80 00 00 110: 32 00 01 00 00 00 00 00 03 00 00 00 c0 4e c1 3c 120: 00 46 c1 3c 40 10 97 4c eb ff 00 a0 00 41 48 00 130: 2c 05 00 80 cc 4e c1 3c 40 45 c1 3c 00 80 e0 0f 140: 20 41 30 00 00 26 c0 80 02 00 00 00 02 00 00 00 150: 01 00 00 00 00 00 00 00 01 00 00 00 00 00 00 00 160: 02 00 00 00 02 00 00 00 02 00 00 00 02 00 00 00 170: f0 03 60 05 00 c0 00 00 00 00 00 00 00 00 00 00 180: 16 00 00 00 08 00 00 00 6d 79 94 01 03 81 00 00 190: 2a 00 00 00 00 3c 00 00 80 00 00 00 83 bd 00 00 1a0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 1b0: 00 00 00 00 00 00 00 00 00 00 00 00 b4 cf 00 00 1c0: 02 00 00 1c ce b3 18 7f 01 00 00 00 01 00 00 00 1d0: 01 00 00 00 01 00 00 00 01 00 00 00 01 00 00 00 1e0: 01 00 00 00 01 00 00 00 01 00 00 00 01 00 00 00 1f0: 01 00 00 00 01 00 00 00 01 00 00 00 00 00 00 00 200: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 210: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 220: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 230: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 240: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 250: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 260: 00 00 00 00 00 00 00 00 01 50 02 fe 00 01 00 00 270: 30 00 00 00 a2 00 00 00 00 00 00 00 a2 01 00 00 280: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 290: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 2a0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 2b0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 2c0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 2d0: 00 00 00 00 00 00 00 00 00 00 00 00 80 02 00 00 2e0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 2f0: 00 00 00 00 40 01 00 00 00 00 00 00 40 01 00 00 300: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 310: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 320: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 330: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 340: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 350: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 360: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 370: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 380: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 390: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 3a0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 3b0: 00 00 00 00 04 00 00 00 ff ff 00 00 ff ff 00 00 3c0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 3d0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 3e0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 3f0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 400: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 410: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 420: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 430: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 440: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 450: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 460: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 470: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 480: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 490: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 4a0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 4b0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 4c0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 4d0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 4e0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 4f0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 500: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 510: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 520: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 530: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 540: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 550: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 560: 00 00 00 00 00 00 00 00 01 50 02 fe 00 01 00 00 570: 30 00 00 00 a2 00 00 00 00 00 00 00 a2 01 00 00 580: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 590: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 5a0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 5b0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 5c0: 00 00 06 00 ff ff 00 00 00 00 00 00 00 00 00 00 5d0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 5e0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 5f0: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 600: 00 00 00 00
I have the same problem with kernel versions 2.6.29.4 and 2.6.27 vanilla + openvz briullov.1 patches so I am now using my old 2.6.21-gentoo-r3 kernel. For me the problem occurred after copying files onto the server, it hanged after 300MB of files and on 40GB another time. I have an "MSI K9N Ultra" motherboard with two gigabit "nVidia Corporation MCP55 Ethernet (rev a2)" network controllers. On my machine found the kernel not panicing but maybe i rebootet to fast with a another kernel version ;-) Greets, Dennis
I forgot to mention I tried the tx timeout patch from bug #13258 (on my 2.6.29.4 from kernel.org) but it also does not fix the problem.
I would like to add some more information and possible work around to this issue. I have been experiencing similar issues on my MythTV frontend ( 2.6.30 custom kernel, AMD Athlon(tm) 64 X2 Dual Core Processor 4400+, nVIDIA MCP67 PCI bridge and MCP67 ethernet controller. The front end is NFS-rooted (no harddisk) and the on-board NIC was hanging at random points and vidio playback sometimes stalling. For quite some time I have had a realtek NIC in the system to overcome this problem, however I have just revisited this problem with the on-board nVIDIA NIC to get rid for the PCI realtek NIC. (This point is important to establish that it is the nVidia NIC and not another part of the system) Emperically I found that if I kept a constant stream of network traffic addressed to the device the slowness/stalling would subside. The system would behave as expected whilst the receive network activity was active. Removing the network activity would cause the issue to re-appear. I achieved this network traffic by issueing a flood ping to the network card from another box on the network. (ping -f <ip address of nvidia card> ) I have since discovered that disabling MSI interrupts globally also eleviates the issue. This was acheived by booting the kernel with the "pci=nomsi" kernel option. Since doing so the specific video clip which reliably caused the issue plays faultlessly. (Dont ask me why this video in particular tickled the problem repeatably - no idea) Having found the gem that turning of MSI interrupts resolves the problem I looking into the kernel driver and found the follwoing code in the forcedeth driver. static void nv_msi_workaround(struct fe_priv *np) /* Need to toggle the msi irq mask within the ethernet device, * otherwise, future interrupts will not be detected. */ if (np->msi_flags & NV_MSI_ENABLED) { u8 __iomem *base = np->base; writel(0, base + NvRegMSIIrqMask); writel(NVREG_MSI_VECTOR_0_ENABLED, base + NvRegMSIIrqMask); } } The above code being related to the following forum thread. http://kerneltrap.org/mailarchive/linux-netdev/2008/6/7/2060274 It is my hypothesis is that the MSI irq issue is still there in silicon of the MCP67 generation and the 'workaround' does not completely overcome the issue. By bombarding the interface with inbound traffic I was causing many more RX interupts to occur so the occational missed interrupt does not have any effect which under normal conditions was causing the network instability. This scenario would fit my symtoms almost perfectly. Maybe disabling MSI irq globally is quite harsh however the system is stable so I will probably leave it in this manner. I think there may be a method to only disable MSI interrupts of a sindle PCI device which may help isolate the issue. Please let me know if there is anything further I can add to the bug report that would make it more useful. I am happy to further testing now that I have a repeatable way of causing the issue. Regard, David System Details: uname -a Linux tvnow 2.6.30-tvnow #1 SMP Sun Mar 21 14:49:27 EST 2010 i686 GNU/Linux cat /proc/cpuinfo Linux tvnow 2.6.30-tvnow #1 SMP Sun Mar 21 14:49:27 EST 2010 i686 GNU/Linux tvnow:~# cat /proc/cpuinfo processor : 0 vendor_id : AuthenticAMD cpu family : 15 model : 107 model name : AMD Athlon(tm) 64 X2 Dual Core Processor 4400+ stepping : 1 cpu MHz : 2300.058 cache size : 512 KB physical id : 0 siblings : 1 core id : 0 cpu cores : 1 apicid : 0 initial apicid : 0 fdiv_bug : no hlt_bug : no f00f_bug : no coma_bug : no fpu : yes fpu_exception : yes cpuid level : 1 wp : yes flags : fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat pse36 clflush mmx fxsr sse sse2 ht syscall nx mmxext fxsr_opt rdtscp lm 3dnowext 3dnow up extd_apicid pni cx16 lahf_lm cmp_legacy svm extapic cr8_legacy 3dnowprefetch bogomips : 4600.11 clflush size : 64 power management: ts fid vid ttp tm stc 100mhzsteps tvnow:~# lspci 00:00.0 RAM memory: nVidia Corporation MCP67 Memory Controller (rev a2) 00:01.0 ISA bridge: nVidia Corporation MCP67 ISA Bridge (rev a2) 00:01.1 SMBus: nVidia Corporation MCP67 SMBus (rev a2) 00:02.0 USB Controller: nVidia Corporation MCP67 OHCI USB 1.1 Controller (rev a2) 00:02.1 USB Controller: nVidia Corporation MCP67 EHCI USB 2.0 Controller (rev a2) 00:04.0 USB Controller: nVidia Corporation MCP67 OHCI USB 1.1 Controller (rev a2) 00:04.1 USB Controller: nVidia Corporation MCP67 EHCI USB 2.0 Controller (rev a2) 00:07.0 Audio device: nVidia Corporation MCP67 High Definition Audio (rev a1) 00:08.0 PCI bridge: nVidia Corporation MCP67 PCI Bridge (rev a2) 00:09.0 IDE interface: nVidia Corporation MCP67 AHCI Controller (rev a2) 00:0a.0 Ethernet controller: nVidia Corporation MCP67 Ethernet (rev a2) 00:0b.0 PCI bridge: nVidia Corporation MCP67 PCI Express Bridge (rev a2) 00:0c.0 PCI bridge: nVidia Corporation MCP67 PCI Express Bridge (rev a2) 00:0d.0 PCI bridge: nVidia Corporation MCP67 PCI Express Bridge (rev a2) 00:0e.0 PCI bridge: nVidia Corporation MCP67 PCI Express Bridge (rev a2) 00:0f.0 PCI bridge: nVidia Corporation MCP67 PCI Express Bridge (rev a2) 00:10.0 PCI bridge: nVidia Corporation MCP67 PCI Express Bridge (rev a2) 00:11.0 PCI bridge: nVidia Corporation MCP67 PCI Express Bridge (rev a2) 00:12.0 VGA compatible controller: nVidia Corporation GeForce 7050 PV / nForce 630a (rev a2) 00:18.0 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] HyperTransport Technology Configuration 00:18.1 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Address Map 00:18.2 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] DRAM Controller 00:18.3 Host bridge: Advanced Micro Devices [AMD] K8 [Athlon64/Opteron] Miscellaneous Control 01:07.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8139/8139C/8139C+ (rev 10) tvnow:~# ethtool eth0 Settings for eth0: Supported ports: [ MII ] Supported link modes: 10baseT/Half 10baseT/Full 100baseT/Half 100baseT/Full 1000baseT/Full Supports auto-negotiation: Yes Advertised link modes: 10baseT/Half 10baseT/Full 100baseT/Half 100baseT/Full 1000baseT/Full Advertised auto-negotiation: Yes Speed: 100Mb/s Duplex: Full Port: MII PHYAD: 1 Transceiver: external Auto-negotiation: on Supports Wake-on: g Wake-on: d Link detected: yes ppdev 6348 0 - Live 0xf8123000 nvidia 8862988 28 - Live 0xfa3bf000 (P) agpgart 30836 1 nvidia, Live 0xf80cf000 binfmt_misc 7120 1 - Live 0xf8025000 vboxnetflt 72328 0 - Live 0xf9328000 vboxnetadp 66864 0 - Live 0xf92fb000 vboxdrv 102576 1 vboxnetflt, Live 0xf92c7000 nfs 221584 3 - Live 0xf926b000 lockd 57972 1 nfs, Live 0xf920c000 fscache 34440 1 nfs, Live 0xf91e8000 nfs_acl 2640 1 nfs, Live 0xf91d6000 auth_rpcgss 31420 1 nfs, Live 0xf91c5000 sunrpc 163792 15 nfs,lockd,nfs_acl,auth_rpcgss, Live 0xf918c000 fuse 47752 1 - Live 0xf9148000 w83627ehf 17204 0 - Live 0xf8fdd000 hwmon_vid 2480 1 w83627ehf, Live 0xf8fd1000 firewire_sbp2 12700 0 - Live 0xf8fc4000 loop 13324 0 - Live 0xf8fb3000 snd_hda_codec_analog 53456 1 - Live 0xf8e0f000 tuner_xc2028 17512 2 - Live 0xf8cbe000 snd_hda_intel 22192 0 - Live 0xf8ca6000 snd_hda_codec 63580 2 snd_hda_codec_analog,snd_hda_intel, Live 0xf8c89000 snd_hwdep 6120 1 snd_hda_codec, Live 0xf8c6c000 zl10353 6340 2 - Live 0xf8c63000 snd_pcm_oss 32228 0 - Live 0xf8c51000 mt352 5352 1 - Live 0xf8c40000 snd_mixer_oss 12368 1 snd_pcm_oss, Live 0xf8afd000 dvb_usb_cxusb 39104 25 - Live 0xf8ad9000 snd_pcm 62396 3 snd_hda_intel,snd_hda_codec,snd_pcm_oss, Live 0xf8ab6000 snd_seq_midi 5688 0 - Live 0xf8a99000 dvb_bt8xx 13044 12 - Live 0xf8955000 bt878 8736 1 dvb_bt8xx, Live 0xf8941000 snd_rawmidi 18596 1 snd_seq_midi, Live 0xf8931000 dib7000p 15128 1 dvb_usb_cxusb, Live 0xf891b000 dibx000_common 3876 1 dib7000p, Live 0xf890d000 snd_seq_midi_event 6212 1 snd_seq_midi, Live 0xf88fe000 dvb_usb 17340 1 dvb_usb_cxusb, Live 0xf88e9000 bttv 104796 2 dvb_bt8xx,bt878, Live 0xf889c000 snd_seq 42436 2 snd_seq_midi,snd_seq_midi_event, Live 0xf8867000 v4l2_common 16092 1 bttv, Live 0xf8848000 videodev 32728 2 bttv,v4l2_common, Live 0xf8826000 v4l1_compat 11204 1 videodev, Live 0xf880e000 dvb_core 78532 3 dvb_bt8xx,dib7000p,dvb_usb, Live 0xf87d4000 joydev 8676 0 - Live 0xf87b1000 dib0070 7940 1 dvb_usb_cxusb, Live 0xf87a2000 snd_timer 17436 2 snd_pcm,snd_seq, Live 0xf8793000 ir_common 46716 1 bttv, Live 0xf876d000 i2c_algo_bit 4860 1 bttv, Live 0xf8756000 snd_seq_device 6136 3 snd_seq_midi,snd_rawmidi,snd_seq, Live 0xf874b000 videobuf_dma_sg 11064 1 bttv, Live 0xf873a000 videobuf_core 16028 2 bttv,videobuf_dma_sg, Live 0xf8725000 btcx_risc 4372 1 bttv, Live 0xf8715000 tveeprom 10704 1 bttv, Live 0xf8706000 i2c_nforce2 6448 0 - Live 0xf86f9000 k8temp 3984 0 - Live 0xf86ee000 ir_core 6504 2 bttv,ir_common, Live 0xf86de000 i2c_core 20844 15 nvidia,tuner_xc2028,zl10353,mt352,dvb_bt8xx,dib7000p,dibx000_common,dvb_usb,bttv,v4l2_common,videodev,dib0070,i2c_algo_bit,tveeprom,i2c_nforce2, Live 0xf86ce000 evdev 8028 16 - Live 0xf86b7000 pcspkr 2104 0 - Live 0xf86ab000 snd 49060 11 snd_hda_codec_analog,snd_hda_intel,snd_hda_codec,snd_hwdep,snd_pcm_oss,snd_mixer_oss,snd_pcm,snd_rawmidi,snd_seq,snd_timer,snd_seq_device, Live 0xf8696000 soundcore 6184 1 snd, Live 0xf867c000 snd_page_alloc 8180 2 snd_hda_intel,snd_pcm, Live 0xf8670000 parport_pc 22392 0 - Live 0xf8654000 parport 31144 2 ppdev,parport_pc, Live 0xf862f000 asus_atk0110 6616 0 - Live 0xf861b000 psmouse 37528 0 - Live 0xf8607000 button 5060 0 - Live 0xf85ec000 serio_raw 4560 0 - Live 0xf85db000 processor 34560 0 - Live 0xf85b9000 ext3 107172 3 - Live 0xf858a000 jbd 41036 1 ext3, Live 0xf8554000 mbcache 6924 1 ext3, Live 0xf853d000 dm_mod 49992 10 - Live 0xf8394000 ohci_hcd 19884 0 - Live 0xf8369000 ide_gd_mod 19856 3 - Live 0xf8356000 ide_cd_mod 24484 0 - Live 0xf8341000 cdrom 30316 1 ide_cd_mod, Live 0xf8329000 usbhid 31568 0 - Live 0xf82bf000 hid 34528 1 usbhid, Live 0xf82a3000 ata_generic 4340 0 - Live 0xf828f000 uhci_hcd 19136 0 - Live 0xf8280000 ide_pci_generic 3632 0 - Live 0xf8265000 sata_nv 19688 0 - Live 0xf8232000 ehci_hcd 29812 0 - Live 0xf821b000 amd74xx 5428 2 - Live 0xf8208000 firewire_ohci 18952 0 - Live 0xf81f6000 firewire_core 37736 2 firewire_sbp2,firewire_ohci, Live 0xf81de000 crc_itu_t 2148 1 firewire_core, Live 0xf81cc000 libata 151044 2 ata_generic,sata_nv, Live 0xf819e000 forcedeth 46584 0 - Live 0xf8159000 ide_core 87940 4 ide_gd_mod,ide_cd_mod,ide_pci_generic,amd74xx, Live 0xf812c000 usbcore 126052 9 dvb_usb_cxusb,dvb_usb,ohci_hcd,usbhid,uhci_hcd,ehci_hcd, Live 0xf80e9000 scsi_mod 131800 2 firewire_sbp2,libata, Live 0xf8097000 floppy 46220 0 - Live 0xf8058000 thermal 12580 0 - Live 0xf803c000 fan 4044 0 - Live 0xf802e000 thermal_sys 13140 3 processor,thermal,fan, Live 0xf800b000