Most recent kernel where this bug did not occur:2.6.17.1 vanilla Distribution:Gentoo Hardware Environment: Software Environment: Problem Description:Internet connection randomly freezes with Motorola SB5100 Cable Modem (USB), with no dmesg messages. Steps to reproduce: I just installed 2.6.17.1 vanilla kernel. --------------------------------------------------------------------------- Hi.. I am using kernel-2.6.17.1 for a few days and i had a few problems with my network connections. My eth1 (Motorola SB5100 Cable Modem(USB)) freezes randomly, with no justification. It keeps showing com ifconfig and also show an ip. It happens more freqnuently when i am using amule, dmesg don't show anything. To have connection again i can unplug and plug the usb cable, or simply do /etc/init.d/net.eth1 restart. I am also using dhcpcd 2.0.7, but i think there is no problem is that, because with 2.6.16 kernel i have no problems at all. -- cat /proc/version Linux version 2.6.17-ck1 (root@khona4) (gcc version 4.1.1 (Gentoo 4.1.1)) #3 SMP PREEMPT Sun Jun 25 15:47:02 WEST 2006 -- cat /proc/modules cat /proc/modules w83627hf 22672 0 - Live 0xfe54a000 hwmon_vid 2816 1 w83627hf, Live 0xfe529000 i2c_isa 3712 1 w83627hf, Live 0xfe514000 snd_seq_midi 6432 0 - Live 0xfe509000 snd_emu10k1_synth 6656 0 - Live 0xf987d000 snd_emux_synth 31616 1 snd_emu10k1_synth, Live 0xfe538000 snd_seq_virmidi 5760 1 snd_emux_synth, Live 0xfe506000 snd_seq_midi_emul 6016 1 snd_emux_synth, Live 0xf9877000 snd_pcm_oss 33696 0 - Live 0xfe52e000 snd_mixer_oss 15232 1 snd_pcm_oss, Live 0xfe50c000 snd_seq_oss 29952 0 - Live 0xfd8f7000 snd_seq_midi_event 6144 3 snd_seq_midi,snd_seq_virmidi,snd_seq_oss, Live 0xf987a000 snd_seq 46928 8 snd_seq_midi,snd_emux_synth,snd_seq_virmidi,snd_seq_midi_emul,snd_seq_oss,snd_seq_midi_event, Live 0xfe516000 snd_emu10k1 112292 2 snd_emu10k1_synth, Live 0xfd881000 snd_rawmidi 19232 3 snd_seq_midi,snd_seq_virmidi,snd_emu10k1, Live 0xf9855000 snd_ac97_codec 88352 1 snd_emu10k1, Live 0xfd89e000 snd_ac97_bus 2176 1 snd_ac97_codec, Live 0xf982b000 snd_pcm 65284 3 snd_pcm_oss,snd_emu10k1,snd_ac97_codec, Live 0xf9862000 snd_seq_device 6412 7 snd_seq_midi,snd_emu10k1_synth,snd_emux_synth,snd_seq_oss,snd_seq,snd_emu10k1,snd_rawmidi, Live 0xf984b000 snd_timer 18820 3 snd_seq,snd_emu10k1,snd_pcm, Live 0xf9838000 snd_page_alloc 7816 2 snd_emu10k1,snd_pcm, Live 0xf9835000 snd_util_mem 3584 2 snd_emux_synth,snd_emu10k1, Live 0xf9833000 snd_hwdep 6916 2 snd_emux_synth,snd_emu10k1, Live 0xf9830000 snd 43748 15 snd_emux_synth,snd_seq_virmidi,snd_pcm_oss,snd_mixer_oss,snd_seq_oss,snd_seq,snd_emu10k1,snd_rawmidi,snd_ac97_codec,snd_pcm,snd_seq_device,snd_timer,snd_hwdep, Live 0xf983f000 soundcore 7648 1 snd, Live 0xf982d000 --- cat /proc/ioports 0000-001f : dma1 0020-0021 : pic1 0040-0043 : timer0 0050-0053 : timer1 0060-006f : keyboard 0080-008f : dma page reg 00a0-00a1 : pic2 00c0-00df : dma2 00f0-00ff : fpu 0170-0177 : ide1 01f0-01f7 : ide0 0295-0296 : w83627hf 02f8-02ff : serial 0376-0376 : ide1 03c0-03df : vesafb 03f6-03f6 : ide0 03f8-03ff : serial 0400-041f : 0000:00:1f.3 0480-04bf : 0000:00:1f.0 0800-087f : 0000:00:1f.0 d000-dfff : PCI Bus #02 d800-d8ff : 0000:02:05.0 d800-d8ff : sk98lin df80-df9f : 0000:02:0c.0 df80-df9f : EMU10K1 dfe0-dfe7 : 0000:02:0c.1 eec0-eedf : 0000:00:1d.0 eec0-eedf : uhci_hcd ef00-ef1f : 0000:00:1d.1 ef00-ef1f : uhci_hcd ef20-ef3f : 0000:00:1d.2 ef20-ef3f : uhci_hcd ef40-ef5f : 0000:00:1d.3 ef40-ef5f : uhci_hcd ef90-ef9f : 0000:00:1f.2 ef90-ef9f : libata efa0-efa7 : 0000:00:1f.2 efa0-efa7 : libata efa8-efab : 0000:00:1f.2 efa8-efab : libata efac-efaf : 0000:00:1f.2 efac-efaf : libata efe0-efe7 : 0000:00:1f.2 efe0-efe7 : libata fc00-fc0f : 0000:00:1f.1 fc00-fc07 : ide0 fc08-fc0f : ide1 --- cat /proc/iomem # cat /proc/iomem 00000000-0009fbff : System RAM 0009fc00-0009ffff : reserved 000a0000-000bffff : Video RAM area 000c0000-000cfbff : Video ROM 000d0000-000d49ff : Adapter ROM 000f0000-000fffff : System ROM 00100000-3ff2ffff : System RAM 00100000-0035ecb4 : Kernel code 0035ecb5-0040c467 : Kernel data 3ff30000-3ff3ffff : ACPI Tables 3ff40000-3ffeffff : ACPI Non-volatile Storage 3fff0000-3fffffff : reserved 50000000-500003ff : 0000:00:1f.1 d7f00000-f7efffff : PCI Bus #01 e0000000-efffffff : 0000:01:00.0 e0000000-efffffff : nvidiafb f8000000-fbffffff : 0000:00:00.0 fc900000-fe9fffff : PCI Bus #01 fd000000-fdffffff : 0000:01:00.0 fd000000-fdffffff : nvidiafb fe9e0000-fe9fffff : 0000:01:00.0 fea00000-feafffff : PCI Bus #02 feafc000-feafffff : 0000:02:05.0 feafc000-feafffff : sk98lin febffc00-febfffff : 0000:00:1d.7 febffc00-febfffff : ehci_hcd ffb80000-ffffffff : reserved -- lspci -vvv 00:00.0 Host bridge: Intel Corporation 82875P/E7210 Memory Controller Hub (rev 02) Subsystem: ASUSTeK Computer Inc. Unknown device 80f6 Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort+ >SERR- <PERR- Latency: 0 Region 0: Memory at f8000000 (32-bit, prefetchable) [size=64M] Capabilities: [e4] Vendor Specific Information Capabilities: [a0] AGP version 3.0 Status: RQ=32 Iso- ArqSz=2 Cal=2 SBA+ ITACoh- GART64- HTrans- 64bit- FW+ AGP3+ Rate=x4,x8 Command: RQ=1 ArqSz=0 Cal=2 SBA+ AGP- GART64- 64bit- FW- Rate=<none> 00:01.0 PCI bridge: Intel Corporation 82875P Processor to AGP Controller (rev 02) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- Status: Cap- 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 96 Bus: primary=00, secondary=01, subordinate=01, sec-latency=96 I/O behind bridge: 0000f000-00000fff Memory behind bridge: fc900000-fe9fffff Prefetchable memory behind bridge: d7f00000-f7efffff Secondary status: 66MHz+ FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort+ <SERR- <PERR- BridgeCtl: Parity- SERR- NoISA- VGA+ MAbort- >Reset- FastB2B- 00:1d.0 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB UHCI Controller #1 (rev 02) (prog-if 00 [UHCI]) Subsystem: ASUSTeK Computer Inc. P5P800-MX Mainboard Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 0 Interrupt: pin A routed to IRQ 16 Region 4: I/O ports at eec0 [size=32] 00:1d.1 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB UHCI Controller #2 (rev 02) (prog-if 00 [UHCI]) Subsystem: ASUSTeK Computer Inc. P5P800-MX Mainboard Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 0 Interrupt: pin B routed to IRQ 19 Region 4: I/O ports at ef00 [size=32] 00:1d.2 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB UHCI Controller #3 (rev 02) (prog-if 00 [UHCI]) Subsystem: ASUSTeK Computer Inc. P5P800-MX Mainboard Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 0 Interrupt: pin C routed to IRQ 18 Region 4: I/O ports at ef20 [size=32] 00:1d.3 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB UHCI Controller #4 (rev 02) (prog-if 00 [UHCI]) Subsystem: ASUSTeK Computer Inc. P5P800-MX Mainboard Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 0 Interrupt: pin A routed to IRQ 16 Region 4: I/O ports at ef40 [size=32] 00:1d.7 USB Controller: Intel Corporation 82801EB/ER (ICH5/ICH5R) USB2 EHCI Controller (rev 02) (prog-if 20 [EHCI]) Subsystem: ASUSTeK Computer Inc. P5P800-MX Mainboard Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 0 Interrupt: pin D routed to IRQ 23 Region 0: Memory at febffc00 (32-bit, non-prefetchable) [size=1K] Capabilities: [50] Power Management version 2 Flags: PMEClk- DSI- D1- D2- AuxCurrent=375mA PME(D0+,D1-,D2-,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Capabilities: [58] Debug port 00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev c2) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 0 Bus: primary=00, secondary=02, subordinate=02, sec-latency=96 I/O behind bridge: 0000d000-0000dfff Memory behind bridge: fea00000-feafffff Prefetchable memory behind bridge: fff00000-000fffff Secondary status: 66MHz- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort+ <SERR- <PERR- BridgeCtl: Parity- SERR+ NoISA- VGA- MAbort- >Reset- FastB2B- 00:1f.0 ISA bridge: Intel Corporation 82801EB/ER (ICH5/ICH5R) LPC Interface Bridge (rev 02) Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 0 00:1f.1 IDE interface: Intel Corporation 82801EB/ER (ICH5/ICH5R) IDE Controller (rev 02) (prog-if 8a [Master SecP PriP]) Subsystem: ASUSTeK Computer Inc. P5P800-MX Mainboard Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 0 Interrupt: pin A routed to IRQ 18 Region 0: I/O ports at <unassigned> Region 1: I/O ports at <unassigned> Region 2: I/O ports at <unassigned> Region 3: I/O ports at <unassigned> Region 4: I/O ports at fc00 [size=16] Region 5: Memory at 50000000 (32-bit, non-prefetchable) [size=1K] 00:1f.2 RAID bus controller: Intel Corporation 82801ER (ICH5R) SATA Controller (rev 02) (prog-if 8f) Subsystem: ASUSTeK Computer Inc. Unknown device 80a6 Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 0 Interrupt: pin A routed to IRQ 18 Region 0: I/O ports at efe0 [size=8] Region 1: I/O ports at efac [size=4] Region 2: I/O ports at efa0 [size=8] Region 3: I/O ports at efa8 [size=4] Region 4: I/O ports at ef90 [size=16] 00:1f.3 SMBus: Intel Corporation 82801EB/ER (ICH5/ICH5R) SMBus Controller (rev 02) Subsystem: ASUSTeK Computer Inc. P4P800 Mainboard Control: I/O+ Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Interrupt: pin B routed to IRQ 17 Region 4: I/O ports at 0400 [size=32] 01:00.0 VGA compatible controller: nVidia Corporation NV31 [GeForce FX 5600] (rev a1) (prog-if 00 [VGA]) Subsystem: Micro-Star International Co., Ltd. Unknown device 9124 Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 96 (1250ns min, 250ns max) Interrupt: pin A routed to IRQ 16 Region 0: Memory at fd000000 (32-bit, non-prefetchable) [size=16M] Region 1: Memory at e0000000 (32-bit, prefetchable) [size=256M] Expansion ROM at fe9e0000 [disabled] [size=128K] Capabilities: [60] Power Management version 2 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- Capabilities: [44] AGP version 3.0 Status: RQ=32 Iso- ArqSz=0 Cal=3 SBA+ ITACoh- GART64- HTrans- 64bit- FW+ AGP3+ Rate=x4,x8 Command: RQ=1 ArqSz=0 Cal=0 SBA- AGP- GART64- 64bit- FW- Rate=<none> 02:05.0 Ethernet controller: 3Com Corporation 3c940 10/100/1000Base-T [Marvell] (rev 12) Subsystem: ASUSTeK Computer Inc. A7V600/P4P800/K8V motherboard Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR+ FastB2B- Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 96 (5750ns min, 7750ns max), Cache Line Size: 16 bytes Interrupt: pin A routed to IRQ 22 Region 0: Memory at feafc000 (32-bit, non-prefetchable) [size=16K] Region 1: I/O ports at d800 [size=256] Capabilities: [48] Power Management version 2 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold+) Status: D0 PME-Enable- DSel=0 DScale=1 PME- Capabilities: [50] Vital Product Data 02:0c.0 Multimedia audio controller: Creative Labs SB Live! EMU10k1 (rev 08) Subsystem: Creative Labs CT4832 SBLive! Value Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 96 (500ns min, 5000ns max) Interrupt: pin A routed to IRQ 20 Region 0: I/O ports at df80 [size=32] Capabilities: [dc] Power Management version 1 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- 02:0c.1 Input device controller: Creative Labs SB Live! Game Port (rev 08) Subsystem: Creative Labs Gameport Joystick Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- Latency: 96 Region 0: I/O ports at dfe0 [size=8] Capabilities: [dc] Power Management version 1 Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-) Status: D0 PME-Enable- DSel=0 DScale=0 PME- -- cat /proc/scsi/scsi cat /proc/scsi/scsi Attached devices: Host: scsi0 Channel: 00 Id: 00 Lun: 00 Vendor: ATA Model: ST3120026AS Rev: 3.05 Type: Direct-Access ANSI SCSI revision: 05 Host: scsi1 Channel: 00 Id: 00 Lun: 00 Vendor: ATA Model: ST3120827AS Rev: 3.43 Type: Direct-Access -- I am following just the indications to submit a bug, i neved had done it before. If you need something more just ask. Is .config file needed? (both of them: 2.6.16-ck11 with no problems and 2.6.17.1+2.6.17-ck1 problematic.
On Sun, 25 Jun 2006 09:02:03 -0700 bugme-daemon@bugzilla.kernel.org wrote: > Most recent kernel where this bug did not occur:2.6.17.1 vanilla I think you mean it _did_ occur in 2.6.17.1. That question is asking what is the most recent kernel in which it did _not_ occur. What net driver is that machine using? tg3? skge?
The last version that it did not occour is 2.6.16-ck11 (i used to use ck sources.. i only tested vanilla just to be shore) drivers: - CONFIG_USB_USBNET=y CONFIG_USB_NET_CDCETHER=y CONFIG_USB=y CONFIG_USB_UHCI_HCD=y CONFIG_SK98LIN=y - is this what you are asking for? i will also submit .config Another thing could be important is that i cannot access to my modem status page (http://192.168.100.1).
Created attachment 8413 [details] kernel configuration file
I've noticed another thing. With 2.6.16-ck11 my network was no errors. eth1 Link encap:Ethernet HWaddr 00:11:80:EB:65:9A inet addr:217.129.133.137 Bcast:217.129.143.255 Mask:255.255.240.0 UP BROADCAST NOTRAILERS RUNNING MULTICAST MTU:1500 Metric:1 RX packets:699616 errors:0 dropped:0 overruns:0 frame:0 TX packets:586397 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:1000 RX bytes:666051647 (635.1 Mb) TX bytes:130582486 (124.5 Mb) while with 2.6.17.1 i noticed errors RX packets:699616 errors:!!WHERE!! dropped:0 overruns:0 frame:0 TX packets:586397 errors:(and maybe here?) dropped:0 overruns:0 I can re-test it if you want (for TX packets i am not shore)
I am now using 2.6.17.3 and i noticed that I'm now capable to stay connected, but also having lots of eth1 errors. eth1 Link encap:Ethernet HWaddr 00:11:80:EB:65:9A inet addr:217.129.133.137 Bcast:217.129.143.255 Mask:255.255.240.0 UP BROADCAST NOTRAILERS RUNNING MULTICAST MTU:1500 Metric:1 RX packets:615602 errors:4027 dropped:0 overruns:0 frame:4027 TX packets:674816 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:1000 RX bytes:319488901 (304.6 Mb) TX bytes:241017064 (229.8 Mb) 4027 and raising. This bug is now 1/2 corrected. I don't know if I should downgrade to 2.6.16 you stay ussing 2.6.17 in this conditions.
After all, it stills crashing my network. I don't know if the fact that now it took a little longer is to take in consideration. Also, should this be moved to Networking category instead Drivers category? (am i alone? lol)
i had the same problem with 2.6.17.5, after a downgrade to 2.6.16.16 it disapeared. 2.6.16.26 seems to have the same problem too.
still not corrected on 2.6.17.17.. please.. can i submit any more usefull information? eth1 Link encap:Ethernet HWaddr 00:11:80:EB:65:9A inet addr:217.129.133.137 Bcast:217.129.143.255 Mask:255.255.240.0 UP BROADCAST NOTRAILERS RUNNING MULTICAST MTU:1500 Metric:1 RX packets:6498 errors:36 dropped:0 overruns:0 frame:36 TX packets:5639 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:1000 RX bytes:2071217 (1.9 Mb) TX bytes:2857804 (2.7 Mb)
I again! I tested A LOT Of kernels versions and i found where the bug happen.. Kernel verion <= 2.6.16.23 is CLEAN (i think that 2.6.16.23 is the last one of 2.6.16 series) But right on kernel >= 2.6.17 I have problems with my modem. So, from 2.6.16.23 to 2.6.17 there is something new that cause this driver to fail. The changelog is very very extensive and I can't figure out wich commit cause this fail. Also, the 2.6.17.9 stills with this bug. I would like to know if i can do something more.
I'm sorry, where i mean 2.6.16.23 is 2.6.16.27, on my last reply. So, after some help, i think i found the commint. --- commit e03d72b99e4027504ada134bf1804d6ea792b206 Author: Adrian Bunk <bunk@stusta.de> Date: Mon Jan 9 18:34:08 2006 -0800 [PATCH] drivers/net/sk98lin/: possible cleanups This patch contains the following possible cleanups: - make needlessly global functions static - remove unused code Signed-off-by: Adrian Bunk <bunk@stusta.de> Cc: Stephen Hemminger <shemminger@osdl.org> Cc: Jeff Garzik <jgarzik@pobox.com> Signed-off-by: Andrew Morton <akpm@osdl.org> Signed-off-by: Jeff Garzik <jgarzik@pobox.com> ---
I don't see anything in this patch that could even remotely cause your problem. Except if there's a persistent bug in the driver that wasn't visible before due to pure luck. Since sk98lin is a driver that will not stay long-term in the kernel, can you check whether the skge driver works for you?
I was just trying to guess wich commit made my modem start to bug.. yes, you're right.. sk98lin is for my network adapter, not for my modem.. I will look once more to changelog...
First, to make sure we are not hunting an already fixed issue: - still present in 2.6.17.9? - still present in 2.6.18-rc4? If you want to find the commit that was causing your problem, you have to do git bisecting: # install git and cogito on your computer # clone the 2.6.17 tree: cg-clone \ git://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-2.6.17.y.git # start bisecting: cd linux-2.6.17.y git bisect start git bisect bad git bisect good v2.6.16 # start round cp /patch/to/.config . make oldconfig make # install kernel, boot it, check whether it's good or bad, then: git bisect [bad|good] # start next round After at about 12 reboots, you'll have found the guilty commit ("... is first bad commit"). More information on git bisecting: man git-bisect
2.6.18-rc4 has the bug OFFTOPIC: and some stupid mouse bug (it jumps on the screen) 2.6.17.9 has also the bug after use git I think i found the bug: dccf4a48d47120a42382ba526f1a0848c13ba2a4 is first bad commit commit dccf4a48d47120a42382ba526f1a0848c13ba2a4 Author: Alan Stern <stern@rowland.harvard.edu> Date: Sat Dec 17 17:58:46 2005 -0500 [PATCH] UHCI: use one QH per endpoint, not per URB This patch (as623) changes the uhci-hcd driver to make it use one QH per device endpoint, instead of a QH per URB as it does now. Numerous areas of the code are affected by this. For example, the distinction between "queued" URBs and non-"queued" URBs no longer exists; all URBs belong to a queue and some just happen to be at the queue's head. Signed-off-by: Alan Stern <stern@rowland.harvard.edu> Signed-off-by: Greg Kroah-Hartman <gregkh@suse.de> :040000 040000 2210205f1df6153ddaa93cf25eb0300df45b5405 c287ccdd16dae4ef945ecc1e3e58a163ba27ebad M drivers -- I will now restart git for 2.6.17.y and try and option named "revert" just to be shore (i think it can help)
Thanks for the bisecting! Reverting this patch in 2.6.17 seems to require reverting some other patches first. But it seems to work the other way: Can you create a non-working kernel by applying this patch agains 2.6.16.27?
YEs.. i tried.. but.. # git clone git://git.kernel.org/pub/scm/linux/kernel/git/stable/linux-2.6.16.y.git git cherry-pick dccf4a48d47120a42382ba526f1a0848c13ba2a4 fatal: Needed a single revision Not a single commit dccf4a48d47120a42382ba526f1a0848c13ba2a4 #
The patch you identified (as623) did contain a bug, and a fix for it was added in 2.6.17.8 and 2.6.18-rc4. So your git-bisect result probably doesn't mean anything -- unless there is a second, unidentified bug still present. Can you provide the contents of /proc/bus/usb/devices (you may need to do "mount -t usbfs none /proc/bus/usb" first)?
@luminoso: The bad commit is not in the 2.6.16 tree. Do a git-show dccf4a48d47120a42382ba526f1a0848c13ba2a4 > /tmp/patch-bad in the 2.6.17 git tree. Then go to the 2.6.16.27 sources (no matter whether they are from a git tree or directly downloaded) and do a patch -p1 < /tmp/patch-bad and compile this kernel.
@Alan: Since both 2.6.17.9 and 2.6.18-rc4 don't work for him, it seems there is a second bug...
Created attachment 8842 [details] Bugfix for uhci-hcd in 2.6.17.8 In fact there must be two bugs: the known bug in the as623 patch plus whatever else is still causing problems. The second bug might not be in uhci-hcd, however. In any case, it would be a mistake to apply as623 and predecessors without also applying the bug-fix patch (attached).
cat /proc/bus/usb/devices T: Bus=05 Lev=00 Prnt=00 Port=00 Cnt=00 Dev#= 1 Spd=12 MxCh= 2 B: Alloc= 0/900 us ( 0%), #Int= 0, #Iso= 0 D: Ver= 1.10 Cls=09(hub ) Sub=00 Prot=00 MxPS=64 #Cfgs= 1 P: Vendor=0000 ProdID=0000 Rev= 2.06 S: Manufacturer=Linux 2.6.16 uhci_hcd S: Product=UHCI Host Controller S: SerialNumber=0000:00:1d.3 C:* #Ifs= 1 Cfg#= 1 Atr=c0 MxPwr= 0mA I: If#= 0 Alt= 0 #EPs= 1 Cls=09(hub ) Sub=00 Prot=00 Driver=hub E: Ad=81(I) Atr=03(Int.) MxPS= 2 Ivl=255ms T: Bus=04 Lev=00 Prnt=00 Port=00 Cnt=00 Dev#= 1 Spd=12 MxCh= 2 B: Alloc= 0/900 us ( 0%), #Int= 0, #Iso= 0 D: Ver= 1.10 Cls=09(hub ) Sub=00 Prot=00 MxPS=64 #Cfgs= 1 P: Vendor=0000 ProdID=0000 Rev= 2.06 S: Manufacturer=Linux 2.6.16 uhci_hcd S: Product=UHCI Host Controller S: SerialNumber=0000:00:1d.2 C:* #Ifs= 1 Cfg#= 1 Atr=c0 MxPwr= 0mA I: If#= 0 Alt= 0 #EPs= 1 Cls=09(hub ) Sub=00 Prot=00 Driver=hub E: Ad=81(I) Atr=03(Int.) MxPS= 2 Ivl=255ms T: Bus=03 Lev=00 Prnt=00 Port=00 Cnt=00 Dev#= 1 Spd=12 MxCh= 2 B: Alloc= 17/900 us ( 2%), #Int= 1, #Iso= 0 D: Ver= 1.10 Cls=09(hub ) Sub=00 Prot=00 MxPS=64 #Cfgs= 1 P: Vendor=0000 ProdID=0000 Rev= 2.06 S: Manufacturer=Linux 2.6.16 uhci_hcd S: Product=UHCI Host Controller S: SerialNumber=0000:00:1d.1 C:* #Ifs= 1 Cfg#= 1 Atr=c0 MxPwr= 0mA I: If#= 0 Alt= 0 #EPs= 1 Cls=09(hub ) Sub=00 Prot=00 Driver=hub E: Ad=81(I) Atr=03(Int.) MxPS= 2 Ivl=255ms T: Bus=03 Lev=01 Prnt=01 Port=00 Cnt=01 Dev#= 2 Spd=12 MxCh= 0 D: Ver= 1.10 Cls=02(comm.) Sub=00 Prot=00 MxPS=32 #Cfgs= 1 P: Vendor=07b2 ProdID=5100 Rev= 1.01 S: Manufacturer=Broadcom Corporation S: Product=USB Cable Modem S: SerialNumber=001180EB659A C:* #Ifs= 2 Cfg#= 1 Atr=e0 MxPwr=100mA I: If#= 0 Alt= 0 #EPs= 1 Cls=02(comm.) Sub=06 Prot=00 Driver=cdc_ether E: Ad=85(I) Atr=03(Int.) MxPS= 8 Ivl=64ms I: If#= 1 Alt= 0 #EPs= 0 Cls=0a(data ) Sub=00 Prot=00 Driver=cdc_ether I: If#= 1 Alt= 1 #EPs= 2 Cls=0a(data ) Sub=00 Prot=00 Driver=cdc_ether E: Ad=81(I) Atr=02(Bulk) MxPS= 64 Ivl=0ms E: Ad=02(O) Atr=02(Bulk) MxPS= 64 Ivl=0ms T: Bus=03 Lev=01 Prnt=01 Port=01 Cnt=02 Dev#= 3 Spd=1.5 MxCh= 0 D: Ver= 1.10 Cls=00(>ifc ) Sub=00 Prot=00 MxPS= 8 #Cfgs= 1 P: Vendor=06da ProdID=0003 Rev= 0.00 S: Manufacturer=OMRON S: Product=87XXUPS C:* #Ifs= 1 Cfg#= 1 Atr=a0 MxPwr=100mA I: If#= 0 Alt= 0 #EPs= 1 Cls=03(HID ) Sub=00 Prot=00 Driver=usbhid E: Ad=81(I) Atr=03(Int.) MxPS= 8 Ivl=20ms T: Bus=02 Lev=00 Prnt=00 Port=00 Cnt=00 Dev#= 1 Spd=12 MxCh= 2 B: Alloc= 0/900 us ( 0%), #Int= 0, #Iso= 0 D: Ver= 1.10 Cls=09(hub ) Sub=00 Prot=00 MxPS=64 #Cfgs= 1 P: Vendor=0000 ProdID=0000 Rev= 2.06 S: Manufacturer=Linux 2.6.16 uhci_hcd S: Product=UHCI Host Controller S: SerialNumber=0000:00:1d.0 C:* #Ifs= 1 Cfg#= 1 Atr=c0 MxPwr= 0mA I: If#= 0 Alt= 0 #EPs= 1 Cls=09(hub ) Sub=00 Prot=00 Driver=hub E: Ad=81(I) Atr=03(Int.) MxPS= 2 Ivl=255ms T: Bus=01 Lev=00 Prnt=00 Port=00 Cnt=00 Dev#= 1 Spd=480 MxCh= 8 B: Alloc= 0/800 us ( 0%), #Int= 0, #Iso= 0 D: Ver= 2.00 Cls=09(hub ) Sub=00 Prot=01 MxPS=64 #Cfgs= 1 P: Vendor=0000 ProdID=0000 Rev= 2.06 S: Manufacturer=Linux 2.6.16 ehci_hcd S: Product=EHCI Host Controller S: SerialNumber=0000:00:1d.7 C:* #Ifs= 1 Cfg#= 1 Atr=e0 MxPwr= 0mA I: If#= 0 Alt= 0 #EPs= 1 Cls=09(hub ) Sub=00 Prot=00 Driver=hub E: Ad=81(I) Atr=03(Int.) MxPS= 2 Ivl=256ms Thanks for all help! I will now work on patches... I will reply soon
I patched 2.6.27 sources with commit dccf4a48d47120a42382ba526f1a0848c13ba2a4 and it was... clean! It should be buggy, right? Then i patched again the same sources with patch "Bugfix for uhci-hcd in 2.6.17.8" and it still clean.. Should I bisect again? I still have shore that this bug IS present in 2.6.17.9 and 2.6.18-rc4, also, I am shore, that it happears from 2.6.16.27 to 2.6.17.
This is why I don't trust git-bisect. Just as an experiment, you could take all the drivers/usb/host/uhci* files from the 2.6.17.9 source tree and copy them into the corresponding 2.6.16.23 directory. I think it should build okay. Then test the resulting 2.6.16.23 kernel with the transplanted uhci-hcd driver. If it works, you'll know the bug is in some other driver.
@Alan: I followed your sugestion but it doesn't work: CC drivers/usb/host/uhci-hcd.o drivers/usb/host/uhci-hcd.c:53:24: error: pci-quirks.h: No such file or directory drivers/usb/host/uhci-hcd.c: In function
Whoops... Looks like you also need to copy drivers/usb/host/pci-quirks.h from 2.6.17.9 into the 2.6.16.23 directory. I forgot about that one.
@Alan: I did what you said and... yes.. the bug is where! =) Now.. what can I do? Will I have this bug corrected?
Okay, good. You have definitely proved there is a bug in the 2.6.17.9 uhci-hcd driver. Before it can be fixed, though, we need to find out exactly where it is. The next step is to use the usbmon facility in 2.6.17.9. Instructions are in the kernel source file Documentation/usb/usbmon.txt. You will need to turn on CONFIG_USB_MON and CONFIG_DEBUG_FS, and you should also set CONFIG_USB_DEBUG. Capture the usbmon data during a test run, and make sure the bug occurs. Then attach it here, along the dmesg log for the same period.
Created attachment 8859 [details] dmesg for bug 6747 from 2.6.17.9 kernel
Created attachment 8860 [details] from /sys/kernel/debug/usbmon/3t from kernel 2.6.17.9
When start using usbmon: eth1 Link encap:Ethernet HWaddr 00:11:80:EB:65:9A inet addr:217.129.133.137 Bcast:217.129.143.255 Mask:255.255.240.0 UP BROADCAST NOTRAILERS RUNNING MULTICAST MTU:1500 Metric:1 RX packets:564 errors:3 dropped:0 overruns:0 frame:3 TX packets:336 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:1000 RX bytes:233158 (227.6 Kb) TX bytes:37243 (36.3 Kb) At the end: eth1 Link encap:Ethernet HWaddr 00:11:80:EB:65:9A inet addr:217.129.133.137 Bcast:217.129.143.255 Mask:255.255.240.0 UP BROADCAST NOTRAILERS RUNNING MULTICAST MTU:1500 Metric:1 RX packets:1908 errors:12 dropped:0 overruns:0 frame:12 TX packets:1565 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:1000 RX bytes:1588133 (1.5 Mb) TX bytes:166834 (162.9 Kb) #cat /proc/bus/usb/devices T: Bus=03 Lev=01 Prnt=01 Port=00 Cnt=01 Dev#= 2 Spd=12 MxCh= 0 D: Ver= 1.10 Cls=02(comm.) Sub=00 Prot=00 MxPS=32 #Cfgs= 1 P: Vendor=07b2 ProdID=5100 Rev= 1.01 S: Manufacturer=Broadcom Corporation S: Product=USB Cable Modem S: SerialNumber=001180EB659A C:* #Ifs= 2 Cfg#= 1 Atr=e0 MxPwr=100mA I: If#= 0 Alt= 0 #EPs= 1 Cls=02(comm.) Sub=06 Prot=00 Driver=cdc_ether E: Ad=85(I) Atr=03(Int.) MxPS= 8 Ivl=64ms I: If#= 1 Alt= 0 #EPs= 0 Cls=0a(data ) Sub=00 Prot=00 Driver=cdc_ether I: If#= 1 Alt= 1 #EPs= 2 Cls=0a(data ) Sub=00 Prot=00 Driver=cdc_ether E: Ad=81(I) Atr=02(Bulk) MxPS= 64 Ivl=0ms E: Ad=02(O) Atr=02(Bulk) MxPS= 64 Ivl=0ms If I can do anything more just ask!
I don't see any indication of problems in the usbmon log, and none in the dmesg log except possibly the line saying "eth1: rxqlen 0 --> 4". Are you sure that the problem occurred and the interface froze while you were collecting the logs? If it did, try again and after the freeze occurs do this: echo 2 >/sys/module/uhci_hcd/parameters/debug and then attach a copy of /sys/kernel/debug/uhci/0000:00:1d.1. By the way, only the last hundred lines or so of the usbmon log are important. You can chop off the rest.
Hi.. I performed what you said with 2.6.17.10 kernel. But with this kernel eth1 doesn't freeze. But, still making errors (saw in infconfig) usbmon here: ftp://ftp.ua.pt/incoming/luminoso/2.mon.out.tar.bz2 do you need more logging?
You're talking about two different problems: the framing errors and the interface freezing. Freezing is almost certainly caused by a bug in uhci-hcd, and that's what I want to fix. Now go back over your earlier tests. With which versions of the driver does the interface freeze? Did it freeze when you transplanted the 2.6.17 driver back to 2.6.16? Does it freeze under 2.6.18-rc4? The framing errors you see in ifconfig are a completely different matter. They have nothing to do with uhci-hcd. If you didn't see any of these errors in 2.6.16, it may be a result of changes in the USB network driver. (Unless you can prove that the framing errors never occur in vanilla 2.6.16 but they do occur when you transplant the 2.6.17 version of uhci-hcd to 2.6.16. If that happens then I am wrong.) Adrian Bunk or someone else can tell you more about the framing errors than I can. For the time being, let's just work on the freezing issue.
Point of situation: -On 2.6.17.9 eth1 still freeze, but not so frequently. -Also, when I copy uhci drivers from 2.6.17.* to 2.6.16.27 ifconfig starts to make errors, but i didn't test if eth1 died. -Ifconfig errors was always present with freezing issuse. It's a quite frustrating.. it's really random.. When I started this bug it happens in 5 minutes after computer start.. now is taking more than one day long. (it freezes yesterday.. but i decided:let's use a clean 2.6.17.9! It's was giving framing erros but isn't freezing.. so why not use 2.6.17.{9,10}? I deleted all sources and unpacked clean tarball again.. when i was not debugging it happend.. now I am again debugging and waiting with 2.6.17.10) I am now again waiting.. I hope report a new usefull usbmon debug soon.
Don't forget to report the results from the test in comment #31 also!
Created attachment 8881 [details] Try to detect changes to rx_frame_errors Here's a patch for 2.6.17.10 which I hope will tell us exactly when the RX frame-error count changes. If it works, it will add some very disruptive messages to the system log every time the value is increased. The patch will create a new configuration option, CONFIG_KWATCH. You will have to turn that on and then rebuild the kernel. Unfortunately I can't test the patch because I don't have a USB ethernet interface.
Created attachment 8886 [details] logs generated from patch Hi.. I applied the patch sucefully on 2.6.17.10 sources but I am getting problems on torning on 'CONFIG_KWATCH=y' on .config. After editing it and performing "make clean && make" it simply disapear from .config. I think it isn't enabled. Anyway, I make a tarball of /var/log. Maybe logs are there and I just didn't see. This logs are clean. I deleted all of them before restart. Offtopic: Even when using clean tarballs my kernel is named 2.6.17.10-g9e88eb4d-dirty. How do i get rid of "-g9e88eb4d-dirty"?
The option depends on CONFIG_DEBUG_KERNEL - you have to also set this option to y.
Yes, I forgot to mention CONFIG_DEBUG_KERNEL because I _always_ keep that option turned on for my work! I don't know where the "-g9e88eb4d-dirty" comes from. Check the topmost Makefile to make sure it's not there. It might have something to do with the way you install the new kernel for booting. What do you get from "uname -a"?
Created attachment 8895 [details] 2nd log generated from patch Hi.. There it is new logs generated with patch, now with CONFIG_KWATCH=y working. I also included .config of current kernel. I checked if 'ifconfig' is showing errors and it look like this before making the tarball: eth1 Link encap:Ethernet HWaddr 00:11:80:EB:65:9A inet addr:217.129.133.137 Bcast:217.129.143.255 Mask:255.255.240.0 UP BROADCAST NOTRAILERS RUNNING MULTICAST MTU:1500 Metric:1 RX packets:218676 errors:1225 dropped:0 overruns:0 frame:1225 TX packets:242498 errors:0 dropped:0 overruns:0 carrier:0 collisions:0 txqueuelen:1000 RX bytes:78791751 (75.1 Mb) TX bytes:89479444 (85.3 Mb) About the "dirty" it started to appear since Comment #18. From that patch on all new clean tarballs have "dirty" on kernel name. uname -a Linux khona4 2.6.17.10-g9e88eb4d-dirty #6 SMP PREEMPT Tue Aug 29 13:00:40 WEST 2006 i686 Intel(R) Pentium(R) 4 CPU 2.80GHz GenuineIntel GNU/Linux
Created attachment 8896 [details] usbnet rx-frame-errors test patch I can't tell what's going on. If that patch was working correctly, we should have seen any change to the rx frame error counter. Try this patch instead of 8881. Attaching kern.log alone will be sufficient. The "dirty" thing might be a leftover from your attempt at using git-bisect. maybe something didn't get cleaned up all the way.
Created attachment 8899 [details] kernel.log output Here it is.. I think now it worked. =)
It did work. At least, the error counter remained at 0. But I didn't change anything! All the patch does is print out the value of the counter whenever a packet is sent or received. I don't know. Try running with and without the patch (all you have to do is keep two separate copies of usbnet.ko, one built with the patch and one without, and insmod one or the other to switch between them). See if you can get the error counter to change from 0.
Created attachment 8908 [details] Log usbnet errors I just got a clue, and now I understand a lot better what's going on. Forget about those previous patches and use this one instead. It will put a message in the kernel log every time one of those network errors occurs. This way we'll be able to see exactly what sort of error it is.
Created attachment 8910 [details] output of patch8908 Huuuuugeeeee thanks! I spend all day patching and restarting, trying to find anything new but nothing. Where it is the results (thanks) of the 8908patch. I will keep it running just for in case you need more.
Created attachment 8912 [details] Print rx header You don't have to keep running that patch; what you posted is good enough. It's now clear where the errors are coming from. The driver is receiving packets that are shorter than 14 bytes (the minimum for Ethernet). I have no idea why this happens only under 2.6.17 and not under 2.6.16. Here's a different patch to try. This one will log the headers in first 8 packets received and then after that the first 8 error packets. I hope that seeing the header contents will give a clue as to what's wrong.
Created attachment 8913 [details] results of patch8912 here it is... it slowed down after 45mins +/-
Created attachment 8920 [details] Bugfix for uhci-hcd: skip to end of TD list during toggle fixup I think we found it. This patch fixes a definite bug in uhci-hcd, and it's almost certainly the bug causing your problems. Try it and see, but keep the previous patch installed as well just in case.
Created attachment 8921 [details] dmesg output IT WORKS!!!!!!!!!!! :) dmegs still prints attached messages, but only at startup. Maybe it's revelant: yesterday, when testing patch8912 even it stoped, like today, showing new information on kernel.log errors still keep raising. But now no errors on ifconfig. Works perfect! :)
Patch 8912 was designed to stop after printing 16 lines (or after 8 lines if there are no errors). I didn't need any more debugging info than that, and so of course you saw the errors continuing to increase even after the patch had stopped printing. You can drop 8912 now. It isn't needed, because 8920 fixes the problem. I'll submit 8920 for inclusion in the 2.6.17.x stable series and also for 2.6.18. If you want, you can apply 8920 to 2.6.18-rc4 (or -rc5) right now. It should apply to that kernel with no changes and it should fix exactly the same problem. Since everything is now working, I'll close this bug report.
Ok, thanks for all!!! (should i also report mouse-jumping-bug that i found (i'm a sad sad man lol) on 2.6.18-rc4?)
If it's still present in 2.6.18-rc5, go ahead and report it. But start a separate Bugzilla entry.
Here it is: bug 7008 Once again, thanks for everything.