Bug 11721 - after upgrade to 2.6.27 i cannot navigate
Summary: after upgrade to 2.6.27 i cannot navigate
Status: CLOSED CODE_FIX
Alias: None
Product: Networking
Classification: Unclassified
Component: Other (show other bugs)
Hardware: All Linux
: P1 blocking
Assignee: networking_netfilter-iptables@kernel-bugs.osdl.org
URL:
Keywords:
Depends on:
Blocks: Regressions-2.6.26
  Show dependency tree
 
Reported: 2008-10-08 08:08 UTC by Aldo Maggi
Modified: 2008-11-08 22:04 UTC (History)
3 users (show)

See Also:
Kernel Version: 2.6.27-rc8
Subsystem:
Regression: Yes
Bisected commit-id:


Attachments
output of the commands you asked for except that of tcpdump (3.07 KB, application/octet-stream)
2008-10-08 10:19 UTC, Aldo Maggi
Details
output of tcpdump (68.74 KB, application/octet-stream)
2008-10-08 10:21 UTC, Aldo Maggi
Details
output of tcpdump while apt-get was in execution (19.78 KB, application/octet-stream)
2008-10-08 10:22 UTC, Aldo Maggi
Details
config-2.6.27-rc8.200810071002 (89.65 KB, application/octet-stream)
2008-10-08 13:22 UTC, Aldo Maggi
Details
config-2.6.26-1-686 (89.57 KB, application/octet-stream)
2008-10-08 23:34 UTC, Aldo Maggi
Details
tcpdump -i eth0 -v using kernel 2.6.26.6 (24.53 KB, application/octet-stream)
2008-10-17 10:08 UTC, Aldo Maggi
Details
tcpdump -i eth2 -v on the home server/gateway (7.96 KB, application/octet-stream)
2008-10-17 10:09 UTC, Aldo Maggi
Details
tcpdump -i eth0 -v using kernel 2.6.27-rc1-git1 (7.84 KB, application/octet-stream)
2008-10-17 10:10 UTC, Aldo Maggi
Details
tcpdump -i eth2 -v on the home server/gateway (25.13 KB, application/octet-stream)
2008-10-17 10:13 UTC, Aldo Maggi
Details
ifconfig, route -n, and iptables rules on 192.168.1.1 (topolino) home server/gw (13.18 KB, application/octet-stream)
2008-10-18 07:42 UTC, Aldo Maggi
Details
ifconfig, route -n, on 192.168.1.3 (paperino) (1.84 KB, application/octet-stream)
2008-10-18 07:43 UTC, Aldo Maggi
Details
tcpdump on the wan eth on the home server/gw (29.73 KB, application/octet-stream)
2008-10-18 07:45 UTC, Aldo Maggi
Details
tcpdump on the wan eth on the home server/gw (11.71 KB, application/octet-stream)
2008-10-18 07:47 UTC, Aldo Maggi
Details
tcpdump -i eth0 -nXX -c3 'dst port 80 and tcp-syn != 0' on topolino (1.19 KB, application/octet-stream)
2008-10-18 13:23 UTC, Aldo Maggi
Details
tcpdump -i eth0 -nXX -c3 'dst port 80 and tcp-syn != 0' on topolino (1.19 KB, application/octet-stream)
2008-10-18 13:25 UTC, Aldo Maggi
Details
tcpdump -i eth0 -nXX -c3 'dst port 80 and tcp-syn != 0' on topolino (1.30 KB, application/octet-stream)
2008-10-18 13:27 UTC, Aldo Maggi
Details
kernel 2.6.26.6 is running on paperino and tcp_window_scaling, tcp_timestamps, tcp_sack are set to "1" (1.42 KB, application/octet-stream)
2008-10-20 12:37 UTC, Aldo Maggi
Details
kernel 2.6.26.6 is running on paperino and tcp_window_scaling, tcp_timestamps, tcp_sack are set to "0" (1.19 KB, application/octet-stream)
2008-10-20 12:38 UTC, Aldo Maggi
Details
kernel 2.6.27-rc1-git1 is running on paperino and tcp_window_scaling, tcp_timestamps, tcp_sack are set to "1" (1.41 KB, application/octet-stream)
2008-10-20 12:40 UTC, Aldo Maggi
Details
kernel 2.6.27-rc1-git1 is running on paperino and tcp_window_scaling, tcp_timestamps, tcp_sack are set to "0" (1.19 KB, application/octet-stream)
2008-10-20 12:40 UTC, Aldo Maggi
Details
Output of tcpdump scenarios; different kernels and tcp /proc settings (6.21 KB, text/plain)
2008-11-08 22:04 UTC, Dean Holland
Details

Description Aldo Maggi 2008-10-08 08:08:09 UTC
Latest working kernel version: 2.6.26
Earliest failing kernel version: 2.6.27-rc8
Distribution: debian, ubuntu
Hardware Environment:

less /proc/cpuinfo
processor       : 0
vendor_id       : AuthenticAMD
cpu family      : 6
model           : 6
model name      : AMD Athlon(tm) XP 1700+
stepping        : 2
cpu MHz         : 1460.519
cache size      : 256 KB
fdiv_bug        : no
hlt_bug         : no
f00f_bug        : no
coma_bug        : no
fpu             : yes
fpu_exception   : yes
cpuid level     : 1
wp              : yes
flags           : fpu vme de pse tsc msr pae mce cx8 sep mtrr pge mca cmov pat pse36 mmx fxsr sse syscall mp mmxext 3dnowext 3dnow up
bogomips        : 2922.96
clflush size    : 32
power management: ts

/ver_linux

Linux paperino 2.6.27-rc8.200810071002 #1 SMP Tue Oct 7 10:17:39 CEST 2008 i686 GNU/Linux
 
Gnu C                  4.3.2
Gnu make               3.81
binutils               2.18.0.20080103
util-linux             2.13.1.1
mount                  2.13.1.1
module-init-tools      3.4
e2fsprogs              1.41.0
reiserfsprogs          3.6.19
Linux C Library        2.7
Dynamic linker (ldd)   2.7
Procps                 3.2.7
Net-tools              1.60
Kbd                    [opzione...]
Console-tools          0.2.3
Sh-utils               6.10
udev                   125
wireless-tools         29
Modules Loaded         rfcomm l2cap bluetooth ppdev lp video output ac battery cpufreq_powersave cpufreq_stats cpufreq_userspa
ce cpufreq_ondemand cpufreq_conservative freq_table ipv6 ext3 jbd mbcache fuse loop snd_ens1371 gameport snd_seq_dummy snd_seq
_oss snd_seq_midi snd_seq_midi_event snd_seq snd_rawmidi snd_seq_device arc4 ecb crypto_blkcipher snd_ac97_codec ac97_bus snd_
pcm_oss snd_mixer_oss snd_pcm psmouse serio_raw snd_timer snd pcspkr rtl8180 mac80211 soundcore snd_page_alloc eeprom_93cx6 cf
g80211 parport_pc parport button i2c_sis96x i2c_sis630 shpchp i2c_core pci_hotplug sis_agp agpgart evdev reiserfs sg sr_mod cd
rom sd_mod ide_pci_generic sis5513 ide_core pata_sis floppy ata_generic libata ehci_hcd uhci_hcd sis900 3c59x mii scsi_mod doc
k ohci_hcd usbcore thermal processor fan thermal_sys

cat /proc/ioports

0000-001f : dma1
0020-0021 : pic1
0040-0043 : timer0
0050-0053 : timer1
0060-0060 : keyboard
0064-0064 : keyboard
0070-0071 : rtc0
0080-008f : dma page reg
00a0-00a1 : pic2
00c0-00df : dma2
00f0-00ff : fpu
0170-0177 : 0000:00:02.5
  0170-0177 : pata_sis
01f0-01f7 : 0000:00:02.5
  01f0-01f7 : pata_sis
02f8-02ff : serial
0376-0376 : 0000:00:02.5
  0376-0376 : pata_sis
0378-037a : parport0
03c0-03df : vga+
03f2-03f5 : floppy
03f6-03f6 : 0000:00:02.5
  03f6-03f6 : pata_sis
03f7-03f7 : floppy DIR
03f8-03ff : serial
0800-0803 : ACPI PM1a_EVT_BLK
0804-0805 : ACPI PM1a_CNT_BLK
0808-080b : ACPI PM_TMR
0820-0823 : ACPI GPE0_BLK
0830-0833 : ACPI GPE1_BLK
0c00-0c1f : 0000:00:02.1
  0c00-0c1f : sis96x_smbus
0cf8-0cff : PCI conf1
c800-c81f : 0000:00:13.0
  c800-c81f : uhci_hcd
cc00-cc1f : 0000:00:13.1
  cc00-cc1f : uhci_hcd
d000-d03f : 0000:00:11.0
  d000-d03f : Ensoniq AudioPCI
d400-d47f : 0000:00:0d.0
d800-d8ff : 0000:00:0b.0
  d800-d8ff : rtl8180
dc00-dcff : 0000:00:03.0
  dc00-dcff : sis900
ff00-ff0f : 0000:00:02.5
  ff00-ff0f : pata_sis

cat /proc/iomem

00000000-0009fbff : System RAM
0009fc00-0009ffff : reserved
000a0000-000bffff : Video RAM area
000c0000-000cf3ff : Video ROM
000cf800-000d77ff : Adapter ROM
000f0000-000fffff : reserved
  000f0000-000fffff : System ROM
00100000-2ffeffff : System RAM
  00100000-002cb2d7 : Kernel code
  002cb2d8-00387307 : Kernel data
  003d3000-00436b47 : Kernel bss
  01604000-01604fff : Local APIC
2fff0000-2fff7fff : ACPI Tables
2fff8000-2fffffff : ACPI Non-volatile Storage
bdc00000-cdcfffff : PCI Bus 0000:01
  c0000000-c7ffffff : 0000:01:00.0
cde00000-cfefffff : PCI Bus 0000:01
  ce000000-ceffffff : 0000:01:00.0
  cfee0000-cfefffff : 0000:01:00.0
cffa0000-cffbffff : 0000:00:0d.0
cffc0000-cffdffff : 0000:00:03.0
cfffca00-cfffcaff : 0000:00:13.2
  cfffca00-cfffcaff : ehci_hcd
cfffcb80-cfffcbff : 0000:00:0d.0
cfffcc00-cfffcfff : 0000:00:0b.0
  cfffcc00-cfffcfff : rtl8180
cfffd000-cfffdfff : 0000:00:03.0
  cfffd000-cfffdfff : sis900
cfffe000-cfffefff : 0000:00:02.2
  cfffe000-cfffefff : ohci_hcd
cffff000-cfffffff : 0000:00:02.3
  cffff000-cfffffff : ohci_hcd
d0000000-d3ffffff : 0000:00:00.0
fec00000-fec00fff : reserved
fee00000-fee00fff : reserved
ffee0000-ffefffff : reserved
fffc0000-ffffffff : reserved

lspci -vvv

00:00.0 Host bridge: Silicon Integrated Systems [SiS] 735 Host (rev 01)
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort+ >SERR- <PERR- INTx-
        Latency: 32
        Region 0: Memory at d0000000 (32-bit, non-prefetchable) [size=64M]
        Capabilities: [c0] AGP version 2.0
                Status: RQ=32 Iso- ArqSz=0 Cal=0 SBA+ ITACoh- GART64- HTrans- 64bit- FW- AGP3- Rate=x1,x2,x4
                Command: RQ=1 ArqSz=0 Cal=0 SBA- AGP- GART64- 64bit- FW- Rate=<none>
        Kernel driver in use: agpgart-sis
        Kernel modules: sis-agp

00:01.0 PCI bridge: Silicon Integrated Systems [SiS] Virtual PCI-to-PCI bridge (AGP)
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx-/ver_linux

Linux paperino 2.6.27-rc8.200810071002 #1 SMP Tue Oct 7 10:17:39 CEST 2008 i686 GNU/Linux
 
Gnu C                  4.3.2
Gnu make               3.81
binutils               2.18.0.20080103
util-linux             2.13.1.1
mount                  2.13.1.1
module-init-tools      3.4
e2fsprogs              1.41.0
reiserfsprogs          3.6.19
Linux C Library        2.7
Dynamic linker (ldd)   2.7
Procps                 3.2.7
Net-tools              1.60
Kbd                    [opzione...]
Console-tools          0.2.3
Sh-utils               6.10
udev                   125
wireless-tools         29
Modules Loaded         rfcomm l2cap bluetooth ppdev lp video output ac battery cpufreq_powersave cpufreq_stats cpufreq_userspa
ce cpufreq_ondemand cpufreq_conservative freq_table ipv6 ext3 jbd mbcache fuse loop snd_ens1371 gameport snd_seq_dummy snd_seq
_oss snd_seq_midi snd_seq_midi_event snd_seq snd_rawmidi snd_seq_device arc4 ecb crypto_blkcipher snd_ac97_codec ac97_bus snd_
pcm_oss snd_mixer_oss snd_pcm psmouse serio_raw snd_timer snd pcspkr rtl8180 mac80211 soundcore snd_page_alloc eeprom_93cx6 cf
g80211 parport_pc parport button i2c_sis96x i2c_sis630 shpchp i2c_core pci_hotplug sis_agp agpgart evdev reiserfs sg sr_mod cd
rom sd_mod ide_pci_generic sis5513 ide_core pata_sis floppy ata_generic libata ehci_hcd uhci_hcd sis900 3c59x mii scsi_mod doc
k ohci_hcd usbcore thermal processor fan thermal_sys

        Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 64
        Bus: primary=00, secondary=01, subordinate=01, sec-latency=64
        I/O behind bridge: 0000f000-00000fff
        Memory behind bridge: cde00000-cfefffff
        Prefetchable memory behind bridge: bdc00000-cdcfffff
        Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort+ <SERR- <PERR-
        BridgeCtl: Parity- SERR+ NoISA- VGA+ MAbort- >Reset- FastB2B-
                PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn-
        Kernel modules: shpchp

00:02.0 ISA bridge: Silicon Integrated Systems [SiS] SiS85C503/5513 (LPC Bridge)
        Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 0
        Kernel modules: i2c-sis630

00:02.1 SMBus: Silicon Integrated Systems [SiS] SiS961/2 SMBus Controller
        Control: I/O+ Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Interrupt: pin B routed to IRQ 0
        Region 4: I/O ports at 0c00 [size=32]
        Kernel driver in use: sis96x_smbus
        Kernel modules: i2c-sis96x

00:02.2 USB Controller: Silicon Integrated Systems [SiS] USB 1.1 Controller (rev 07) (prog-if 10)
        Subsystem: Silicon Integrated Systems [SiS] USB 1.1 Controller
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx-
        Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 64 (20000ns max), Cache Line Size: 32 bytes
        Interrupt: pin D routed to IRQ 5
        Region 0: Memory at cfffe000 (32-bit, non-prefetchable) [size=4K]
        Kernel driver in use: ohci_hcd
        Kernel modules: ohci-hcd

00:02.3 USB Controller: Silicon Integrated Systems [SiS] USB 1.1 Controller (rev 07) (prog-if 10)
        Subsystem: Silicon Integrated Systems [SiS] USB 1.1 Controller
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx-
        Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 64 (20000ns max), Cache Line Size: 32 bytes
        Interrupt: pin A routed to IRQ 10
        Region 0: Memory at cffff000 (32-bit, non-prefetchable) [size=4K]
        Kernel driver in use: ohci_hcd
        Kernel modules: ohci-hcd

00:02.5 IDE interface: Silicon Integrated Systems [SiS] 5513 [IDE] (rev d0) (prog-if 80 [Master])
        Subsystem: Silicon Integrated Systems [SiS] 5513 [IDE]
        Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap- 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 128
        Region 0: [virtual] Memory at 000001f0 (32-bit, non-prefetchable) [disabled] [size=8]
        Region 1: [virtual] Memory at 000003f0 (type 3, non-prefetchable) [disabled] [size=1]
        Region 2: [virtual] Memory at 00000170 (32-bit, non-prefetchable) [disabled] [size=8]
        Region 3: [virtual] Memory at 00000370 (type 3, non-prefetchable) [disabled] [size=1]
        Region 4: I/O ports at ff00 [size=16]
        Kernel driver in use: pata_sis
        Kernel modules: pata_sis, sis5513

00:03.0 Ethernet controller: Silicon Integrated Systems [SiS] SiS900 PCI Fast Ethernet (rev 90)
        Subsystem: Silicon Integrated Systems [SiS] SiS900 PCI Fast Ethernet
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 64 (13000ns min, 2750ns max)
        Interrupt: pin A routed to IRQ 10
        Region 0: I/O ports at dc00 [size=256]

       Region 1: Memory at cfffd000 (32-bit, non-prefetchable) [size=4K]
        Expansion ROM at cffc0000 [disabled] [size=128K]
        Capabilities: [40] Power Management version 2
                Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0+,D1+,D2+,D3hot+,D3cold+)
                Status: D0 PME-Enable- DSel=0 DScale=0 PME-
        Kernel driver in use: sis900
        Kernel modules: sis900

00:0b.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8185 IEEE 802.11a/b/g Wireless LAN Controller (rev 20)
        Subsystem: Realtek Semiconductor Co., Ltd. RTL-8185 IEEE 802.11a/b/g Wireless LAN Controller
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 64 (8000ns min, 16000ns max), Cache Line Size: 64 bytes
        Interrupt: pin A routed to IRQ 11
        Region 0: I/O ports at d800 [size=256]
        Region 1: Memory at cfffcc00 (32-bit, non-prefetchable) [size=1K]
        Capabilities: [50] Power Management version 2
                Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=375mA PME(D0-,D1+,D2+,D3hot+,D3cold+)
                Status: D0 PME-Enable- DSel=0 DScale=0 PME-
        Kernel driver in use: rtl8180
        Kernel modules: rtl8180

00:0d.0 Ethernet controller: 3Com Corporation 3c905B 100BaseTX [Cyclone] (rev 30)
        Subsystem: 3Com Corporation 3c905B 100BaseTX [Cyclone]
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 64 (2500ns min, 2500ns max), Cache Line Size: 32 bytes
        Interrupt: pin A routed to IRQ 10
        Region 0: I/O ports at d400 [size=128]
        Region 1: Memory at cfffcb80 (32-bit, non-prefetchable) [size=128]
        Expansion ROM at cffa0000 [disabled] [size=128K]
        Capabilities: [dc] Power Management version 1
                Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA PME(D0-,D1+,D2+,D3hot+,D3cold-)
                Status: D0 PME-Enable- DSel=0 DScale=0 PME-
        Kernel driver in use: 3c59x
        Kernel modules: 3c59x

00:11.0 Multimedia audio controller: Ensoniq ES1371 [AudioPCI-97] (rev 06)
        Subsystem: Ensoniq ES1371 [AudioPCI-97]
        Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR+ FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=slow >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 64 (3000ns min, 32000ns max)
        Interrupt: pin A routed to IRQ 11
        Region 0: I/O ports at d000 [size=64]
        Capabilities: [dc] Power Management version 1
                Flags: PMEClk- DSI+ D1- D2+ AuxCurrent=0mA PME(D0+,D1-,D2+,D3hot+,D3cold-)
                Status: D0 PME-Enable- DSel=0 DScale=0 PME-
        Kernel driver in use: ENS1371
        Kernel modules: snd-ens1371

00:13.0 USB Controller: VIA Technologies, Inc. VT82xxxxx UHCI USB 1.1 Controller (rev 50)
        Subsystem: First International Computer, Inc. Device 1234
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 64, Cache Line Size: 32 bytes
        Interrupt: pin A routed to IRQ 11
        Region 4: I/O ports at c800 [size=32]
        Capabilities: [80] Power Management version 2
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=375mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
                Status: D0 PME-Enable- DSel=0 DScale=0 PME-
        Kernel driver in use: uhci_hcd
        Kernel modules: uhci-hcd

00:13.1 USB Controller: VIA Technologies, Inc. VT82xxxxx UHCI USB 1.1 Controller (rev 50)
        Subsystem: First International Computer, Inc. Device 1234
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 64, Cache Line Size: 32 bytes
        Interrupt: pin B routed to IRQ 11
        Region 4: I/O ports at cc00 [size=32]
        Capabilities: [80] Power Management version 2
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=375mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
                Status: D0 PME-Enable- DSel=0 DScale=0 PME-
        Kernel driver in use: uhci_hcd
        Kernel modules: uhci-hcd

00:13.2 USB Controller: VIA Technologies, Inc. USB 2.0 (rev 51) (prog-if 20)
        Subsystem: First International Computer, Inc. Device 1234
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 64, Cache Line Size: 64 bytes
        Interrupt: pin C routed to IRQ 10

        Region 0: Memory at cfffca00 (32-bit, non-prefetchable) [size=256]
        Capabilities: [80] Power Management version 2
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=375mA PME(D0+,D1-,D2-,D3hot+,D3cold+)
                Status: D0 PME-Enable- DSel=0 DScale=0 PME-
        Kernel driver in use: ehci_hcd
        Kernel modules: ehci-hcd

01:00.0 VGA compatible controller: nVidia Corporation NV34 [GeForce FX 5200] (rev a1)
        Subsystem: ASUSTeK Computer Inc. Device 80df
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx-
        Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx-
        Latency: 64 (1250ns min, 250ns max)
        Interrupt: pin A routed to IRQ 11
        Region 0: Memory at ce000000 (32-bit, non-prefetchable) [size=16M]
        Region 1: Memory at c0000000 (32-bit, prefetchable) [size=128M]
        Expansion ROM at cfee0000 [disabled] [size=128K]
        Capabilities: [60] Power Management version 2
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0-,D1-,D2-,D3hot-,D3cold-)
                Status: D0 PME-Enable- DSel=0 DScale=0 PME-
        Capabilities: [44] AGP version 3.0
                Status: RQ=32 Iso- ArqSz=0 Cal=0 SBA+ ITACoh- GART64- HTrans- 64bit- FW+ AGP3- Rate=x1,x2,x4
                Command: RQ=1 ArqSz=0 Cal=0 SBA- AGP- GART64- 64bit- FW- Rate=<none>
        Kernel modules: nvidiafb


cat /proc/scsi/scsi

Attached devices:
Host: scsi0 Channel: 00 Id: 00 Lun: 00
  Vendor: ATA      Model: SAMSUNG SV0842D  Rev: MB10
  Type:   Direct-Access                    ANSI  SCSI revision: 05
Host: scsi0 Channel: 00 Id: 01 Lun: 00
  Vendor: ATA      Model: Maxtor 6Y080L0   Rev: YAR4
  Type:   Direct-Access                    ANSI  SCSI revision: 05
Host: scsi1 Channel: 00 Id: 00 Lun: 00
  Vendor: HL-DT-ST Model: DVD-RAM GSA-H55N Rev: 1.02
  Type:   CD-ROM                           ANSI  SCSI revision: 05
Host: scsi1 Channel: 00 Id: 01 Lun: 00
  Vendor: HL-DT-ST Model: DVD-ROM GDR8163B Rev: 0L23
  Type:   CD-ROM                           ANSI  SCSI revision: 05

 
a couple of weeks ago i updated ubuntu intrepid and kernel 2.6.27 was installed.
once i rebooted i noticed i could "ping" and "host" but could not navigate internet via firefox, w3m, etc or update.
i've thought that an ubuntu bug was the reason of it, so i filed a bug with ubuntu, but no solution being found i've decided to download 2.6.26 source from kernel.org, patch it with 2.6.27-rc8 prepatch, run make oldconfig and compile my own kernel. in make oldconfig i tried both accepting the defaults which were proposed and saying "no" , in both cases after having conmpiled the kernel i still experienced the same problems as with ubuntu intrepid 2.6.27 kernel.
since i have debian lenny on another partition of the hd in the same machine i've tried there as well following the same procedure as in the case of ubuntu intrepid, but i've got the same results, i can ping and host but cannot navigate internet.
my home lan is made up of a modem, packets are sent to my home server which routes them by means of iptables to the other three home pc's, the system is wired, no wi-fi involved. 
i experienced the above problems in the two pc's where i tried the above attempts.   
if i launch iptraf in the home server i can see that udp requests to the dns servers from the machine where intrepid is installed receive answers (this is the reason why "host" and "ping" work, i think) while in the case of tcp connections the syn packet goes through but it does not receive ack.

needless to say that if i boot with 2.6.26 kernel everything works flawlessly.

thanks
aldo
Comment 1 Patrick McHardy 2008-10-08 08:13:19 UTC
There's no indication of netfilter being involved here, not a single netfilter module loaded. So this appears to be a generic networking or network driver bug (please reclassify to networking for now).

Some information that will most likely be needed:

- ip -s link list
- ip route
- ip addr
- tcpdump of the interface the traffic is going through

Thanks.
Comment 2 Aldo Maggi 2008-10-08 10:13:16 UTC
Il giorno Wed,  8 Oct 2008 08:13:19 -0700 (PDT)
bugme-daemon@bugzilla.kernel.org ha scritto:

> http://bugzilla.kernel.org/show_bug.cgi?id=11721
> 
> 
> 
> 
> 
> ------- Comment #1 from kaber@trash.net  2008-10-08 08:13 -------
> There's no indication of netfilter being involved here, not a single
> netfilter module loaded. So this appears to be a generic networking
> or network driver bug (please reclassify to networking for now).
> 
> Some information that will most likely be needed:
> 
> - ip -s link list
> - ip route
> - ip addr
> - tcpdump of the interface the traffic is going through
> 
> Thanks.
> 


please find herewith attached the following files:

bug11721_aggiornamenti containing:
- ip -s link list
- ip route
- ip addr

tcpdump_output containing the output of:
- tcpdump -i eth0 -v

tcpdump_output_during_apt-get containing the output of:
- tcpdump -i eth0 -v
while apt-get update was in execution

thanks
aldo


 
Comment 3 Aldo Maggi 2008-10-08 10:19:32 UTC
Created attachment 18213 [details]
output of the commands you asked for except that of tcpdump
Comment 4 Aldo Maggi 2008-10-08 10:21:25 UTC
Created attachment 18214 [details]
output of tcpdump
Comment 5 Aldo Maggi 2008-10-08 10:22:58 UTC
Created attachment 18215 [details]
output of tcpdump while apt-get was in execution
Comment 6 Patrick McHardy 2008-10-08 10:28:10 UTC
I can't spot anything not working in those dumps (I'm in a hurry though and might have missed it). Are those dumps from a failing session?

Thanks.
Comment 7 Aldo Maggi 2008-10-08 10:42:02 UTC
yes they are.
in fact ping google.it works host google.it works as well, w3m google.it doesn't , neither does apt-get update.

bear with me if i repeat what i wrote in my first msg: ... if i launch iptraf in the home server i can see that udp requests to the dns
servers from the machine where intrepid is installed receive answers (this is
the reason why "host" and "ping" work, i think) while in the case of tcp
connections the syn packet goes through but it does not receive ack.
needless to say that if i boot with 2.6.26 kernel everything works flawlessly.
Comment 8 Rafael J. Wysocki 2008-10-08 12:38:58 UTC
Please attach .config from 2.6.27-rc.
Comment 9 Aldo Maggi 2008-10-08 13:22:37 UTC
Created attachment 18222 [details]
config-2.6.27-rc8.200810071002
Comment 10 Rafael J. Wysocki 2008-10-08 14:55:29 UTC
And the one from 2.6.26 (ie. known good), please.
Comment 11 Aldo Maggi 2008-10-08 23:34:45 UTC
Created attachment 18232 [details]
config-2.6.26-1-686

could it be useful if i send you the tcpdump of the interface in the home server/gateway to which the pc we are testing is linked from which it is clear that if 2.6.26 is used in the latter ack are received if 2.6.27 is used no ack appears?
Comment 12 Rafael J. Wysocki 2008-10-09 08:43:47 UTC
Well, please test the latest mainline (2.6.27-rc9-git2).  Chances are that your problem has just been fixed.
Comment 13 Aldo Maggi 2008-10-09 17:59:52 UTC
no, i'm sorry, i've compiled the new kernel but results are still the same.
i wonder if the modules of 2.6.27 kernel used with my hw malform, somehow, the syn packets .... sorry i'm repeating what i said in my last message!
Comment 14 Rafael J. Wysocki 2008-10-10 05:09:29 UTC
Ok, the only thing you can do in that case it to try to identify the change that causes the problem to appear using bisection.

See http://desprofundis.blogspot.com/2008/06/git-bisect-instructions.html for instructions.
Comment 15 Jarek Poplawski 2008-10-17 00:16:12 UTC
Could you try to turn off window scaling like here?:
http://lkml.org/lkml/2008/2/7/471
Comment 16 Aldo Maggi 2008-10-17 00:59:47 UTC
ok, i did so:
echo 0 > /proc/sys/net/ipv4/tcp_window_scaling
both in my machine and in the pc i use as home_server/gateway
but the problem was not solved
anyway "1" was the value of proc/sys/net/ipv4/tcp_window_scaling before.
and, as far as i understand from the article you cite: http://lwn.net/Articles/92727/, the problem with window_scaling was already noticed in kernel 2.6.7 while in my case everything was alright till 2.6.26.6
Comment 17 Jarek Poplawski 2008-10-17 01:23:39 UTC
Sure, but tcp changes and this lkml thread was about later kernels
too. Btw, could you repeat this tcpdump with 2.6.26 and 2.6.27 in
working/not working case? Thanks.
Comment 18 Aldo Maggi 2008-10-17 10:07:02 UTC
i set echo 0 > /proc/sys/net/ipv4/tcp_window_scaling both on the pc and on the home server/gw
then i run w3m kernel.org  and
tcpdump both on the pc and on the home server/gw
i've repeated the above procedure twice once using kernel 2.6.26.6 and another using kernel 2.6.27-rc1-git1
see please the attached files
thanks
aldo
Comment 19 Aldo Maggi 2008-10-17 10:08:31 UTC
Created attachment 18350 [details]
tcpdump -i eth0 -v using kernel 2.6.26.6
Comment 20 Aldo Maggi 2008-10-17 10:09:26 UTC
Created attachment 18351 [details]
tcpdump -i eth2 -v  on the home server/gateway
Comment 21 Aldo Maggi 2008-10-17 10:10:17 UTC
Created attachment 18352 [details]
tcpdump -i eth0 -v using kernel 2.6.27-rc1-git1
Comment 22 Aldo Maggi 2008-10-17 10:13:31 UTC
Created attachment 18353 [details]
tcpdump -i eth2 -v  on the home server/gateway

while on the other pc kernel 2.6.26.6 was running
Comment 23 Jarek Poplawski 2008-10-18 03:57:08 UTC
Thanks for these logs Aldo. Anyway, I'm still a bit confused, so could
we clear some doubts?:
1. server/gw is 192.168.1.1, topolino, always 2.6.26 , and there is no
problem with www connections?
2. pc is 192.168.1.3, paperino,  with 2.6.26 no problem, with 2.6.27
problems? 
3. tcpdump on gw is probably done on the local network interface; if
so could you try this on the internet interface (so 2 tcpdumps yet)?

Btw., let us know ifconfig, route -n, and iptables rules on this box.
(If you need to mask some private data let us know too.) Could you set
mtu on paperino's eth0 to 1400 before these tries?

Correct me if I get this wrong. (I assume you're changing to 2.6.27
only one box (paperino?) during these tests.)
Comment 24 Aldo Maggi 2008-10-18 07:12:10 UTC
sorry for not having been clearer! :-(
1. yes, server/gw is 192.168.1.1, topolino, always 2.6.26 (actually 2.6.26-1-686 linux-image from debian) , and there is no problem with www connections.
2. pc 192.168.1.3, paperino, where i'm running the tests, has no problem with 2.6.26 (both debian lenny and ubuntu intrepid are installed on that machine) problems have arosen when ubuntu intrepid upgraded to 2.6.27 (but i do not rely very much on ubuntu so my tests are made on debian lenny), at the moment 2.6.26.6 (which i've compiled from kernel.org sources) is used with lenny , i repeat with no problem, problems arise when i use kernel 2.6.27 on it (whatever is the rc, i've tried -rc1, -rc8, -rc9)
3. see please the attached files (they are the output while on paperino w3m kernel.org was executed), yes, i'm changing to 2.6.27 only paperino (192.168.1.3).

just to let you know, this morning i've bought a new modem (a recent one) to see if the problems persist (anyway i'd need it ... sooner or later, it is an "adsl2+" :-D ).
many thanks for your attention!
aldo
Comment 25 Aldo Maggi 2008-10-18 07:42:07 UTC
Created attachment 18361 [details]
ifconfig, route -n, and iptables rules on 192.168.1.1 (topolino) home server/gw
Comment 26 Aldo Maggi 2008-10-18 07:43:31 UTC
Created attachment 18362 [details]
ifconfig, route -n, on 192.168.1.3 (paperino)
Comment 27 Aldo Maggi 2008-10-18 07:45:57 UTC
Created attachment 18363 [details]
tcpdump on the wan eth on the home server/gw

while on 192.168.1.3 (paperino) was running 2.6.26.6 and w3m kernel.org was executed
Comment 28 Aldo Maggi 2008-10-18 07:47:29 UTC
Created attachment 18364 [details]
tcpdump on the wan eth on the home server/gw

while on 192.168.1.3 (paperino) 2.6.27-rc1-git1 was running and w3m kernel was executed
Comment 29 Jarek Poplawski 2008-10-18 11:08:55 UTC
Alas still no idea. Looks like these syn packets to kernel.org could
be different, so more details are needed. If you are not fed up with
this all try to repeat these last 2 tcpdumps with these parameters:

tcpdump -i eth0 -nXX -c3 'dst port 80 and tcp-syn != 0'

Additionally you could try this on paperino:
echo 0 > /proc/sys/net/ipv4/tcp_window_scaling
echo 0 > /proc/sys/net/ipv4/tcp_timestamps
echo 0 > /proc/sys/net/ipv4/tcp_sack

many thanks for your assistance!
Jarek P.
Comment 30 Aldo Maggi 2008-10-18 13:20:56 UTC
absolutely i'm not fed up! and actually i thank you for your patience.

i set paperino as you wrote and added ifconfig eth0 mtu 1400 as well
i set the same value of mtu in topolino as well, i hope i was not wrong doing so!

i attach the files you ask for, but now i can navigate with 2.6.27-rc1-git1 as well!

afterwards i've run some more tests and have found out that the culprit is tcp_sack, if using kernel 2.6.27-rc1-gt1 it is set to "1" i cannot navigate if to "0" i can.
so i enclose, should it be of any help, also the tcpdump on eth0 of topolino when on paperino tcp_sack is set to "1".

thanks
aldo
Comment 31 Aldo Maggi 2008-10-18 13:23:53 UTC
Created attachment 18366 [details]
tcpdump -i eth0 -nXX -c3 'dst port 80 and tcp-syn != 0'  on topolino

while using  kernel 2.6.26.6 on paperino
Comment 32 Aldo Maggi 2008-10-18 13:25:43 UTC
Created attachment 18367 [details]
tcpdump -i eth0 -nXX -c3 'dst port 80 and tcp-syn != 0'  on topolino

while using kernel 2.6.27-rc1-git1 on paperino
Comment 33 Aldo Maggi 2008-10-18 13:27:35 UTC
Created attachment 18368 [details]
tcpdump -i eth0 -nXX -c3 'dst port 80 and tcp-syn != 0'  on topolino

while using kernel 2.6.27-rc1-git un paperino and on the same machine /proc/sys/net/ipv4/tcp_sack is set to "1"
Comment 34 Jarek Poplawski 2008-10-18 14:03:00 UTC
Nice job Aldo!

I forward this message to netdev and Cc our best tcp expert.
Any new replies should be rather "Reply All" (not bugzilla only).

Thanks,
Jarek P.

On Sat, Oct 18, 2008 at 01:20:56PM -0700, bugme-daemon@bugzilla.kernel.org wrote:
> http://bugzilla.kernel.org/show_bug.cgi?id=11721
> 
...
> ------- Comment #30 from sentiniate@tiscali.it  2008-10-18 13:20 -------
> absolutely i'm not fed up! and actually i thank you for your patience.
> 
> i set paperino as you wrote and added ifconfig eth0 mtu 1400 as well
> i set the same value of mtu in topolino as well, i hope i was not wrong doing
> so!
> 
> i attach the files you ask for, but now i can navigate with 2.6.27-rc1-git1
> as
> well!
> 
> afterwards i've run some more tests and have found out that the culprit is
> tcp_sack, if using kernel 2.6.27-rc1-gt1 it is set to "1" i cannot navigate
> if
> to "0" i can.
> so i enclose, should it be of any help, also the tcpdump on eth0 of topolino
> when on paperino tcp_sack is set to "1".
> 
> thanks
> aldo
> 
> 
> -- 
> Configure bugmail: http://bugzilla.kernel.org/userprefs.cgi?tab=email
> ------- You are receiving this mail because: -------
> You are on the CC list for the bug, or are watching someone who is.
Comment 35 Jarek Poplawski 2008-10-18 14:50:16 UTC
[RESEND] I forgot to add Aldo's email before - sorry!

On Sat, Oct 18, 2008 at 11:02:52PM +0200, Jarek Poplawski wrote:
> Nice job Aldo!
> 
> I forward this message to netdev and Cc our best tcp expert.
> Any new replies should be rather "Reply All" (not bugzilla only).
> 
> Thanks,
> Jarek P.
> 
> On Sat, Oct 18, 2008 at 01:20:56PM -0700, bugme-daemon@bugzilla.kernel.org
> wrote:
> > http://bugzilla.kernel.org/show_bug.cgi?id=11721
> > 
> ...
> > ------- Comment #30 from sentiniate@tiscali.it  2008-10-18 13:20 -------
> > absolutely i'm not fed up! and actually i thank you for your patience.
> > 
> > i set paperino as you wrote and added ifconfig eth0 mtu 1400 as well
> > i set the same value of mtu in topolino as well, i hope i was not wrong
> doing
> > so!
> > 
> > i attach the files you ask for, but now i can navigate with 2.6.27-rc1-git1
> as
> > well!
> > 
> > afterwards i've run some more tests and have found out that the culprit is
> > tcp_sack, if using kernel 2.6.27-rc1-gt1 it is set to "1" i cannot navigate
> if
> > to "0" i can.
> > so i enclose, should it be of any help, also the tcpdump on eth0 of
> topolino
> > when on paperino tcp_sack is set to "1".
> > 
> > thanks
> > aldo
> > 
> > 
> > -- 
> > Configure bugmail: http://bugzilla.kernel.org/userprefs.cgi?tab=email
> > ------- You are receiving this mail because: -------
> > You are on the CC list for the bug, or are watching someone who is.
Comment 36 Jarek Poplawski 2008-10-20 02:38:49 UTC
>> On Sat, Oct 18, 2008 at 01:20:56PM -0700, bugme-daemon@bugzilla.kernel.org
>> wrote:
>>> http://bugzilla.kernel.org/show_bug.cgi?id=11721
...
>>> ------- Comment #30 from sentiniate@tiscali.it  2008-10-18 13:20 -------
...
>>> afterwards i've run some more tests and have found out that the culprit is
>>> tcp_sack, if using kernel 2.6.27-rc1-gt1 it is set to "1" i cannot navigate
>>> if
>>> to "0" i can.
...

Aldo, for curiosity, could you also try if this change only
(instead of tcp_sack etc.) can't do similar effect?:

echo 0 > /proc/sys/net/ipv4/tcp_timestamps

Thanks,
Jarek P.
Comment 37 Ilpo Järvinen 2008-10-20 02:39:01 UTC
On Sat, 18 Oct 2008, Jarek Poplawski wrote:

> [RESEND] I forgot to add Aldo's email before - sorry!
> 
> On Sat, Oct 18, 2008 at 11:02:52PM +0200, Jarek Poplawski wrote:
> > Nice job Aldo!
> > 
> > I forward this message to netdev and Cc our best tcp expert.
> > Any new replies should be rather "Reply All" (not bugzilla only).
> > 
> > Thanks,
> > Jarek P.
> > 
> > On Sat, Oct 18, 2008 at 01:20:56PM -0700, bugme-daemon@bugzilla.kernel.org
> wrote:
> > > http://bugzilla.kernel.org/show_bug.cgi?id=11721
> > > 
> > ...
> > > ------- Comment #30 from sentiniate@tiscali.it  2008-10-18 13:20 -------
> > > absolutely i'm not fed up! and actually i thank you for your patience.
> > > 
> > > i set paperino as you wrote and added ifconfig eth0 mtu 1400 as well
> > > i set the same value of mtu in topolino as well, i hope i was not wrong
> doing
> > > so!
> > > 
> > > i attach the files you ask for, but now i can navigate with
> 2.6.27-rc1-git1 as
> > > well!
> > > 
> > > afterwards i've run some more tests and have found out that the culprit
> is
> > > tcp_sack, if using kernel 2.6.27-rc1-gt1 it is set to "1" i cannot
> navigate if
> > > to "0" i can.
> > > so i enclose, should it be of any help, also the tcpdump on eth0 of
> topolino
> > > when on paperino tcp_sack is set to "1".

So this ended up into tcp domain after all (I took earlier a brief 
anyway and found out that there are not that many changes 2.6.26..2.6.27 
-- net/ipv4/tcp*.c include/net/tcp.h)...

I compared your packet against a good one from elsewhere.. I couldn't 
compare your latest dumps fully because attachments 18366 and 18367 are 
with different TCP options (you forgot zeros to sysctls in them?)... 
Anyway, only thing that seemed to be different to that case from elsewhere 
were those extra bytes in the beginning (some below ip protocol that 
gets captured by tcpdump?) which are equal in both working and broken case 
of yours and the different ordering of the tcp options as noted by Jarek 
earlier. I tried to go through the fields one by one but nothing seemed to 
be wrong...

...Might be something crazy in the way that is too picky on tcp option 
ordering which wouldn't surprise me that much... :-) Please try if
the patch below does any difference (on paperino is enough, the gw seems 
innocent here).

If that didn't help, can you please restore the sysctls to 1 and redo
2.6.26.6 dump (like in attachment 18366 [details]) so that I get a fully comparable 
sample.

-- 
 i.


diff --git a/net/ipv4/tcp_output.c b/net/ipv4/tcp_output.c
index 990a584..850a4e9 100644
--- a/net/ipv4/tcp_output.c
+++ b/net/ipv4/tcp_output.c
@@ -376,6 +376,12 @@ static void tcp_options_write(__be32 *ptr, struct tcp_sock *tp,
 		*md5_hash = NULL;
 	}
 
+	if (unlikely(opts->mss)) {
+		*ptr++ = htonl((TCPOPT_MSS << 24) |
+			       (TCPOLEN_MSS << 16) |
+			       opts->mss);
+	}
+
 	if (likely(OPTION_TS & opts->options)) {
 		if (unlikely(OPTION_SACK_ADVERTISE & opts->options)) {
 			*ptr++ = htonl((TCPOPT_SACK_PERM << 24) |
@@ -392,12 +398,6 @@ static void tcp_options_write(__be32 *ptr, struct tcp_sock *tp,
 		*ptr++ = htonl(opts->tsecr);
 	}
 
-	if (unlikely(opts->mss)) {
-		*ptr++ = htonl((TCPOPT_MSS << 24) |
-			       (TCPOLEN_MSS << 16) |
-			       opts->mss);
-	}
-
 	if (unlikely(OPTION_SACK_ADVERTISE & opts->options &&
 		     !(OPTION_TS & opts->options))) {
 		*ptr++ = htonl((TCPOPT_NOP << 24) |
Comment 38 Jarek Poplawski 2008-10-20 02:51:26 UTC
On Mon, Oct 20, 2008 at 12:38:51PM +0300, Ilpo J
Comment 39 Ilpo Järvinen 2008-10-20 02:59:34 UTC
On Mon, 20 Oct 2008, Jarek Poplawski wrote:

> On Mon, Oct 20, 2008 at 12:38:51PM +0300, Ilpo J
Comment 40 Aldo Maggi 2008-10-20 08:47:36 UTC
ok setting echo 0 > /proc/sys/net/ipv4/tcp_timestamps solves the problem as well.

anyway i'm willing to do whatever test you deem necessary if this can help you solve the issue, so as soon as i can i'll apply the patch provided by  Ilpo Järvinen and compile (do not worry i've been a linux user for the last 10 years and up to 4 or 5 yrs ago i used to compile even three times a week!)

thanks
aldo
Comment 41 Jarek Poplawski 2008-10-20 10:15:29 UTC
On Mon, Oct 20, 2008 at 08:47:37AM -0700, bugme-daemon@bugzilla.kernel.org wrote:
> http://bugzilla.kernel.org/show_bug.cgi?id=11721
...
> ------- Comment #40 from sentiniate@tiscali.it  2008-10-20 08:47 -------
> ok setting echo 0 > /proc/sys/net/ipv4/tcp_timestamps solves the problem as
> well.
> 
> anyway i'm willing to do whatever test you deem necessary if this can help
> you
> solve the issue, so as soon as i can i'll apply the patch provided by  Ilpo
> J
Comment 42 Aldo Maggi 2008-10-20 12:35:14 UTC
Ilpo,
i'm sending herewith attached the output of
tcpdump -i eth0 -nXX -c3 'dst port 80 and tcp-syn != 0' on topolino when:
1) kernel 2.6.26.6 is running on paperino and tcp_window_scaling, tcp_timestamps, tcp_sack are set to "1"
2) kernel 2.6.26.6 is running on paperino and tcp_window_scaling, tcp_timestamps, tcp_sack are set to "0"
3) kernel 2.6.27-rc1-git1 is running on paperino and tcp_window_scaling, tcp_timestamps, tcp_sack are set to "1"
4) kernel 2.6.27-rc1-git1 is running on paperino and tcp_window_scaling, tcp_timestamps, tcp_sack are set to "0"

moreover, i read more carefully what you and Jarek wrote above, please let me know if you deem still necessary that i apply the patch you provided and compile or if you like i compile the latest kernel or whatever.

thanks
aldo
Comment 43 Aldo Maggi 2008-10-20 12:37:31 UTC
Created attachment 18381 [details]
kernel 2.6.26.6 is running on paperino and tcp_window_scaling, tcp_timestamps, tcp_sack are set to "1"
Comment 44 Aldo Maggi 2008-10-20 12:38:46 UTC
Created attachment 18382 [details]
kernel 2.6.26.6 is running on paperino and tcp_window_scaling, tcp_timestamps, tcp_sack are set to "0"
Comment 45 Aldo Maggi 2008-10-20 12:40:07 UTC
Created attachment 18383 [details]
kernel 2.6.27-rc1-git1 is running on paperino and tcp_window_scaling, tcp_timestamps, tcp_sack are set to "1"
Comment 46 Aldo Maggi 2008-10-20 12:40:47 UTC
Created attachment 18384 [details]
kernel 2.6.27-rc1-git1 is running on paperino and tcp_window_scaling, tcp_timestamps, tcp_sack are set to "0"
Comment 47 Ilpo Järvinen 2008-10-21 02:36:48 UTC
Ah, I forgot to add bugzilla back last time, so added it there now.

On Tue, 21 Oct 2008, Aldo Maggi wrote:

> just as matter of information, two other cases similar to mine were
> reported in ubuntu bug pages:
> https://bugs.launchpad.net/ubuntu/+source/linux/+bug/272896
> https://bugs.launchpad.net/ubuntu/+bug/285430
> i originated the first one and gave Jarek's first solution:
> tcp_sack=0
> which worked for the two other users.
> 
> maybe they could be contacted in order to perform further tests in a
> different environment.

It's hardly surprising that I couldn't reproduce this, the non-compliance 
here is most probably in isp's device or in some end-user sold embedded 
device.

Can you try this another debug patch below (on 2.6.27.2 is fine). It moves 
the mss to the last position but should keep timestamps in place by making 
wscale as first option. It is well possible that you won't get it working 
at all except with all ts,sack and wscale set to 0 (the most likely 
result). Please try with all wscale,sack,ts combinations (no need to 
provide dumps, just working/not working per case)... This should 
tell us for quite high certaintity what is the actual option which is 
causing this (would it not be the mss-at-beginning which is the most 
likely cause), however, your finding may well be specific to your network 
while the other people might a bit different results.

In order to provide maximal compatibility, I think we just restore the 
previous ordering of the fields (basically the first patch you tested).
It has no additional cost, so it won't hurt any, but it's quite ridiculous 
still that some devices care so little about basic tcp spec which has 
devastating effect on interoperatibility here, they should really fix
the devices instead but knowing how little most of the isp & etc. care
(or even understand) I'm not expecting too much to happen on that
front, and those who care probably run some semi-sane stuff already
anyway... :-). ...Sadly, it's much easier and cheaper to blame the 
end-user's equipment or Linux (if/once it becomes known that it's in use) 
and do nothing in case one is fool enough to complain to them.


-- 
 i.


--
diff --git a/net/ipv4/tcp_output.c b/net/ipv4/tcp_output.c
index de54f02..63b0a3f 100644
--- a/net/ipv4/tcp_output.c
+++ b/net/ipv4/tcp_output.c
@@ -376,6 +376,13 @@ static void tcp_options_write(__be32 *ptr, struct tcp_sock *tp,
 		*md5_hash = NULL;
 	}
 
+	if (unlikely(opts->ws)) {
+		*ptr++ = htonl((TCPOPT_NOP << 24) |
+			       (TCPOPT_WINDOW << 16) |
+			       (TCPOLEN_WINDOW << 8) |
+			       opts->ws);
+	}
+
 	if (likely(OPTION_TS & opts->options)) {
 		if (unlikely(OPTION_SACK_ADVERTISE & opts->options)) {
 			*ptr++ = htonl((TCPOPT_SACK_PERM << 24) |
@@ -392,12 +399,6 @@ static void tcp_options_write(__be32 *ptr, struct tcp_sock *tp,
 		*ptr++ = htonl(opts->tsecr);
 	}
 
-	if (unlikely(opts->mss)) {
-		*ptr++ = htonl((TCPOPT_MSS << 24) |
-			       (TCPOLEN_MSS << 16) |
-			       opts->mss);
-	}
-
 	if (unlikely(OPTION_SACK_ADVERTISE & opts->options &&
 		     !(OPTION_TS & opts->options))) {
 		*ptr++ = htonl((TCPOPT_NOP << 24) |
@@ -406,11 +407,10 @@ static void tcp_options_write(__be32 *ptr, struct tcp_sock *tp,
 			       TCPOLEN_SACK_PERM);
 	}
 
-	if (unlikely(opts->ws)) {
-		*ptr++ = htonl((TCPOPT_NOP << 24) |
-			       (TCPOPT_WINDOW << 16) |
-			       (TCPOLEN_WINDOW << 8) |
-			       opts->ws);
+	if (unlikely(opts->mss)) {
+		*ptr++ = htonl((TCPOPT_MSS << 24) |
+			       (TCPOLEN_MSS << 16) |
+			       opts->mss);
 	}
 
 	if (unlikely(opts->num_sack_blocks)) {
Comment 48 Jarek Poplawski 2008-10-21 03:09:38 UTC
On Tue, Oct 21, 2008 at 12:36:33PM +0300, Ilpo J
Comment 49 Ilpo Järvinen 2008-10-21 03:52:02 UTC
On Tue, 21 Oct 2008, Jarek Poplawski wrote:

> On Tue, Oct 21, 2008 at 12:36:33PM +0300, Ilpo J
Comment 50 Jarek Poplawski 2008-10-21 04:12:36 UTC
On Tue, Oct 21, 2008 at 01:51:10PM +0300, Ilpo J
Comment 51 Ilpo Järvinen 2008-10-21 05:19:42 UTC
On Tue, 21 Oct 2008, Jarek Poplawski wrote:

> On Tue, Oct 21, 2008 at 01:51:10PM +0300, Ilpo J
Comment 52 Jarek Poplawski 2008-10-21 05:34:22 UTC
On Tue, Oct 21, 2008 at 03:18:57PM +0300, Ilpo J
Comment 53 Ilpo Järvinen 2008-10-21 07:17:32 UTC
On Tue, 21 Oct 2008, Jarek Poplawski wrote:

> On Tue, Oct 21, 2008 at 03:18:57PM +0300, Ilpo J
Comment 54 Jarek Poplawski 2008-10-21 09:29:23 UTC
On Tue, Oct 21, 2008 at 05:16:43PM +0300, Ilpo J
Comment 55 Aldo Maggi 2008-10-21 11:19:20 UTC
Il giorno Tue, 21 Oct 2008 12:36:33 +0300 (EEST)
"Ilpo Järvinen" <ilpo.jarvinen@helsinki.fi> ha scritto:

[...]

> 
> Can you try this another debug patch below (on 2.6.27.2 is fine). It
> moves the mss to the last position but should keep timestamps in
> place by making wscale as first option. It is well possible that you
> won't get it working at all except with all ts,sack and wscale set to
> 0 (the most likely result). Please try with all wscale,sack,ts
> combinations (no need to provide dumps, just working/not working per
> case)... This should tell us for quite high certaintity what is the
> actual option which is causing this (would it not be the
> mss-at-beginning which is the most likely cause), however, your
> finding may well be specific to your network while the other people
> might a bit different results.

i've compiled 2.6.27.2 source after having patched it with your today's
patch.

it works! i.e. i can navigate (w3m kernel.org) and update (apt-get
update)

i send you anyway the usual tcpdump as in previous msgs, should it be
of any help.

do you want me to provide some more commands output?

waiting...

aldo
Comment 56 Jarek Poplawski 2008-10-21 11:45:16 UTC
On Tue, Oct 21, 2008 at 08:18:57PM +0200, Aldo Maggi wrote:
...
> i've compiled 2.6.27.2 source after having patched it with your today's
> patch.
> 
> it works! i.e. i can navigate (w3m kernel.org) and update (apt-get
> update)

Ilpo, I should say you're incredible! ...But, since I've promised to
myself not to disturb you anymore (again), I can't do this, sorry :-(

Jarek P.
Comment 57 David S. Miller 2008-10-21 16:32:33 UTC
From: Jarek Poplawski <jarkao2@gmail.com>
Date: Tue, 21 Oct 2008 20:45:39 +0200

> On Tue, Oct 21, 2008 at 08:18:57PM +0200, Aldo Maggi wrote:
> ...
> > i've compiled 2.6.27.2 source after having patched it with your today's
> > patch.
> > 
> > it works! i.e. i can navigate (w3m kernel.org) and update (apt-get
> > update)
> 
> Ilpo, I should say you're incredible! ...But, since I've promised to
> myself not to disturb you anymore (again), I can't do this, sorry :-(

Indeed, excellent work Ilpo.

Ilpo, now that we know this fixes things for sure, could you submit
this formally with a proper signoff?

I'll queue it up for -stable too.

Thanks!
Comment 58 Ilpo Järvinen 2008-10-22 03:02:02 UTC
On Tue, 21 Oct 2008, David Miller wrote:

> From: Jarek Poplawski <jarkao2@gmail.com>
> Date: Tue, 21 Oct 2008 20:45:39 +0200
> 
> > On Tue, Oct 21, 2008 at 08:18:57PM +0200, Aldo Maggi wrote:
> > ...
> > > i've compiled 2.6.27.2 source after having patched it with your today's
> > > patch.
> > > 
> > > it works! i.e. i can navigate (w3m kernel.org) and update (apt-get
> > > update)
> > 
> > Ilpo, I should say you're incredible! ...But, since I've promised to
> > myself not to disturb you anymore (again), I can't do this, sorry :-(
> 
> Indeed, excellent work Ilpo.
> 
> Ilpo, now that we know this fixes things for sure, could you submit
> this formally with a proper signoff?
> 
> I'll queue it up for -stable too.
> 
> Thanks!

Sure, here below is one with a warning (it's the first patch + comment).

Olon, can you please check this as well if it affect to your case too 
(though the symptoms were not that clear in your case).

It would be nice for Aldo to check what the result will be with my second 
patch (only) using sack=1,ts=0,wscale=0. I guess it does but it's a bit 
unclear if nop's in front help or not (having the patch below should 
anyway help also in that case as the mss option gets moved before it 
anyway).

--
[PATCH] tcp: Restore ordering of TCP options for the sake of inter-operability

This is not our bug! Sadly some devices cannot cope with the change
of TCP option ordering which was a result of the recent rewrite of
the option code (not that there was some particular reason steming
from the rewrite for the reordering) though any ordering of TCP
options is perfectly legal. Thus we restore the original ordering
to allow interoperability with/through such broken devices and add
some warning about this trap. Since the reordering just happened
without any particular reason, this change shouldn't cost us
anything.

There are already couple of known failure reports (within close
proximity of the last release), so the problem might be more
wide-spread than a single device. And other reports which may
be due to the same problem though the symptoms were less obvious.
Analysis of one of the case revealed (with very high probability)
that sack capability cannot be negotiated as the first option
(SYN never got a response).

Signed-off-by: Ilpo J
Comment 59 Aldo Maggi 2008-10-22 04:50:31 UTC
Il giorno Wed, 22 Oct 2008 13:00:01 +0300 (EEST)
"Ilpo Järvinen" <ilpo.jarvinen@helsinki.fi> ha scritto:

[...]

> 
> It would be nice for Aldo to check what the result will be with my
> second patch (only) using sack=1,ts=0,wscale=0. I guess it does but
> it's a bit unclear if nop's in front help or not (having the patch
> below should anyway help also in that case as the mss option gets
> moved before it anyway).

in order to avoid misunderstandings, i list herebelow my actions:

i've used on paperino the kernel 2.6.27.2 patched with ilpo's
21.10.2008 patch (see please comment 47 in
http://bugzilla.kernel.org/show_bug.cgi?id=11721, and NOT with the patch
contained in comment 58, right?)
i've modified the following files:
echo 0 > /proc/sys/net/ipv4/tcp_window_scaling
echo 0 > /proc/sys/net/ipv4/tcp_timestamps
echo 1 > /proc/sys/net/ipv4/tcp_sack

the results are that i CAN navigate (w3m kernel.org) and update my
system (apt-get update in debian).

should it be of any help to ilpo, i attach the usual tcpdump on the wan
eth of topolino (my home server/gw)

ciao!
aldo



[...]
Comment 60 Ilpo Järvinen 2008-10-22 05:11:06 UTC
On Wed, 22 Oct 2008, Aldo Maggi wrote:

> Il giorno Wed, 22 Oct 2008 13:00:01 +0300 (EEST)
> "Ilpo J
Comment 61 David S. Miller 2008-10-23 14:07:48 UTC
From: "Ilpo J
Comment 63 Jarek Poplawski 2008-10-26 23:52:02 UTC
On Sat, Oct 25, 2008 at 10:15:15PM +0200, Aldo Maggi wrote:
> Il giorno Tue, 21 Oct 2008 07:49:54 +0000
> Jarek Poplawski <jarkao2@gmail.com> ha scritto:
> 
> > On Tue, Oct 21, 2008 at 09:27:21AM +0200, Aldo Maggi wrote:
> > ...
> > > as soon as i've time i'll replace the modem and run some tests.
> 
> just to let you know!
> i have changed my modem with a new one.
> the bug has disappeared with no modification of the tcp files.
> this shows that the problem was due to my old zyxel.
> 
> ciao!
> aldo
> 

Aldo, I think it's a very useful information, so I forward this to the
people.

Thanks again,
Jarek P.
Comment 64 David Woodhouse 2008-10-30 02:25:44 UTC
Do we know which are the affected routers/firewalls? We should name and shame them, and ensure that bug reports are filed. 
Comment 65 Ilpo Järvinen 2008-10-30 13:13:57 UTC
Just in case you're reading this bug report and wondering what you should do with this bug...

There are some analysis which are claimed to be "very good" circulating around which tell you that Linux now enabled timestamps in 2.6.27. Sadly that is a _false_ claim, the timestamps were _not_ enabled for 2.6.27 (or -rc1). Don't be fooled wide-spreadness of the claims, sadly multiple distros seem to repeat and support the false claim in their "semi-official" documentation. Timestamps have been enabled already for a very long time before 2.6.27...

The real change that happened from 2.6.26 to 2.6.27, as described in the commit which fixes this particular bug, was a change in the _order_ of the TCP options (timestamps are tcp options). The order change was not intentional, not that it should have broken something. The correct fix is to restore the original ordering. ...It is very easy to verify with tcpdump if 2.6.26 does send timestamps or not (hint: look into SYN packet's TCP options).
Comment 66 Dean Holland 2008-11-08 22:03:17 UTC
I am still seeing this issue on a PPP connection with 2.6.27. 2.6.24-21 (the previous Ubuntu kernel) and a vanilla 2.6.26 kernel from kernel.org are fine. Aldo has suggested I reopen this and add the outputs of the scenarios as per comment #42.
Comment 67 Dean Holland 2008-11-08 22:04:11 UTC
Created attachment 18741 [details]
Output of tcpdump scenarios; different kernels and tcp /proc settings
Comment 68 Jarek Poplawski 2008-11-12 23:05:08 UTC
> ------- Comment #66 from speedster@haveacry.com  2008-11-08 22:03 -------
> I am still seeing this issue on a PPP connection with 2.6.27. 2.6.24-21 (the
> previous Ubuntu kernel) and a vanilla 2.6.26 kernel from kernel.org are fine.
> Aldo has suggested I reopen this and add the outputs of the scenarios as per
> comment #42.

Dean, it looks like a different problem yet, so better try to open a
new bugzilla report with more description, .configs, dmesgs, ifconfigs
etc. for working and not working case. BTW, what about ping or
connecting to other sites? Could you attach more verbose, not filtered
tcpdump on an interface closest to the internet while pinging and www?

Thanks,
Jarek P.

Note You need to log in before you can comment on or make changes to this bug.