Bug 14807 - r8168b with r8169 driver hangs after system startup, not after reboot
Status: RESOLVED OBSOLETE
Alias: None
Product: Drivers
Classification: Unclassified
Component: Network
Hardware: All Linux
Importance: P1 high
Assignee: Francois Romieu
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2009-12-14 08:44 UTC by Marcus Husar
Modified: 2012-11-20 17:16 UTC
CC List: 2 users

See Also:
Kernel Version: 2.6.31.6
Subsystem:
Regression: Yes
Bisected commit-id:


Attachments

Description Marcus Husar 2009-12-14 08:44:48 UTC
Hello,

My problem with the Realtek 8168B and the r8169 driver differs slightly from the other reports found by searching for "sch_generic:246".

The network hangs only when the machine has been shut down for a few hours or the power supply has been disconnected from the mains for a few seconds. In other words, the network hangs on the first startup after a cold boot. After a reboot everything is fine. I have seen no hangs while the machine was running with a working network.

This problem happens on two boards (Asus M2A-VM) with the latest BIOS. It also happens, and more often, with kernel 2.6.30, but there the oops appears on line 226 of sch_generic.c.

Until this problem with daily startups is solved, I can't deliver these machines to my users.

Sincerely yours,
Marcus Husar

lspci:
02:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168B PCI Express Gigabit Ethernet controller (rev ff)
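
A note on the "(rev ff)" in the lspci line above: a revision of ff often means that reads of the PCI configuration space return all ones, i.e. the device is not answering on the bus. That interpretation is an assumption and is not confirmed anywhere in this report. A minimal Python sketch to check it in the hung state could look like the following. The sysfs address 0000:02:00.0 assumes PCI domain 0000, and both function names are made up for illustration.

```python
from pathlib import Path


def config_space_all_ones(config_bytes):
    """Return True when the dump is non-empty and every byte is 0xff."""
    return len(config_bytes) > 0 and all(b == 0xFF for b in config_bytes)


def read_pci_config(address, sysfs_root="/sys/bus/pci/devices"):
    """Read the raw configuration space of one PCI device from sysfs."""
    return (Path(sysfs_root) / address / "config").read_bytes()


if __name__ == "__main__":
    # Assumption: the NIC shown by lspci as 02:00.0 lives in PCI domain 0000.
    address = "0000:02:00.0"
    try:
        data = read_pci_config(address)
    except OSError as exc:
        print(f"could not read config space of {address}: {exc}")
    else:
        if config_space_all_ones(data):
            print(f"{address}: config space reads as all 0xff")
        else:
            print(f"{address}: config space looks readable")
```

Comparing the result after a cold boot with the result after a reboot would show whether the device is unreachable only in the failing case.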

According to ifconfig the tx queue drops packets:
          UP BROADCAST RUNNING MULTICAST  MTU:1500  Metric:1
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:184 overruns:0 carrier:0
          collisions:0 txqueuelen:1000
          RX bytes:0 (0.0 B)  TX bytes:0 (0.0 B)
          Interrupt:25 Base address:0x4000
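
The symptom in the counters above (no packets sent, but a growing TX drop count) can be checked automatically on each startup. The following is only a sketch in Python, assuming the classic net-tools ifconfig output format shown here. The function names and the "stalled" heuristic are illustrative and not part of any tool.

```python
import re


def parse_ifconfig_counters(text):
    """Extract the 'RX packets' and 'TX packets' lines into nested dicts."""
    counters = {}
    for line in text.splitlines():
        match = re.match(r"\s*(RX|TX) packets:(\d+)\s+(.*)", line)
        if not match:
            continue
        direction, packets, rest = match.groups()
        fields = {"packets": int(packets)}
        # Remaining fields look like "errors:0 dropped:184 ...".
        for name, value in re.findall(r"(\w+):(\d+)", rest):
            fields[name] = int(value)
        counters[direction] = fields
    return counters


def tx_looks_stalled(counters):
    """Heuristic: nothing was transmitted, yet packets were dropped."""
    tx = counters.get("TX", {})
    return tx.get("packets", 0) == 0 and tx.get("dropped", 0) > 0


# The two counter lines from the output above.
SAMPLE = """\
          RX packets:0 errors:0 dropped:0 overruns:0 frame:0
          TX packets:0 errors:0 dropped:184 overruns:0 carrier:0
"""

if __name__ == "__main__":
    parsed = parse_ifconfig_counters(SAMPLE)
    print(parsed["TX"], tx_looks_stalled(parsed))
```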

ethtool doesn't report any problem, but mii-tool doesn't work at all in this state:
No MII transceiver present!.

[   80.816249] ------------[ cut here ]------------
[   80.816272] WARNING: at /build/buildd/linux-2.6-2.6.31/debian/build/source_amd64_none/net/sched/sch_generic.c:246 dev_watchdog+0xe2/0x194()
[   80.816281] Hardware name: System Product Name
[   80.816286] NETDEV WATCHDOG: eth0 (r8169): transmit queue 0 timed out
[   80.816291] Modules linked in: radeon drm ppdev lp parport powernow_k8 cpufreq_stats cpufreq_powersave cpufreq_conservative cpufreq_userspace fuse ext3 jbd loop snd_hda_codec_realtek snd_hda_intel snd_hda_codec snd_hwdep snd_pcm_oss snd_mixer_oss snd_pcm snd_timer snd shpchp soundcore pci_hotplug i2c_piix4 snd_page_alloc i2c_core amd64_edac_mod k8temp asus_atk0110 psmouse edac_core evdev processor button pcspkr serio_raw ext4 mbcache jbd2 crc16 dm_mod usbhid hid sd_mod crc_t10dif ata_generic ide_pci_generic ahci r8169 atiixp mii libata scsi_mod ohci_hcd ehci_hcd ide_core thermal fan thermal_sys [last unloaded: scsi_wait_scan]
[   80.816399] Pid: 0, comm: swapper Not tainted 2.6.31-1-amd64 #1
[   80.816405] Call Trace:
[   80.816410]  <IRQ>  [<ffffffff812e22fd>] ? dev_watchdog+0xe2/0x194
[   80.816431]  [<ffffffff8105c418>] ? warn_slowpath_common+0x85/0xc8
[   80.816439]  [<ffffffff812e221b>] ? dev_watchdog+0x0/0x194
[   80.816448]  [<ffffffff8105c4f5>] ? warn_slowpath_fmt+0x61/0x7e
[   80.816458]  [<ffffffff810775ff>] ? autoremove_wake_function+0x1d/0x59
[   80.816468]  [<ffffffff81046845>] ? __wake_up_common+0x54/0x9a
[   80.816476]  [<ffffffff812e21d8>] ? netif_tx_lock+0x51/0x94
[   80.816486]  [<ffffffff812c8d20>] ? netdev_drivername+0x4f/0x6c
[   80.816494]  [<ffffffff812e221b>] ? dev_watchdog+0x0/0x194
[   80.816502]  [<ffffffff812e22fd>] ? dev_watchdog+0xe2/0x194
[   80.816513]  [<ffffffff81072ce6>] ? __queue_work+0x35/0x5c
[   80.816522]  [<ffffffff8107d362>] ? sched_clock_cpu+0xb1/0x241
[   80.816533]  [<ffffffff81068e89>] ? run_timer_softirq+0x1a0/0x22f
[   80.816542]  [<ffffffff810812c6>] ? getnstimeofday+0x65/0xd7
[   80.816550]  [<ffffffff8107b192>] ? ktime_get_ts+0x31/0x71
[   80.816559]  [<ffffffff810638d6>] ? __do_softirq+0xe2/0x1bc
[   80.816569]  [<ffffffff8101302c>] ? call_softirq+0x1c/0x30
[   80.816577]  [<ffffffff8101530e>] ? do_softirq+0x4e/0xa9
[   80.816584]  [<ffffffff810634e0>] ? irq_exit+0x4f/0xba
[   80.816594]  [<ffffffff8102b1ae>] ? smp_apic_timer_interrupt+0x97/0xbc
[   80.816602]  [<ffffffff810296b3>] ? smp_reschedule_interrupt+0x28/0x4b
[   80.816611]  [<ffffffff810129f3>] ? apic_timer_interrupt+0x13/0x20
[   80.816615]  <EOI>  [<ffffffff815492a0>] ? early_idt_handler+0x0/0x71
[   80.816632]  [<ffffffff81034d58>] ? native_safe_halt+0x2/0x3
[   80.816642]  [<ffffffff8101a724>] ? default_idle+0x50/0x8d
[   80.816650]  [<ffffffff8101acb9>] ? c1e_idle+0xe3/0x120
[   80.816661]  [<ffffffff8101109c>] ? cpu_idle+0xab/0xec
[   80.816669]  [<ffffffff8154a12b>] ? start_kernel+0x3db/0x3fc
[   80.816677]  [<ffffffff815492a0>] ? early_idt_handler+0x0/0x71
[   80.816685]  [<ffffffff8154954c>] ? x86_64_start_kernel+0x107/0x129
[   80.816692] ---[ end trace b79d8e50b90acfc4 ]---
[   81.632220] r8169: eth0: link up
[  159.632235] r8169: eth0: link up
[  231.632256] r8169: eth0: link up
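
To tell the failing startups apart from the good ones without reading the whole log, the watchdog message can be picked out of the kernel log text. This is a rough Python sketch. The regular expression is based only on the "NETDEV WATCHDOG" line in the trace above, and the function name is hypothetical.

```python
import re

# Pattern derived from the message format seen in the trace above.
WATCHDOG_RE = re.compile(
    r"NETDEV WATCHDOG: (?P<iface>\S+) \((?P<driver>[^)]+)\): "
    r"transmit queue (?P<queue>\d+) timed out"
)


def find_tx_timeouts(log_text):
    """Return (interface, driver, queue) for each watchdog timeout line."""
    events = []
    for line in log_text.splitlines():
        match = WATCHDOG_RE.search(line)
        if match:
            events.append(
                (match.group("iface"), match.group("driver"), int(match.group("queue")))
            )
    return events


if __name__ == "__main__":
    sample = (
        "[   80.816286] NETDEV WATCHDOG: eth0 (r8169): "
        "transmit queue 0 timed out\n"
        "[   81.632220] r8169: eth0: link up\n"
    )
    print(find_tx_timeouts(sample))
```

In practice the log text would come from dmesg or a saved kernel log. The sample string here only repeats two lines from this report.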
Comment 1 Francois Romieu 2012-08-24 21:27:23 UTC
This is an RTL_GIGA_MAC_VER_12 or RTL_GIGA_MAC_VER_17 chipset.

There were reports in early 2012 that the 8168 NIC of the M2A-VM
appeared prone to suspend/resume issues due to bad timings.

You may consider trying a post-v3.2 kernel.
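
A quick way to check whether a machine already runs a kernel later than v3.2 is to compare the release string. This is a minimal Python sketch. It assumes that "post v3.2" means a major.minor version strictly greater than 3.2, and the helper names are made up for illustration.

```python
import platform
import re


def kernel_version_tuple(release):
    """Turn a release string such as '2.6.31-1-amd64' into (major, minor)."""
    match = re.match(r"(\d+)\.(\d+)", release)
    if not match:
        raise ValueError(f"unrecognised kernel release: {release!r}")
    return int(match.group(1)), int(match.group(2))


def is_post_baseline(release, baseline=(3, 2)):
    """Assumption: 'post v3.2' means strictly newer than the 3.2 series."""
    return kernel_version_tuple(release) > baseline


if __name__ == "__main__":
    running = platform.release()
    print(running, is_post_baseline(running))
```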

-- 
Ueimor
Comment 2 Alan 2012-11-20 17:16:38 UTC
Closing as obsolete. If this is still seen with modern kernels, please re-open and update.
