Bug 201233 - e1000e
Summary: e1000e
Status: NEW
Alias: None
Product: Networking
Classification: Unclassified
Component: Other (show other bugs)
Hardware: Intel Linux
: P1 high
Assignee: Stephen Hemminger
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2018-09-25 19:44 UTC by Paweł
Modified: 2018-09-27 22:49 UTC (History)
1 user (show)

See Also:
Kernel Version: 4.17.2
Subsystem:
Regression: No
Bisected commit-id:


Attachments

Description Paweł 2018-09-25 19:44:31 UTC
[39066.374367] ------------[ cut here ]------------
[39066.374372] NETDEV WATCHDOG: enp0s31f6 (e1000e): transmit queue 0 timed out
[39066.374399] WARNING: CPU: 5 PID: 0 at net/sched/sch_generic.c:473 dev_watchdog+0x21a/0x220
[39066.374401] Modules linked in: tun snd_hda_codec_hdmi cfg80211 rfkill 8021q mrp xt_tcpudp xt_conntrack ip6table_filter ip6_tables iptable_filter openvswitch nsh nf_conntrack_ipv6 nf_nat_ipv6 nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_defrag_ipv6 nf_nat nf_conntrack libcrc32c snd_hda_codec_realtek snd_hda_codec_generic nls_iso8859_1 nls_cp437 vfat fat nouveau intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel input_leds kvm i2c_algo_bit ttm drm_kms_helper irqbypass crct10dif_pclmul crc32_pclmul ghash_clmulni_intel pcbc drm aesni_intel snd_hda_intel aes_x86_64 crypto_simd cryptd glue_helper snd_hda_codec iTCO_wdt iTCO_vendor_support intel_cstate agpgart intel_uncore snd_hda_core syscopyarea sysfillrect sysimgblt fb_sys_fops mxm_wmi intel_rapl_perf led_class e1000e snd_hwdep r8169
[39066.374498]  snd_pcm pcspkr mii snd_timer mei_me snd hid_generic mei i2c_i801 soundcore intel_pch_thermal shpchp wmi rtc_cmos evdev mac_hid ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 fscrypto sd_mod usbhid hid ahci libahci xhci_pci xhci_hcd libata crc32c_intel usbcore scsi_mod usb_common
[39066.374552] CPU: 5 PID: 0 Comm: swapper/5 Not tainted 4.17.2-1-ARCH #1
[39066.374554] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./Z170 Pro4S, BIOS P7.50 01/23/2018
[39066.374559] RIP: 0010:dev_watchdog+0x21a/0x220
[39066.374562] RSP: 0018:ffff9585b6543e78 EFLAGS: 00010286
[39066.374567] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000103
[39066.374569] RDX: 0000000080000103 RSI: ffffffff93e83166 RDI: 00000000ffffffff
[39066.374572] RBP: ffff9585a1a0845c R08: 0000000000000050 R09: 0000000000000388
[39066.374575] R10: 0000000000000000 R11: 0000000000000001 R12: ffff9585a1a08478
[39066.374577] R13: ffff9585a1a08000 R14: 0000000000000001 R15: ffff9585a1116680
[39066.374581] FS:  0000000000000000(0000) GS:ffff9585b6540000(0000) knlGS:0000000000000000
[39066.374584] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[39066.374587] CR2: 00007f95007b9000 CR3: 000000033100a002 CR4: 00000000003626e0
[39066.374590] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[39066.374593] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[39066.374595] Call Trace:
[39066.374599]  <IRQ>
[39066.374607]  ? qdisc_reset+0xe0/0xe0
[39066.374611]  ? qdisc_reset+0xe0/0xe0
[39066.374619]  call_timer_fn+0x2b/0x150
[39066.374624]  ? qdisc_reset+0xe0/0xe0
[39066.374630]  expire_timers+0x99/0x110
[39066.374637]  run_timer_softirq+0x8a/0x160
[39066.374646]  ? sched_clock+0x5/0x10
[39066.374654]  ? sched_clock_cpu+0xe/0xd0
[39066.374662]  __do_softirq+0xf1/0x2e0
[39066.374669]  irq_exit+0xc9/0xe0
[39066.374676]  smp_apic_timer_interrupt+0x73/0x160
[39066.374682]  apic_timer_interrupt+0xf/0x20
[39066.374685]  </IRQ>
[39066.374692] RIP: 0010:cpuidle_enter_state+0xb7/0x2e0
[39066.374695] RSP: 0018:ffffb7f00196be98 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff13
[39066.374700] RAX: ffff9585b6540000 RBX: 00002387d95da598 RCX: 000000000000001f
[39066.374702] RDX: 00002387d95da598 RSI: ffffffff93e83166 RDI: ffffffff93e833c4
[39066.374705] RBP: 0000000000000006 R08: 00009d769b1b300b R09: 0000000000000dfa
[39066.374707] R10: 00000000000050ed R11: ffff9585b6560928 R12: ffff9585b656bb00
[39066.374710] R13: ffffffff940adc38 R14: 00002387d92f6191 R15: 0000000000000000
[39066.374720]  ? cpuidle_enter_state+0x92/0x2e0
[39066.374725]  do_idle+0x20a/0x240
[39066.374731]  cpu_startup_entry+0x6f/0x80
[39066.374738]  start_secondary+0x1aa/0x200
[39066.374745]  secondary_startup_64+0xa5/0xb0
[39066.374750] Code: 00 49 63 4c 24 e8 eb 8c 4c 89 ef c6 05 97 54 ad 00 01 e8 da 18 fd ff 89 d9 4c 89 ee 48 c7 c7 40 5e ef 93 48 89 c2 e8 00 1b a5 ff <0f> 0b eb be 66 90 0f 1f 44 00 00 48 c7 47 08 00 00 00 00 48 c7 
[39066.374840] ---[ end trace 69426933339b594f ]---

[66649.036433] e1000e 0000:00:1f.6 enp0s31f6: Detected Hardware Unit Hang:
                 TDH                  <cf>
                 TDT                  <ef>
                 next_to_use          <ef>
                 next_to_clean        <cf>
               buffer_info[next_to_clean]:
                 time_stamp           <1012faef5>
                 next_to_watch        <d0>
                 jiffies              <1012fb940>
                 next_to_watch.status <0>
               MAC Status             <40080083>
               PHY Status             <796d>
               PHY 1000BASE-T Status  <7800>
               PHY Extended Status    <3000>
               PCI Status             <10>
[66650.102864] e1000e 0000:00:1f.6 enp0s31f6: Reset adapter unexpectedly
Comment 1 Paweł 2018-09-27 22:49:11 UTC
[189201.833334] e1000e 0000:00:19.0 eno1: Detected Hardware Unit Hang:
                  TDH                  <30>                                                                                                 
                  TDT                  <69>                                                                                                 
                  next_to_use          <69>                                                                                                 
                  next_to_clean        <2f>                                                                                                 
                buffer_info[next_to_clean]:                                                                                                 
                  time_stamp           <10360b51d>                                                                                          
                  next_to_watch        <30>                                                                                                 
                  jiffies              <10360b8c0>                                                                                          
                  next_to_watch.status <0>                                                                                                  
                MAC Status             <80083>                                                                                              
                PHY Status             <796d>                                                                                               
                PHY 1000BASE-T Status  <7800>                                                                                               
                PHY Extended Status    <3000>                                                                                               
                PCI Status             <10>                                                                                                 
[189203.753309] e1000e 0000:00:19.0 eno1: Detected Hardware Unit Hang:
                  TDH                  <30>                                                                                                 
                  TDT                  <69>                                                                                                 
                  next_to_use          <69>                                                                                                 
                  next_to_clean        <2f>                                                                                                 
                buffer_info[next_to_clean]:                                                                                                 
                  time_stamp           <10360b51d>                                                                                          
                  next_to_watch        <30>                                                                                                 
                  jiffies              <10360bb00>                                                                                          
                  next_to_watch.status <0>                                                                                                  
                MAC Status             <80083>                                                                                              
                PHY Status             <796d>                                                                                               
                PHY 1000BASE-T Status  <7800>                                                                                               
                PHY Extended Status    <3000>                                                                                               
                PCI Status             <10>                                                                                                 
[189204.819768] ------------[ cut here ]------------
[189204.819784] NETDEV WATCHDOG: eno1 (e1000e): transmit queue 0 timed out
[189204.819810] WARNING: CPU: 15 PID: 0 at net/sched/sch_generic.c:461 dev_watchdog+0x21a/0x220
[189204.819812] Modules linked in: tun cfg80211 rfkill 8021q mrp cbc ceph libceph dns_resolver fscache xt_tcpudp xt_conntrack snd_hda_codec_hdmi ip6table_filter ip6_tables iptable_filter openvswitch nsh nf_conntrack_ipv6 nf_nat_ipv6 nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 nf_defrag_ipv6 nf_conncount nf_nat nf_conntrack libcrc32c nls_iso8859_1 nls_cp437 intel_rapl vfat fat nouveau x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_codec_realtek snd_hda_codec_generic ttm kvm drm_kms_helper snd_hda_intel snd_hda_codec irqbypass crct10dif_pclmul crc32_pclmul drm ghash_clmulni_intel pcbc snd_hda_core snd_hwdep snd_pcm aesni_intel aes_x86_64 crypto_simd cryptd snd_timer glue_helper agpgart syscopyarea snd sysfillrect sysimgblt iTCO_wdt intel_cstate input_leds fb_sys_fops iTCO_vendor_support
[189204.819896]  igb mxm_wmi uas e1000e intel_uncore mei_me led_class soundcore intel_rapl_perf mei i2c_algo_bit dca i2c_i801 pcspkr lpc_ich wmi evdev pcc_cpufreq mac_hid ip_tables x_tables ext4 crc32c_generic crc16 mbcache jbd2 fscrypto sd_mod hid_generic usbhid hid usb_storage ahci libahci xhci_pci ehci_pci libata xhci_hcd ehci_hcd crc32c_intel scsi_mod usbcore usb_common
[189204.819950] CPU: 15 PID: 0 Comm: swapper/15 Not tainted 4.18.6-arch1-1-ARCH #1
[189204.819952] Hardware name: MSI MS-7A54/X99A TOMAHAWK (MS-7A54), BIOS 2.20 06/15/2018
[189204.819957] RIP: 0010:dev_watchdog+0x21a/0x220
[189204.819962] Code: 49 63 4c 24 e8 eb 8c 4c 89 ef c6 05 eb ea aa 00 01 e8 5a eb fc ff 89 d9 4c 89 ee 48 c7 c7 38 dc cf 92 48 89 c2 e8 40 a7 a2 ff <0f> 0b eb be 66 90 0f 1f 44 00 00 48 c7 47 08 00 00 00 00 48 c7 07 
[189204.820040] RSP: 0018:ffff8a402f1c3e78 EFLAGS: 00010286
[189204.820044] RAX: 0000000000000000 RBX: 0000000000000000 RCX: 0000000000000103
[189204.820048] RDX: 0000000080000103 RSI: 00000000000000f6 RDI: 00000000ffffffff
[189204.820051] RBP: ffff8a4022bf445c R08: 0000000000000001 R09: 00000000000007a5
[189204.820055] R10: 0000000000000004 R11: 0000000000000000 R12: ffff8a4022bf4478
[189204.820058] R13: ffff8a4022bf4000 R14: 0000000000000001 R15: ffff8a4028be6880
[189204.820062] FS:  0000000000000000(0000) GS:ffff8a402f1c0000(0000) knlGS:0000000000000000
[189204.820065] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[189204.820068] CR2: 00007f4e4fc93728 CR3: 00000002ea20a005 CR4: 00000000003626e0
[189204.820072] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[189204.820074] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[189204.820077] Call Trace:
[189204.820081]  <IRQ>
[189204.820089]  ? qdisc_reset+0xe0/0xe0
[189204.820092]  ? qdisc_reset+0xe0/0xe0
[189204.820099]  call_timer_fn+0x2b/0x150
[189204.820105]  ? qdisc_reset+0xe0/0xe0
[189204.820108]  expire_timers+0x99/0x110
[189204.820112]  run_timer_softirq+0x8a/0x160
[189204.820117]  ? sched_clock+0x5/0x10
[189204.820121]  ? sched_clock_cpu+0xe/0xd0
[189204.820129]  __do_softirq+0x10d/0x30d
[189204.820135]  irq_exit+0xd9/0xf0
[189204.820139]  smp_apic_timer_interrupt+0x87/0x170
[189204.820143]  apic_timer_interrupt+0xf/0x20
[189204.820145]  </IRQ>
[189204.820150] RIP: 0010:cpuidle_enter_state+0xb7/0x2e0
[189204.820153] Code: e8 3e ef ad ff 80 7c 24 03 00 74 17 9c 58 0f 1f 44 00 00 f6 c4 02 0f 85 fb 01 00 00 31 ff e8 90 e2 b3 ff fb 66 0f 1f 44 00 00 <48> b8 ff ff ff ff f3 01 00 00 4c 29 f3 ba ff ff ff 7f 48 39 c3 7f 
[189204.820217] RSP: 0018:ffffb3a0019e7e98 EFLAGS: 00000246 ORIG_RAX: ffffffffffffff13
[189204.820222] RAX: ffff8a402f1c0000 RBX: 0000ac14ae123454 RCX: 000000000000001f
[189204.820224] RDX: 0000ac14ae123454 RSI: ffffffff92c8051e RDI: ffffffff92c884ed
[189204.820226] RBP: 0000000000000004 R08: 00017aa39e0e7d42 R09: 00000000000282b4
[189204.820228] R10: 0000000000035538 R11: ffff8a402f1e0a68 R12: ffff8a402f1ebe38
[189204.820230] R13: ffffffff92eaecb8 R14: 0000ac14ad791e7c R15: 0000000000000000
[189204.820235]  ? cpuidle_enter_state+0x92/0x2e0
[189204.820241]  do_idle+0x20a/0x240
[189204.820247]  cpu_startup_entry+0x6f/0x80
[189204.820253]  start_secondary+0x1aa/0x200
[189204.820258]  secondary_startup_64+0xa5/0xb0
[189204.820263] ---[ end trace a38dbf4576291580 ]---


This bug is also present on MSI X99A Tomahawk on kernel 4.18.5

Note You need to log in before you can comment on or make changes to this bug.