Bug 203929

Summary: iwlwifi 8265 microcode SW error detected, while doing large-ish upload
Product: Drivers Reporter: Steven Noonan (steven)
Component: network-wirelessAssignee: DO NOT USE - assign "network-wireless-intel" component instead (linuxwifi)
Status: CLOSED DUPLICATE    
Severity: high CC: andersk
Priority: P1    
Hardware: Intel   
OS: Linux   
Kernel Version: 5.1.9 Subsystem:
Regression: No Bisected commit-id:

Description Steven Noonan 2019-06-19 07:04:18 UTC
This happens reliably when doing large outbound TCP transfers on an Intel 8265 chip in a ThinkPad. This is triggerable by doing speedtest.net on this machine... It breaks shortly after starting the upload test on there:

[320589.350842] sandestin kernel: iwlwifi 0000:04:00.0: Microcode SW error detected.  Restarting 0x2000000.
[320589.351073] sandestin kernel: iwlwifi 0000:04:00.0: Start IWL Error Log Dump:
[320589.351077] sandestin kernel: iwlwifi 0000:04:00.0: Status: 0x00000100, count: 6
[320589.351079] sandestin kernel: iwlwifi 0000:04:00.0: Loaded firmware version: 36.9f0a2d68.0
[320589.351081] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000EDC | ADVANCED_SYSASSERT          
[320589.351082] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000280 | trm_hw_status0
[320589.351084] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | trm_hw_status1
[320589.351085] sandestin kernel: iwlwifi 0000:04:00.0: 0x0002485C | branchlink2
[320589.351087] sandestin kernel: iwlwifi 0000:04:00.0: 0x0003A7CA | interruptlink1
[320589.351088] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | interruptlink2
[320589.351090] sandestin kernel: iwlwifi 0000:04:00.0: 0x0B6C001C | data1
[320589.351091] sandestin kernel: iwlwifi 0000:04:00.0: 0x20000292 | data2
[320589.351093] sandestin kernel: iwlwifi 0000:04:00.0: 0x00001E1C | data3
[320589.351094] sandestin kernel: iwlwifi 0000:04:00.0: 0x5280A26B | beacon time
[320589.351095] sandestin kernel: iwlwifi 0000:04:00.0: 0x5BD92D96 | tsf low
[320589.351097] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000004 | tsf hi
[320589.351098] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | time gp1
[320589.351100] sandestin kernel: iwlwifi 0000:04:00.0: 0x02639C28 | time gp2
[320589.351102] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000001 | uCode revision type
[320589.351103] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000024 | uCode version major
[320589.351105] sandestin kernel: iwlwifi 0000:04:00.0: 0x9F0A2D68 | uCode version minor
[320589.351106] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000230 | hw version
[320589.351108] sandestin kernel: iwlwifi 0000:04:00.0: 0x18C89000 | board version
[320589.351109] sandestin kernel: iwlwifi 0000:04:00.0: 0x20000292 | hcmd
[320589.351110] sandestin kernel: iwlwifi 0000:04:00.0: 0x20022002 | isr0
[320589.351112] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | isr1
[320589.351114] sandestin kernel: iwlwifi 0000:04:00.0: 0x0800180A | isr2
[320589.351115] sandestin kernel: iwlwifi 0000:04:00.0: 0x00413CC0 | isr3
[320589.351117] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | isr4
[320589.351118] sandestin kernel: iwlwifi 0000:04:00.0: 0x0B33001C | last cmd Id
[320589.351120] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | wait_event
[320589.351121] sandestin kernel: iwlwifi 0000:04:00.0: 0x00004288 | l2p_control
[320589.351123] sandestin kernel: iwlwifi 0000:04:00.0: 0x00018024 | l2p_duration
[320589.351124] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | l2p_mhvalid
[320589.351125] sandestin kernel: iwlwifi 0000:04:00.0: 0x000000E7 | l2p_addr_match
[320589.351127] sandestin kernel: iwlwifi 0000:04:00.0: 0x0000000D | lmpm_pmg_sel
[320589.351128] sandestin kernel: iwlwifi 0000:04:00.0: 0x04120134 | timestamp
[320589.351130] sandestin kernel: iwlwifi 0000:04:00.0: 0x0000D0E8 | flow_handler
[320589.351218] sandestin kernel: iwlwifi 0000:04:00.0: Start IWL Error Log Dump:
[320589.351221] sandestin kernel: iwlwifi 0000:04:00.0: Status: 0x00000100, count: 7
[320589.351223] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000070 | NMI_INTERRUPT_LMAC_FATAL
[320589.351225] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | umac branchlink1
[320589.351227] sandestin kernel: iwlwifi 0000:04:00.0: 0xC00868A4 | umac branchlink2
[320589.351228] sandestin kernel: iwlwifi 0000:04:00.0: 0xC0083A94 | umac interruptlink1
[320589.351229] sandestin kernel: iwlwifi 0000:04:00.0: 0xC0083A94 | umac interruptlink2
[320589.351231] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000800 | umac data1
[320589.351232] sandestin kernel: iwlwifi 0000:04:00.0: 0xC0083A94 | umac data2
[320589.351233] sandestin kernel: iwlwifi 0000:04:00.0: 0xDEADBEEF | umac data3
[320589.351235] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000024 | umac major
[320589.351236] sandestin kernel: iwlwifi 0000:04:00.0: 0x9F0A2D68 | umac minor
[320589.351238] sandestin kernel: iwlwifi 0000:04:00.0: 0xC088628C | frame pointer
[320589.351239] sandestin kernel: iwlwifi 0000:04:00.0: 0xC088628C | stack pointer
[320589.351240] sandestin kernel: iwlwifi 0000:04:00.0: 0x00E2019C | last host cmd
[320589.351242] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | isr status reg
[320589.351246] sandestin kernel: ieee80211 phy2: Hardware restart was requested
[320589.881215] sandestin kernel: iwlwifi 0000:04:00.0: Failing on timeout while stopping DMA channel 8 [0x07fd0001]
[320591.966731] sandestin wpa_supplicant[668]: wlp4s0: CTRL-EVENT-REGDOM-CHANGE init=DRIVER type=WORLD
[320590.151350] sandestin kernel: iwlwifi 0000:04:00.0: Microcode SW error detected.  Restarting 0x2000000.
[320590.151486] sandestin kernel: iwlwifi 0000:04:00.0: Start IWL Error Log Dump:
[320590.151489] sandestin kernel: iwlwifi 0000:04:00.0: Status: 0x00000100, count: 6
[320590.151490] sandestin kernel: iwlwifi 0000:04:00.0: Loaded firmware version: 36.9f0a2d68.0
[320590.151492] sandestin kernel: iwlwifi 0000:04:00.0: 0x0000105C | ADVANCED_SYSASSERT          
[320590.151494] sandestin kernel: iwlwifi 0000:04:00.0: 0x008002A0 | trm_hw_status0
[320590.151496] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | trm_hw_status1
[320590.151497] sandestin kernel: iwlwifi 0000:04:00.0: 0x0002485C | branchlink2
[320590.151498] sandestin kernel: iwlwifi 0000:04:00.0: 0x0003A7CA | interruptlink1
[320590.151500] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | interruptlink2
[320590.151501] sandestin kernel: iwlwifi 0000:04:00.0: 0xDEADBEEF | data1
[320590.151503] sandestin kernel: iwlwifi 0000:04:00.0: 0xDEADBEEF | data2
[320590.151504] sandestin kernel: iwlwifi 0000:04:00.0: 0xDEADBEEF | data3
[320590.151505] sandestin kernel: iwlwifi 0000:04:00.0: 0x000030F1 | beacon time
[320590.151507] sandestin kernel: iwlwifi 0000:04:00.0: 0x5976AD49 | tsf low
[320590.151508] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000005 | tsf hi
[320590.151510] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | time gp1
[320590.151511] sandestin kernel: iwlwifi 0000:04:00.0: 0x00011E7F | time gp2
[320590.151512] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000001 | uCode revision type
[320590.151514] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000024 | uCode version major
[320590.151515] sandestin kernel: iwlwifi 0000:04:00.0: 0x9F0A2D68 | uCode version minor
[320590.151517] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000230 | hw version
[320590.151518] sandestin kernel: iwlwifi 0000:04:00.0: 0x18C89000 | board version
[320590.151520] sandestin kernel: iwlwifi 0000:04:00.0: 0x0BF1001C | hcmd
[320590.151521] sandestin kernel: iwlwifi 0000:04:00.0: 0x2002200A | isr0
[320590.151522] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | isr1
[320590.151524] sandestin kernel: iwlwifi 0000:04:00.0: 0x0800180A | isr2
[320590.151525] sandestin kernel: iwlwifi 0000:04:00.0: 0x004168C1 | isr3
[320590.151526] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | isr4
[320590.151528] sandestin kernel: iwlwifi 0000:04:00.0: 0x0502001C | last cmd Id
[320590.151529] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | wait_event
[320590.151531] sandestin kernel: iwlwifi 0000:04:00.0: 0x00004288 | l2p_control
[320590.151532] sandestin kernel: iwlwifi 0000:04:00.0: 0x00018020 | l2p_duration
[320590.151533] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | l2p_mhvalid
[320590.151535] sandestin kernel: iwlwifi 0000:04:00.0: 0x000000E7 | l2p_addr_match
[320590.151536] sandestin kernel: iwlwifi 0000:04:00.0: 0x0000000D | lmpm_pmg_sel
[320590.151538] sandestin kernel: iwlwifi 0000:04:00.0: 0x04120134 | timestamp
[320590.151539] sandestin kernel: iwlwifi 0000:04:00.0: 0x00005060 | flow_handler
[320590.151612] sandestin kernel: iwlwifi 0000:04:00.0: Start IWL Error Log Dump:
[320590.151613] sandestin kernel: iwlwifi 0000:04:00.0: Status: 0x00000100, count: 7
[320590.151615] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000070 | NMI_INTERRUPT_LMAC_FATAL
[320590.151617] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | umac branchlink1
[320590.151618] sandestin kernel: iwlwifi 0000:04:00.0: 0xC00868A4 | umac branchlink2
[320590.151619] sandestin kernel: iwlwifi 0000:04:00.0: 0xC0083A94 | umac interruptlink1
[320590.151621] sandestin kernel: iwlwifi 0000:04:00.0: 0xC0083A94 | umac interruptlink2
[320590.151622] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000800 | umac data1
[320590.151624] sandestin kernel: iwlwifi 0000:04:00.0: 0xC0083A94 | umac data2
[320590.151625] sandestin kernel: iwlwifi 0000:04:00.0: 0xDEADBEEF | umac data3
[320590.151626] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000024 | umac major
[320590.151628] sandestin kernel: iwlwifi 0000:04:00.0: 0x9F0A2D68 | umac minor
[320590.151629] sandestin kernel: iwlwifi 0000:04:00.0: 0xC088628C | frame pointer
[320590.151631] sandestin kernel: iwlwifi 0000:04:00.0: 0xC088628C | stack pointer
[320590.151632] sandestin kernel: iwlwifi 0000:04:00.0: 0x0051012C | last host cmd
[320590.151633] sandestin kernel: iwlwifi 0000:04:00.0: 0x00000000 | isr status reg
[320590.151637] sandestin kernel: ieee80211 phy2: Hardware restart was requested
[320590.679896] sandestin kernel: iwlwifi 0000:04:00.0: Failing on timeout while stopping DMA channel 8 [0x07fd0003]
[320592.763144] sandestin wpa_supplicant[668]: wlp4s0: CTRL-EVENT-REGDOM-CHANGE init=DRIVER type=WORLD
[320592.848739] sandestin wpa_supplicant[668]: wlp4s0: CTRL-EVENT-SIGNAL-CHANGE above=1 signal=-61 noise=9999 txrate=866700


Once this happens my machine can't communicate with anything on the network, and I have to 'rmmod iwlmvm', 'rmmod iwlwifi', 'modprobe iwlwifi' to get it back.

I'm not sure what other data is relevant here... Here's some details about the device?

[320664.914069] sandestin kernel: iwlwifi 0000:04:00.0: loaded firmware version 36.9f0a2d68.0 op_mode iwlmvm
[320664.922304] sandestin kernel: iwlwifi 0000:04:00.0: Detected Intel(R) Dual Band Wireless AC 8265, REV=0x230

$ lspci -nn
00:00.0 Host bridge [0600]: Intel Corporation Xeon E3-1200 v6/7th Gen Core Processor Host Bridge/DRAM Registers [8086:5904] (rev 02)
00:02.0 VGA compatible controller [0300]: Intel Corporation HD Graphics 620 [8086:5916] (rev 02)
00:14.0 USB controller [0c03]: Intel Corporation Sunrise Point-LP USB 3.0 xHCI Controller [8086:9d2f] (rev 21)
00:14.2 Signal processing controller [1180]: Intel Corporation Sunrise Point-LP Thermal subsystem [8086:9d31] (rev 21)
00:16.0 Communication controller [0780]: Intel Corporation Sunrise Point-LP CSME HECI #1 [8086:9d3a] (rev 21)
00:1c.0 PCI bridge [0604]: Intel Corporation Sunrise Point-LP PCI Express Root Port #1 [8086:9d10] (rev f1)
00:1c.6 PCI bridge [0604]: Intel Corporation Sunrise Point-LP PCI Express Root Port #7 [8086:9d16] (rev f1)
00:1d.0 PCI bridge [0604]: Intel Corporation Sunrise Point-LP PCI Express Root Port #9 [8086:9d18] (rev f1)
00:1d.2 PCI bridge [0604]: Intel Corporation Sunrise Point-LP PCI Express Root Port #11 [8086:9d1a] (rev f1)
00:1f.0 ISA bridge [0601]: Intel Corporation Sunrise Point LPC Controller/eSPI Controller [8086:9d4e] (rev 21)
00:1f.2 Memory controller [0580]: Intel Corporation Sunrise Point-LP PMC [8086:9d21] (rev 21)
00:1f.3 Audio device [0403]: Intel Corporation Sunrise Point-LP HD Audio [8086:9d71] (rev 21)
00:1f.4 SMBus [0c05]: Intel Corporation Sunrise Point-LP SMBus [8086:9d23] (rev 21)
00:1f.6 Ethernet controller [0200]: Intel Corporation Ethernet Connection (4) I219-LM [8086:15d7] (rev 21)
04:00.0 Network controller [0280]: Intel Corporation Wireless 8265 / 8275 [8086:24fd] (rev 78)
3e:00.0 Non-Volatile memory controller [0108]: Samsung Electronics Co Ltd NVMe SSD Controller SM981/PM981 [144d:a808]

The network connected to:

# iw dev wlp4s0 link
Connected to xx:xx:xx:2b:57:cc (on wlp4s0)
        SSID: [redacted]
        freq: 5600
        RX: 288377378 bytes (97154 packets)
        TX: 980717 bytes (8065 packets)
        signal: -55 dBm
        rx bitrate: 780.0 MBit/s VHT-MCS 8 80MHz short GI VHT-NSS 2
        tx bitrate: 866.7 MBit/s VHT-MCS 9 80MHz short GI VHT-NSS 2

        bss flags:      short-slot-time
        dtim period:    1
        beacon int:     100
Comment 1 Steven Noonan 2019-06-19 09:16:05 UTC
This happens with firmware 36.8fd77bb3.0 (from https://git.kernel.org/pub/scm/linux/kernel/git/iwlwifi/linux-firmware.git) as well.

However, there is good news: I found that setting the iwlwifi module option swcrypto=1 seems to avoid the problem. Using this option doesn't seem to have any noticeable performance impact, either.
Comment 2 Emmanuel Grumbach 2019-06-19 09:31:07 UTC

*** This bug has been marked as a duplicate of bug 203315 ***