Bug 194793

Summary: iwlwifi: 3160: ASSERT 102C in AP mode - WIFILNX-696
Product: Drivers Reporter: Jarek Kamiński (jarek)
Component: network-wirelessAssignee: DO NOT USE - assign "network-wireless-intel" component instead (linuxwifi)
Status: CLOSED WILL_NOT_FIX    
Severity: normal CC: linuxwifi
Priority: P1    
Hardware: All   
OS: Linux   
See Also: https://bugzilla.kernel.org/show_bug.cgi?id=194713
Kernel Version: 4.9.2 Subsystem:
Regression: Yes Bisected commit-id:
Attachments: crashing firmware dmesg
-17.ucode with DBGM
firmware crashdumps
fix candidate for 102C
W/A for 102C
add more data to 102C assert
102C assertion crashdumps and dmesg

Description Jarek Kamiński 2017-03-05 15:32:10 UTC
Created attachment 255085 [details]
crashing firmware dmesg

Hello,

This is a follow-up to https://bugzilla.kernel.org/show_bug.cgi?id=194713.

I have an accesspoint running on Intel 3160 (ZBOX RI-323). It was running just fine on 3.16 from Debian, but I recently tried upgrading it to 4.9 and noticed problems with dropping client connectivity. One of the problems was fixed in 194713, but after applying the patch posted there, another problem appeared. After running the kernel (with the 194713 patch applied) for some time, the following errors started to appear:
#v+
mar 02 23:57:39 Wintermute kernel: iwlwifi 0000:01:00.0: Microcode SW error detected.  Restarting 0x2000000.
[...]
mar 02 23:57:42 Wintermute kernel: iwlwifi 0000:01:00.0: RX BA Session failed stopping, status 0x8
mar 02 23:57:42 Wintermute kernel: wlan0.2: HW problem - can not stop rx aggregation for xx:xx:xx:xx:xx:xx tid 0
#v-

All stations dropped eventually (iw station dump showing nothing) and weren't able to reconnect.

Please find the full dmesg attached (Debian kernel 4.9.2-2~bpo8+1 + 194713). If that would help, another dmesg illustrating the same issue on 3.16 + patched backports tree is in https://bugzilla.kernel.org/attachment.cgi?id=254991.

I plan to try with kernel versions between 3.16 and 4.9, but as the issue needs some time to display, bisecting might take some time as well.
Comment 1 Emmanuel Grumbach 2017-03-05 17:58:31 UTC
FWIW what's really defining the issue here is:

lut 28 16:46:43 Wintermute kernel: iwlwifi 0000:01:00.0: Loaded firmware version: 17.459231.0
lut 28 16:46:43 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000034 | NMI_INTERRUPT_WDG    

So, to sum up:
3.16 - GOOD
3.16 + bacport - BAD
4.9 - BAD

Can you please paste the output of the ethtool -i <iface name> in 3.16 without backport (GOOD case above)?

I'd like to know the firmware version you are running there.
I am pretty sure you run a super old version of the firmware in the GOOD case and that the regression really comes for the firmware.
Note that not all the old versions of the firmware are supported by the latest driver. So it is not trivial to rollback the firmware on a new driver.
Comment 2 Emmanuel Grumbach 2017-03-05 18:07:21 UTC
I opened a ticket on the firmware team, but I am quite pessimistic on the chances to get a fix. This device is not really widespread and even much less in AP mode...

I'd still be interested to get the data asked in my previous comment.
Comment 3 Jarek Kamiński 2017-03-05 20:17:14 UTC
Hello,

Yes, 3.16 is OK while 3.16+backport and 4.9 not.

ethtool -i wlan0 output:
#v+
driver: iwlwifi
version: 3.16.0-4-amd64
firmware-version: 25.228.9.0
bus-info: 0000:01:00.0
supports-statistics: yes
supports-test: no
supports-eeprom-access: no
supports-register-dump: no
supports-priv-flags: no
#v-
The driver is loading firmware from the iwlwifi-3160-9.ucode file.

Thank you for both bringing it to the right team and your explanation about cause of problem and chances for the fix — that will allow me to plan accordingly. Also, many thanks for all your help with this so far.
Comment 4 Emmanuel Grumbach 2017-03-05 21:01:13 UTC
Created attachment 255089 [details]
-17.ucode with DBGM

This firmware has debug enabled. Please refer to [1] to see how to collect the firmware dump that will be automatically created when the firmware will crash.

Please take the time to read the note about privacy [2].

[1] https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging#firmware_debugging

[2] https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging#privacy_aspects
Comment 5 Jarek Kamiński 2017-03-08 23:16:15 UTC
Created attachment 255147 [details]
firmware crashdumps

Hello,

Please find the crashdumps attached, encrypted with the GPG keys listed in https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging#privacy_aspects.
Comment 6 Emmanuel Grumbach 2017-03-09 07:18:11 UTC
Hello,

unfortunately, there are many different asserts and many occurrences of them :(

I can see here:
 * NMI_INTERRUPT_WDG
 * 102C
 * 2B08

102C seems to be the more frequent one, so let's start with that one.
For that one, I don't need firmware dump, but tracing. Please record tracing:

sudo trace-cmd record -e iwlwifi -e mac80211 -e cfg80211 -e iwlwifi_msg

Thanks.
Comment 7 Jarek Kamiński 2017-03-17 23:54:16 UTC
Hello,

Please find the trace here: http://straylight.freeside.be/~jarek/trace-iwlwifi.tar.gpg, 110M after compression.

Sorry for the delay, I messed up something with my environment and wasn't initially able to reproduce + I was travelling in the recent days.
Comment 8 Emmanuel Grumbach 2017-03-18 20:16:57 UTC
Unfortunately, the files you attached were unusable.

You need to stop the tracing recording with control+C and then wait until trace-cmd gathers all the data into one single trace.dat file.
Comment 9 Jarek Kamiński 2017-03-27 19:19:56 UTC
Ouch, sorry! Please find the new trace: http://straylight.freeside.be/~jarek/trace.dat.gpg, it's also smaller. trace-cmd report outputs something meaningful.
Comment 10 Emmanuel Grumbach 2017-03-27 19:31:46 UTC
I am super busy these days. But I hope I'll get some time to look at this soon.
Comment 11 Jarek Kamiński 2017-03-27 19:39:37 UTC
Sure, thanks!
Comment 12 Emmanuel Grumbach 2017-03-28 13:33:45 UTC
Created attachment 255605 [details]
fix candidate for 102C

Please try the patch attached.

This may fix the 102C Assert. It won't fix any other bugs, but I'd like to know if with this, we get the rid of 102C.

thanks.
Comment 13 Emmanuel Grumbach 2017-03-28 13:48:42 UTC
Created attachment 255607 [details]
W/A for 102C

Hi,

can you also test the firmware attached? (not with the patch I attached earlier)

thanks
Comment 14 Jarek Kamiński 2017-04-02 16:52:17 UTC
Hello,

I tried the patch, unfortunately I still got 102C:
#v+
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: Microcode SW error detected.  Restarting 0x82000000.
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: Start IWL Error Log Dump:
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: Status: 0x00000000, count: 6
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: Loaded firmware version: 17.352738.0
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0000102C | ADVANCED_SYSASSERT
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x000002A0 | trm_hw_status0
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | trm_hw_status1
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000B30 | branchlink2
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x000148E0 | interruptlink1
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | interruptlink2
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0xDEADBEEF | data1
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0xDEADBEEF | data2
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0xDEADBEEF | data3
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0CC17553 | beacon time
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x008E3AC6 | tsf low
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | tsf hi
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | time gp1
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x008E3AC7 | time gp2
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | uCode revision type
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000011 | uCode version major
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x000561E2 | uCode version minor
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000164 | hw version
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00C89204 | board version
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0B02001C | hcmd
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00022008 | isr0
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | isr1
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0000000A | isr2
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x004020C1 | isr3
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | isr4
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00010110 | last cmd Id
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | wait_event
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00004188 | l2p_control
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00018020 | l2p_duration
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x000003BF | l2p_mhvalid
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00009100 | l2p_addr_match
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000005 | lmpm_pmg_sel
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x15061432 | timestamp
kwi 01 18:31:46 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0000C8D8 | flow_handler
kwi 01 18:31:46 Wintermute kernel: ieee80211 phy0: Hardware restart was requested
#v-

I was trying the patch on top of 99e9672847bbfb470878978c46902486a9cd403d on 3.16. I'm going to try the firmware now.
Comment 15 Jarek Kamiński 2017-04-09 20:38:51 UTC
Hello again,

I was running the attached firmware for several days (with interruptions). I didn't observe a single 102C during that time :-)

I observed other issues and the connection was still breaking, but that was expected. The most common firmware crashes were 0x00000034 NMI_INTERRUPT_WDG and 0x00002B08 ADVANCED_SYSASSERT (I checked previous logs and it appeared just 3 times, but became more common now).

The samples of all issues I found in my dmesgs are below:
#v+
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: RX BA Session failed stopping, status 0x8
kwi 09 15:41:47 Wintermute kernel: wlan0.2: HW problem - can not stop rx aggregation for 50:46:5d:7b:d2:8b tid 0

kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: Microcode SW error detected.  Restarting 0x2000000.
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: Start IWL Error Log Dump:
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: Status: 0x00000000, count: 6
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: Loaded firmware version: 17.490063.0
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000034 | NMI_INTERRUPT_WDG           
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x05B00222 | trm_hw_status0
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | trm_hw_status1
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000B30 | branchlink2
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00014904 | interruptlink1
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x000057AC | interruptlink2
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | data1
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000002 | data2
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x07030000 | data3
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x36C0014D | beacon time
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x07D7D9E0 | tsf low
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | tsf hi
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | time gp1
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x07D7D9E1 | time gp2
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | uCode revision type
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000011 | uCode version major
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00077A4F | uCode version minor
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000164 | hw version
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00C89204 | board version
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0748001C | hcmd
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x26223808 | isr0
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0101E000 | isr1
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0000000A | isr2
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x404164C1 | isr3
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000081 | isr4
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x10C00112 | last cmd Id
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | wait_event
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x000001C8 | l2p_control
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00019430 | l2p_duration
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x000000BF | l2p_mhvalid
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0000DB00 | l2p_addr_match
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000005 | lmpm_pmg_sel
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x05032018 | timestamp
kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0034F000 | flow_handler
kwi 09 15:41:47 Wintermute kernel: ieee80211 phy0: Hardware restart was requested

kwi 09 15:41:47 Wintermute kernel: iwlwifi 0000:01:00.0: L1 Disabled - LTR Enabled

kwi 09 15:49:41 Wintermute kernel: iwlwifi 0000:01:00.0: Couldn't drain frames for staid 6

kwi 09 15:52:55 Wintermute kernel: iwlwifi 0000:01:00.0: RX BA Session failed starting, status 0x8
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: Queue 11 is active on fifo 3 and stuck for 10000 ms. SW [86, 95] HW [86, 95] FH TRB=0x08030900f
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: Microcode SW error detected.  Restarting 0x2000000.
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: Start IWL Error Log Dump:
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: Status: 0x00000000, count: 6
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: Loaded firmware version: 17.490063.0
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000084 | NMI_INTERRUPT_UNKNOWN       
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00A00220 | trm_hw_status0
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | trm_hw_status1
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000B30 | branchlink2
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00014904 | interruptlink1
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00014904 | interruptlink2
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | data1
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000080 | data2
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x07030000 | data3
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x920116A8 | beacon time
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0A0F697F | tsf low
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | tsf hi
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | time gp1
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0A0F6980 | time gp2
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | uCode revision type
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000011 | uCode version major
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00077A4F | uCode version minor
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000164 | hw version
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00C89204 | board version
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00EB011C | hcmd
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x24022020 | isr0
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x01000000 | isr1
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0000000A | isr2
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x004174C0 | isr3
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | isr4
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000110 | last cmd Id
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | wait_event
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x000009C8 | l2p_control
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00018030 | l2p_duration
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x000000BF | l2p_mhvalid
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0000DB00 | l2p_addr_match
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000005 | lmpm_pmg_sel
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x05032018 | timestamp
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0034B8C8 | flow_handler
kwi 09 15:53:10 Wintermute kernel: ieee80211 phy0: Hardware restart was requested
kwi 09 15:53:10 Wintermute kernel: iwlwifi 0000:01:00.0: Failing on timeout while stopping DMA channel 8 [0x07df0001]

kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: Microcode SW error detected.  Restarting 0x82000000.
kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: Start IWL Error Log Dump:
kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: Status: 0x00000000, count: 6
kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: Loaded firmware version: 17.490063.0
kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00002B08 | ADVANCED_SYSASSERT          
kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00A00220 | trm_hw_status0
kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | trm_hw_status1
kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000B30 | branchlink2
kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00014904 | interruptlink1
kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | interruptlink2
kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: 0xDEADBEEF | data1
kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: 0xDEADBEEF | data2
kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: 0xDEADBEEF | data3
kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: 0x7AC13DCF | beacon time
kwi 09 15:55:49 Wintermute kernel: iwlwifi 0000:01:00.0: 0x097E023B | tsf low
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | tsf hi
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | time gp1
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x097E023C | time gp2
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | uCode revision type
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000011 | uCode version major
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00077A4F | uCode version minor
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000164 | hw version
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00C89204 | board version
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0C08001C | hcmd
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x24022008 | isr0
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x01000000 | isr1
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0000000A | isr2
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x004120C0 | isr3
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | isr4
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00010110 | last cmd Id
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000000 | wait_event
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000148 | l2p_control
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00018030 | l2p_duration
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0000003F | l2p_mhvalid
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0000DB00 | l2p_addr_match
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x00000005 | lmpm_pmg_sel
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x05032018 | timestamp
kwi 09 15:55:50 Wintermute kernel: iwlwifi 0000:01:00.0: 0x0000F000 | flow_handler
kwi 09 15:55:50 Wintermute kernel: ieee80211 phy0: Hardware restart was requested

kwi 09 16:32:27 Wintermute kernel: iwlwifi 0000:01:00.0: invalid BA notification: Q 10, tid 0
#v-
Comment 16 Emmanuel Grumbach 2017-04-20 08:01:38 UTC
Created attachment 255933 [details]
add more data to 102C assert

I had removed assert 102c just to see what it will cause, so this is not really a progress.

I am now attaching a firmware with 102c but with more data in case it happens.
Please run with this firmware.
Comment 17 Jarek Kamiński 2017-04-23 22:23:48 UTC
Created attachment 255959 [details]
102C assertion crashdumps and dmesg

Hello,

I'm attaching the dmesg and crashdumps from today's session. A single 102C was caught, the corresponding crashdump should be in 2017-04-23_15-00-25.dump, but I'm attaching all others as well in case they might be useful for anything.

BTW: I'm starting a travel and won't be available before 3.05.
Comment 18 Emmanuel Grumbach 2017-04-24 05:26:09 UTC
I looked at the data, are you sure you used the firmware I attached?

I am getting more and more confidence that there is nothing I can do on the driver level, and I know that the relevant firmware team is very busy.

Do you have another Intel WiFi device that you could try?
7260 is really the only other option that supports Mini PCI.
Comment 19 Jarek Kamiński 2017-04-24 16:30:09 UTC
Hello,

I just verified sha1sum
abd9463de9b2917226f864ff741cd636278d53f8  /lib/firmware/iwlwifi-3160-17.ucode
so yes, the right version.

I'm afraid I don't have any other Intel card.

I understand the options are limited, nevertheless thank you for all the effort you put on this so far, it's really appreciated.
Comment 20 Emmanuel Grumbach 2017-05-24 07:53:30 UTC
so I am afraid I'll close this bug as will not fix.
I can't see any fix coming from the firmware team for this.

Sorry.