Bug 197279

Summary: iwlwifi: 7265/8260: sporadic BAD COMMAND after connection loss - WIFI-5909
Product: Drivers Reporter: Dan Aloni (alonid)
Component: network-wirelessAssignee: DO NOT USE - assign "network-wireless-intel" component instead (linuxwifi)
Status: CLOSED CODE_FIX    
Severity: normal CC: alonid, christopher, josh.fryman, kernel, luca, rstachowiak, weber.aulendorf
Priority: P1    
Hardware: Intel   
OS: Linux   
See Also: https://bugzilla.kernel.org/show_bug.cgi?id=197281
https://bugzilla.kernel.org/show_bug.cgi?id=198017
Kernel Version: 4.13.5-100.fc25.x86_64 Subsystem:
Regression: No Bisected commit-id:
Attachments: dmesg
dmesg with kernel 4.13.5-100.fc25.x86_64 and firmware 31.532993.0
8260 firmware with debug enabled
Firmware dump 1
dmesg related to fimware dump 1

Description Dan Aloni 2017-10-14 11:21:12 UTC
I am able using the built-in Wifi on an Intel NUC, but it occasionally crashes.

01:00.0 Network controller: Intel Corporation Wireless 8260 (rev 3a)

See dmesg of a typical 9-day boot session.
Comment 1 Dan Aloni 2017-10-14 11:22:01 UTC
Created attachment 258823 [details]
dmesg
Comment 2 Dan Aloni 2017-10-14 11:23:55 UTC
If you need more info, I've read 

    https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging

and may be able to supply it.
Comment 3 Emmanuel Grumbach 2017-10-14 16:45:29 UTC
*** Bug 197277 has been marked as a duplicate of this bug. ***
Comment 4 Emmanuel Grumbach 2017-10-14 16:52:07 UTC
Please move to 4.13 and use -31.ucode.
Comment 5 Dan Aloni 2017-10-14 17:07:04 UTC
Created attachment 258829 [details]
dmesg with kernel 4.13.5-100.fc25.x86_64 and firmware 31.532993.0
Comment 6 Dan Aloni 2017-10-14 17:08:42 UTC
Thanks for the quick response. I've upgraded the system a few hours ago and the problem reproduced with the new versions - I think those are the requested ones. 

See the attached dmesg.
Comment 7 Emmanuel Grumbach 2017-10-14 17:21:11 UTC
This error is very surprising and not related to the previous one.

Easy to reproduce?
If yes, I'll send you a debug firmware tomorrow.
Comment 8 Dan Aloni 2017-10-14 17:25:22 UTC
Yes, it happens every few hours.

Sure, send over a debug firmware, I'll get it running.
Comment 9 Emmanuel Grumbach 2017-10-15 07:40:59 UTC
Created attachment 258831 [details]
8260 firmware with debug enabled

Hi,

Here is the firmware for 8260. Please copy it to /lib/firmware (I assume your distro takes it from there), you can backup the existing one before you overriding it.

The instructions on how to collect the data are here:
https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging#firmware_debugging

Please take the time to read the privacy notice:
https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging#privacy_aspects

In your case, it'd be nice to get tracing as well.
Comment 10 Dan Aloni 2017-10-15 21:42:41 UTC
Created attachment 258849 [details]
Firmware dump 1
Comment 11 Dan Aloni 2017-10-15 21:43:13 UTC
Created attachment 258851 [details]
dmesg related to fimware dump 1
Comment 12 Dan Aloni 2017-10-15 21:43:52 UTC
Here's a firmware dump, hope it helps for the meanwhile. 

I'll try to get tracing too.
Comment 13 Emmanuel Grumbach 2017-10-16 06:12:35 UTC
Weird.

This log teaches me that we get this after we send command 0x28. Thing is that when I look at the Tx FIFO I can see the command that you send, looks fine. Right format. Then, I can see a command that comes immediately afterwards in the Tx FIFO and that command is bad. Not sure how the HW put that there. I doubt the driver sent that. So, I really need tracing and a firmware dump of the same run.

Thanks.
Comment 14 Emmanuel Grumbach 2017-10-16 06:31:04 UTC
A few more minutes on the dump taught me that the driver hasn't sent any new command. This is clearly a firmware bug. routing this bug to the right person.

No more data needed at this stage.
Comment 15 Emmanuel Grumbach 2017-11-03 04:23:45 UTC
*** Bug 197729 has been marked as a duplicate of this bug. ***
Comment 16 Luca Coelho 2017-11-13 08:13:54 UTC
*** Bug 197281 has been marked as a duplicate of this bug. ***
Comment 17 Luca Coelho 2017-11-17 09:15:25 UTC
Just as an update, this issue is currently being handled by our firmware team.  I'll report back when we have more news.
Comment 18 Luca Coelho 2017-11-17 09:20:16 UTC
*** Bug 197373 has been marked as a duplicate of this bug. ***
Comment 19 Emmanuel Grumbach 2017-11-28 21:56:41 UTC
*** Bug 198017 has been marked as a duplicate of this bug. ***
Comment 20 Luca Coelho 2018-03-07 11:30:33 UTC
We have a fix in the firmware which we will release soon.