Bug 201053 - iwlwifi: 9260: loop of ASSERT 942
Summary: iwlwifi: 9260: loop of ASSERT 942
Status: CLOSED CODE_FIX
Alias: None
Product: Drivers
Classification: Unclassified
Component: network-wireless (show other bugs)
Hardware: All Linux
: P1 normal
Assignee: DO NOT USE - assign "network-wireless-intel" component instead
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2018-09-08 21:34 UTC by rsmith31415
Modified: 2021-08-31 03:59 UTC (History)
3 users (show)

See Also:
Kernel Version: 4.15.0-33-generic
Subsystem:
Regression: No
Bisected commit-id:


Attachments
dmesg output (28.71 KB, text/plain)
2018-09-08 21:34 UTC, rsmith31415
Details
lshw output (3.35 KB, text/plain)
2018-09-08 21:36 UTC, rsmith31415
Details
dmesg after firmware upgrade (35.94 KB, text/plain)
2018-09-12 18:40 UTC, rsmith31415
Details
make install output (3.39 KB, text/plain)
2018-09-12 18:41 UTC, rsmith31415
Details
firmware dump 1 (671.02 KB, text/plain)
2018-09-13 18:09 UTC, rsmith31415
Details
firmware dump 2 (703.40 KB, text/plain)
2018-09-13 18:11 UTC, rsmith31415
Details
dmesg with mode 802.11g (75.11 KB, text/plain)
2018-09-14 23:01 UTC, rsmith31415
Details
screenshot of wireless configuration page (108.01 KB, text/plain)
2018-09-14 23:02 UTC, rsmith31415
Details
dmesg with mode 802.11g (31.13 KB, text/plain)
2018-09-14 23:04 UTC, rsmith31415
Details

Description rsmith31415 2018-09-08 21:34:01 UTC
Created attachment 278373 [details]
dmesg output

There are small freezes in several Ubuntu distributions (16.04.5 LTS, 18.04) with error [35714.735364] iwlwifi 0000:00:14.3: Microcode SW error detected. Restarting 0x0.

General information:

~ lspci | grep Network
00:14.3 Network controller: Intel Corporation Device a370 (rev 10)

~ ethtool -i wlp0s20f3 | grep firmware
firmware-version: 34.0.0

Linux laptop2 4.15.0-33-generic #36~16.04.1-Ubuntu SMP Wed Aug 15 17:21:05 UTC 2018 x86_64 x86_64 x86_64 GNU/Linux

~ sudo dmidecode -s bios-version
1.3.0

~ lsb_release -a
No LSB modules are available.
Distributor ID: Ubuntu
Description:    Ubuntu 16.04.5 LTS
Release:        16.04
Codename:       xenial


Following the instructions described in https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging, I include encrypted output of dmesg.
Comment 1 rsmith31415 2018-09-08 21:36:12 UTC
Created attachment 278375 [details]
lshw output
Comment 2 Emmanuel Grumbach 2018-09-12 06:17:53 UTC
If you can upgrade your kernel, please do so and use a more recent firmware. If you can't upgrade your kernel, please use our backport tree to get the latest version of the driver (just take the master branch of the backport driver).

Both firmware and driver are available here:
https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/core_release
Comment 3 rsmith31415 2018-09-12 18:39:15 UTC
Thank you for your response.

I tried to upgrade the kernel, but I found that after 4.16, libssl1.1 is required, so I get "linux-headers-x.xx.xx.xxxx-generic depends on libssl1.1 (>= 1.1.0); however: Package libssl1.1 is not installed.". As far as I know, it is not straightforward to install it on Ubuntu 16.04.

So the next option was to install the latest version of the driver. There were some SSL errors after "make install". However, I think it installed correctly, because now I get:

~ ethtool -i wlp0s20f3 | grep firmware
firmware-version: 38.c0e03d94.0

In any case, the errors are still occurring. I attach dmesg.log and output after "make install".
Comment 4 rsmith31415 2018-09-12 18:40:30 UTC
Created attachment 278479 [details]
dmesg after firmware upgrade

dmesg after firmware upgrade
Comment 5 rsmith31415 2018-09-12 18:41:13 UTC
Created attachment 278481 [details]
make install output

Includes SSL errors
Comment 6 Emmanuel Grumbach 2018-09-12 19:07:27 UTC
Do you still see assert 942?

Sorry, my system with the key is shut down right now.
Comment 7 rsmith31415 2018-09-12 19:23:15 UTC
Yes, I still see 0x00000942 | ADVANCED_SYSASSERT. The only difference I see between the old and new dmesg output is that after the crash, the line "0x00000070 | ADVANCED_SYSASSERT" is not showing anymore in firmware 38.c0e03d94.0.

I hope that helps. Let me know if you need more information.
Comment 8 Emmanuel Grumbach 2018-09-12 19:36:04 UTC
Ok, thanks. We'll need to involve the firmware team.
Comment 9 rsmith31415 2018-09-13 07:57:18 UTC
Sure. Thanks for your help.
Comment 10 Emmanuel Grumbach 2018-09-13 08:23:12 UTC
Can you please collect a firmware dump when the error occurs?

Please check here:
https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging#firmware_debugging

Thanks.
Comment 11 Emmanuel Grumbach 2018-09-13 08:26:13 UTC
This bug is related to PHY and antennas.
Can you please tell us what platform do you have? Laptop? If so, what is the model.

Can you try to move the screen of the laptop?

Thanks.
Comment 12 rsmith31415 2018-09-13 18:09:04 UTC
This is a laptop. The model is Dell G7 15 7588 (https://www.dell.com/en-us/work/shop/dell-laptops-and-notebooks/dell-g7-15-gaming-laptop/spd/g-series-15-7588-laptop). 

I moved the screen to different angles but the crashes are still occurring. 

I'm also including a couple of firmware dumps after crashes. I did it manually, so let me now if I should also try to do it automatically with SUBSYSTEM=="devcoredump", ACTION=="add", RUN+="/sbin/iwlfwdump.sh".
Comment 13 rsmith31415 2018-09-13 18:09:47 UTC
Created attachment 278517 [details]
firmware dump 1
Comment 14 rsmith31415 2018-09-13 18:11:47 UTC
Created attachment 278519 [details]
firmware dump 2
Comment 15 Emmanuel Grumbach 2018-09-13 20:33:57 UTC
Thanks.

I'll take a look on Sunday.
Comment 16 rsmith31415 2018-09-14 00:21:55 UTC
Great. Thanks!
Comment 17 rsmith31415 2018-09-14 23:00:17 UTC
An update to this issue: Since you mentioned that this bug was related to PHY and antennas and thought about changing the mode used by the router. It was set by default to 802.11b/g/n, so I changed it to 802.11g and now there is no more crashes, although there are a few messages with iwlwifi 0000:00:14.3: Unhandled alg: 0x71b. 

I include dmesg output and a screenshot of the wireless configuration page in case this is relevant.
Comment 18 rsmith31415 2018-09-14 23:01:08 UTC
Created attachment 278547 [details]
dmesg with mode 802.11g
Comment 19 rsmith31415 2018-09-14 23:02:29 UTC
Created attachment 278549 [details]
screenshot of wireless configuration page
Comment 20 rsmith31415 2018-09-14 23:04:19 UTC
Created attachment 278551 [details]
dmesg with mode 802.11g
Comment 21 Emmanuel Grumbach 2018-09-16 06:05:01 UTC
I have sent the data to the firmware team. That's pretty much all I can do for now. I'll update when I get news from them.
Comment 22 rsmith31415 2018-09-18 01:29:33 UTC
Great. Thanks!
Comment 23 Emmanuel Grumbach 2018-12-02 07:34:10 UTC
Have you tried -41.ucode?
Comment 24 rsmith31415 2018-12-06 01:22:52 UTC
No, I haven't. Is that a new version with potential fix? By the way, I recently changed the mode to 802.11b/g/n but with Channel Width = 20MHz (instead of Auto 20/40MHz) and that also fixes the crashes. 

Let me know if I should try -41.ucode.
Comment 25 Emmanuel Grumbach 2018-12-16 11:23:13 UTC
I just pushed -43.ucode, so it'd be even better to test that one (with our backport driver)
Comment 26 Emmanuel Grumbach 2019-01-03 20:59:16 UTC
So how does it feel with -43.ucode?

Please take it from our Linux firmware repository, mainline also has -43.ucode but it has a bug. I sent a pull request but they haven't pulled yet.
Comment 27 rsmith31415 2019-01-04 02:22:33 UTC
Hi again,

Sorry. I haven't been able to test the update during the holidays. I will give it a try right now.
Comment 28 rsmith31415 2019-01-04 03:02:49 UTC
Uhm... what is the correct repository? I can see this one https://git.kernel.org/pub/scm/linux/kernel/git/iwlwifi/linux-firmware.git with a commit relevant to this issue, but I'm not sure if it contains the latest fix.  In any case, should I install in the same way as suggested here (https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/core_release) - "How to install the driver" ? 

Thanks!
Comment 29 Emmanuel Grumbach 2019-01-04 05:00:57 UTC
Yes to both
Comment 30 rsmith31415 2019-01-04 07:02:06 UTC
Thanks! I changed the WLAN settings back to Auto 20/40 MHz (which was triggering this issue initially) and it is working really well. I didn't notice any freezes and no errors were shown in /var/log/syslog during heavy traffic. 

I will keep testing and report back if anything changes, but as I said, it looks really good. 

Regards
Comment 31 Emmanuel Grumbach 2019-01-04 07:16:55 UTC
please send the output of iw wlp2s0 link
Comment 32 rsmith31415 2019-01-04 07:50:37 UTC
Sure. Is this the correct command? 

➜  ~  iw wlp0s20f3 link
Connected to xx:xx:xx:xx:xx:xx (on wlp0s20f3)
        SSID: WirelessNet
        freq: 2437
        RX: 11121023 bytes (2907715 packets)
        TX: 54319409 bytes (501439 packets)
        signal: -37 dBm
        tx bitrate: 300.0 MBit/s MCS 15 40MHz short GI

        bss flags:      short-slot-time
        dtim period:    1
        beacon int:     98
Comment 33 Emmanuel Grumbach 2019-01-04 08:09:55 UTC
Yes, you are using 40MHz, I wanted to check that the AP is not using 20 MHz despite being in auto mode. So it seems the situation is indeed better.
Comment 34 Emmanuel Grumbach 2019-01-04 08:10:57 UTC
I am closing this bug. Please reopen if needed. Thank you.
Comment 35 rsmith31415 2019-01-04 16:04:14 UTC
Awesome. Thank you!

Note You need to log in before you can comment on or make changes to this bug.