Bug 203709

Summary: iwlwifi: 8260: frequently disconnects since Linux 5.1 "No beacon heard and the time event is over already" - WIFI-25906
Product: Drivers Reporter: Denis Lisov (dennis.lissov)
Component: network-wireless-intelAssignee: DO NOT USE - assign "network-wireless-intel" component instead (linuxwifi)
Status: NEEDINFO ---    
Severity: normal CC: 5longluna, a.v.nikitushkin, abalmos, abhishek.naik, acelan, alegasalv, alexander, alito, amigan, andrey.vihrov, bavay, benjamin.redelings, borish, brian.litzinger, brian, bugzilla-kernel, bugzilla, callofdutypsp, cviruss, d, dagthree7, dcominottim, dirkjonker, eamonn.sullivan, edward.dan.hart, elfosardo, ericjolsen, Esokrarkose, fabio.coatti, fitapnet, frederico.klein, goodmirek, grbitt, grnnja, homegradients, honda, iamrihan, ian.cynk, iozho, jack, jamesps, jay+bko, jiffy, jiri.konec, jmherrero, johnsoft, juan.gonzalez.31, karthik.d.a, kernel, kernel, kernelorg, kernelspace, khalil.io+ubuntu, kotrfa, kuba, kult-ex, linux, linuxwifi, lubomir.sery, luca.loverde0, lukas.redlinger, lun, luukvbaal, mail, marc, mario, martin.stolpe, mathias.morbitzer, matthewbtjones2000, michael, mikecookie101, mikezackles, mwolf, namruf32, narutowindy, niclaas, nipsky, nomos48143, nsprea, nvcook42, oscarleal.network, ozancag, p.g.richardson, p, pahan, pascal, petrkr, phil+kernbugz, pirmin, Prabesh432, pszafer, radek, ronan, rubengees7, russianneuromancer, sehguh.hsa, sereza, smalleni, soprwa, spam, spam, stefan, sven.koehler, tclark77, testillox, thomas.natschlaeger, thomas, tmvolin, tomasz.belina, treahblade, triffid.hunter, untainsyd, wavexx, wgh, wifiintelbugsolution, wonghow, zhangjintao9020, zxwarior
Priority: P1    
Hardware: All   
OS: Linux   
Kernel Version: 5.1.5 Subsystem:
Regression: Yes Bisected commit-id:
Attachments: concatenation of full dmesg and post-boot logs of kernel+NM+wpa_supplicant
journal excerpt for intel 8265 NIC on iwd/networkd, iwlwifi debug=0x20
iwlwifi trace (5.2.9 w/patch from comment 6)
trace-cmd: Intel Corporation Wireless 7260 (rev 83)
Intel 7260 syslog accompanying the trace
Core dump and syslog on Intel 7260
5.4.0: trace-cmd record -e iwlwifi -e mac80211 -e cfg80211 -e iwlwifi_msg
log of AX200
Reverted SAR refactor patch for kernel 5.5.9
Sample output from journalctl (Ubuntu 20.04, 8265wifi, kernel 5.4.0)
trace_cmd output (8265wifi, Kernel 5.4.0)
trace_cmd output (8265wifi, kernel 5.4.0, debug-fw 36.9f0a2d68.0 op_mode iwlmvm)
fw_dbg_collect dump (8265wifi, kernel 5.4.0, debug-fw 36.9f0a2d68.0 op_mode iwlmvm)
Module parameter for beacon timeout
complete syslog output
syslog
dmesg output
dmesg after exiting the last suspend
Firmware dump crash
7260 system logs, trace and fw core dump
iwl-fw-error_2021-05-27_16-14-36.dump
dmseg log in Fedora 34 which has 5.13.16-200.fc34.x86_64 kernel at the moment
attachment-10582-0.html
attachment-29102-0.html
Log showing apparent race if beacon is lost during association

Description Denis Lisov 2019-05-25 23:06:34 UTC
Created attachment 282941 [details]
concatenation of full dmesg and post-boot logs of kernel+NM+wpa_supplicant

Hardware: Thinkpad P50 with Intel Corporation Wireless 8260 [8086:24f3]
Software: Gentoo Linux with vanilla kernel, NetworkManager using wpa_supplicant

In Linux 5.0 the WiFi connection to my home network is rather stable. In Linux 5.1 (5.1.5 at the moment) it's unstable with the following symptoms.

Ping check to the router looks like this
64 bytes from 192.168.183.1: icmp_seq=54 ttl=64 time=5.81 ms
64 bytes from 192.168.183.1: icmp_seq=55 ttl=64 time=1.00 ms
64 bytes from 192.168.183.1: icmp_seq=56 ttl=64 time=1.17 ms
64 bytes from 192.168.183.1: icmp_seq=57 ttl=64 time=3.31 ms
From 192.168.183.5 icmp_seq=58 Destination Host Unreachable
From 192.168.183.5 icmp_seq=59 Destination Host Unreachable
From 192.168.183.5 icmp_seq=60 Destination Host Unreachable
From 192.168.183.5 icmp_seq=65 Destination Host Unreachable
From 192.168.183.5 icmp_seq=66 Destination Host Unreachable
From 192.168.183.5 icmp_seq=67 Destination Host Unreachable
From 192.168.183.5 icmp_seq=68 Destination Host Unreachable
From 192.168.183.5 icmp_seq=69 Destination Host Unreachable
From 192.168.183.5 icmp_seq=70 Destination Host Unreachable
64 bytes from 192.168.183.1: icmp_seq=78 ttl=64 time=1.07 ms
64 bytes from 192.168.183.1: icmp_seq=79 ttl=64 time=0.940 ms
64 bytes from 192.168.183.1: icmp_seq=80 ttl=64 time=0.951 ms
64 bytes from 192.168.183.1: icmp_seq=81 ttl=64 time=0.950 ms
64 bytes from 192.168.183.1: icmp_seq=82 ttl=64 time=1.17 ms
and so on, changing from ok to host unreachable a few times per minute.

The dmesg output looks like this (full log in attachment):
[  408.580520] wlp4s0: associated
[  420.822273] wlp4s0: Connection to AP ec:43:f6:07:90:84 lost
[  423.420415] wlp4s0: authenticate with ec:43:f6:07:90:84
[  423.431798] wlp4s0: send auth to ec:43:f6:07:90:84 (try 1/3)
[  423.436016] wlp4s0: authenticated
[  423.438864] wlp4s0: associate with ec:43:f6:07:90:84 (try 1/3)
[  423.542285] wlp4s0: associate with ec:43:f6:07:90:84 (try 2/3)
[  423.550083] wlp4s0: RX AssocResp from ec:43:f6:07:90:84 (capab=0xc11 status=0 aid=3)
[  423.552862] wlp4s0: associated
[  424.045586] iwlwifi 0000:04:00.0: No beacon heard and the time event is over already...
[  424.045622] wlp4s0: Connection to AP ec:43:f6:07:90:84 lost
[  425.178388] wlp4s0: authenticate with ec:43:f6:07:90:84
[  425.188576] wlp4s0: send auth to ec:43:f6:07:90:84 (try 1/3)
[  425.191274] wlp4s0: authenticated
[  425.192183] wlp4s0: associate with ec:43:f6:07:90:84 (try 1/3)
[  425.202483] wlp4s0: RX AssocResp from ec:43:f6:07:90:84 (capab=0xc11 status=0 aid=3)
[  425.206217] wlp4s0: associated
[  425.802779] iwlwifi 0000:04:00.0: No beacon heard and the time event is over already...
[  425.802814] wlp4s0: Connection to AP ec:43:f6:07:90:84 lost
[  427.060487] wlp4s0: authenticate with ec:43:f6:07:90:84
[  427.069870] wlp4s0: send auth to ec:43:f6:07:90:84 (try 1/3)
[  427.072586] wlp4s0: authenticated
[  427.078966] wlp4s0: associate with ec:43:f6:07:90:84 (try 1/3)
[  427.185781] wlp4s0: associate with ec:43:f6:07:90:84 (try 2/3)
[  427.222630] wlp4s0: RX AssocResp from ec:43:f6:07:90:84 (capab=0xc11 status=0 aid=3)
[  427.225860] wlp4s0: associated
[  427.683876] iwlwifi 0000:04:00.0: No beacon heard and the time event is over already...
[  427.683911] wlp4s0: Connection to AP ec:43:f6:07:90:84 lost
[  440.104459] wlp4s0: authenticate with ec:43:f6:07:90:84
[  440.112495] wlp4s0: send auth to ec:43:f6:07:90:84 (try 1/3)
[  440.116197] wlp4s0: authenticated
[  440.119343] wlp4s0: associate with ec:43:f6:07:90:84 (try 1/3)
[  440.124865] wlp4s0: RX AssocResp from ec:43:f6:07:90:84 (capab=0xc11 status=0 aid=3)
[  440.129543] wlp4s0: associated
[  440.750534] IPv6: ADDRCONF(NETDEV_CHANGE): wlp4s0: link becomes ready
[  452.055008] wlp4s0: Connection to AP ec:43:f6:07:90:84 lost

The corresponding part of NetworkManager logs is attached too.
Comment 1 Ronan Pigott 2019-07-26 19:24:22 UTC
Created attachment 283993 [details]
journal excerpt for intel 8265 NIC on iwd/networkd, iwlwifi debug=0x20
Comment 2 Ronan Pigott 2019-07-26 19:26:16 UTC
I am experiencing what seems to be the same issue on the intel 8265.

```
$ lspci -kvnn | sed -n '/Network/,/^$/ p'
02:00.0 Network controller [0280]: Intel Corporation Wireless 8265 / 8275 [8086:24fd] (rev 78)
	Subsystem: Intel Corporation Dual Band Wireless-AC 8265 [8086:1010]
	Flags: bus master, fast devsel, latency 0, IRQ 141
	Memory at edc00000 (64-bit, non-prefetchable) [size=8K]
	Capabilities: <access denied>
	Kernel driver in use: iwlwifi
	Kernel modules: iwlwifi
```
In my experience the interface can remain stable for a long time, but occasionally will disconnect and then struggles to reassociate/authenticate for a while before becoming stable again. It sometimes records the 'no beacon heard' message and always disconnects with reason 4, being inactivity.

I'm using a different network stack, so I'll leave journal excerpts in case the iwd logging is more helpful. I've got some debugging set for iwlwifi as I was trying to debug my issue.

$ printf %#x\n $(cat /sys/module/iwlwifi/parameters/debug)
0x20

$ journalctl --boot=517d350fbd224b28a65e491ff95036db _KERNEL_DEVICE=+pci:0000:02:00.0 + _SYSTEMD_UNIT={iwd,systemd-network}.service
(attached)
Comment 3 Ronan Pigott 2019-07-26 19:35:49 UTC
forgot to mention -- in my case,

$ uname -r
5.2.2-arch1-1-ARCH
Comment 4 bugzilla.kernel.org 2019-08-01 01:57:29 UTC
Can see this occuring Intel Corporation Wireless 7260 as well on `5.1.15-arch1-1-ARCH`.
Comment 5 kurmikon 2019-08-12 20:29:36 UTC
It's happening also on my Intel 3165, even on Linux 5.0.
Comment 6 Luca Coelho 2019-08-22 05:22:33 UTC
This could be related to TX-AMSDU, which we unfortunately had to disable for older devices.  Can you try this and see if it helps you with stability?

https://patchwork.kernel.org/patch/11029027/
Comment 7 kurmikon 2019-08-22 06:25:42 UTC
Already used that patch compiling the kernel, but I had same issues.
Comment 8 Ronan Pigott 2019-08-22 06:34:28 UTC
If I'm not mistaken, patch 11029027 has been backported to the Arch core/linux since the release of version 5.2.arch2.

See here for the first appearance of the patch in Arch linux.git (v5.2, since 07/09): https://git.archlinux.org/linux.git/commit/?h=v5.2-arch2&id=f6f6b798c7330d7851fee4bbe835c3954886fc3f

Additionally see here for the inclusion of the patch in Arch linux.git (v5.2.2):  https://git.archlinux.org/linux.git/commit/?h=v5.2.2-arch1&id=d426d34276c9f17081179be6497cddd74154556a

Which means that the TX-AMSDU patch had already been applied when I encountered the error as I described above, and applied for the boot where my journal excerpt was taken and attached above.

Additionally, I have examples in my journal of what appears to be the same error (cycling disconnect with iwlwifi time event over already error) in 5.1.15-arch1, which predates the application of that patch, and another isolated example all the way back in 5.1.4-arch1 (a different AP as well), which predates the authorship of that patch. I can attach more logs if you would like.

So I have examples of the error with and without the patch.

For better or for worse, this error has somehow become much less common for me. The most recent example of the iwlwifi error for me was one spurious disconnect 1 week ago, in 5.2.7-arch1, even though I use this laptop every day.
Comment 9 Luca Coelho 2019-08-22 06:36:58 UTC
Thanks kurmikon@libero.it! But your device is very different from Denis' and Ronan's... I think it would be best to create a new bug report here and provide all the logs we recommend in our wiki[1]

[1] https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging
Comment 10 Luca Coelho 2019-08-22 06:38:56 UTC
Thanks for all the info, Ronan!

Can you also provide trace-cmd logs as explained in the wiki I linked in comment 9?
Comment 11 Ronan Pigott 2019-08-22 07:03:20 UTC
Okay, If I get the error again, I'll trace it.

I don't have a reliable way to reproduce atm, so if anyone experiencing this issue knows of a method that may trigger it, please let me know.
Comment 12 Denis Lisov 2019-08-23 09:22:21 UTC
I've tried the patch suggested in comment 6 on top of vanilla 5.2.9 and the bug seems to be still there. Anything else to try? Should I provide the trace-cmd trace too?
Comment 13 Luca Coelho 2019-08-23 09:24:58 UTC
Thanks for testing, Denis.

So, if the issue is not related, we could use some traces as mentioned in comment #9.
Comment 14 Denis Lisov 2019-08-23 19:06:20 UTC
Created attachment 284575 [details]
iwlwifi trace (5.2.9 w/patch from comment 6)

I've generated a short trace during which there were a number of lost ICMP packets and messages from wpa_supplicant in the log. The kernel was 5.2.9 + patch from comment 6. The command used was

trace-cmd record -e iwlwifi -e mac80211 -e cfg80211 -e iwlwifi_msg
Comment 15 Luca Coelho 2019-08-26 06:19:56 UTC
Thanks again.  We will assign someone internally to look into this.
Comment 16 brian 2019-08-27 22:37:12 UTC
Created attachment 284643 [details]
trace-cmd: Intel Corporation Wireless 7260 (rev 83)

Trace from Intel 7260 in ThinkPad T440s attached!
Comment 17 brian 2019-08-27 22:38:43 UTC
Created attachment 284645 [details]
Intel 7260 syslog accompanying the trace

This may help correlate timestamps in the trace.
Comment 18 brian 2019-08-27 22:40:00 UTC
Intel 7260 seems to be affected.

Linux version 5.2.9-arch1-1-ARCH (builduser@heftig-119803) (gcc version 9.1.0 (GCC)) #1 SMP PREEMPT Fri Aug 16 11:29:43 UTC 2019
Comment 19 brian 2019-08-31 19:04:25 UTC
Confirmed likely a software issue. Just installed another 7260 in same machine. Same issue.
Comment 20 Denis Lisov 2019-09-18 19:58:09 UTC
The problem is still present in 5.3.0.

Maybe they're independent, but I've also seen multiple firmware errors in GEO_TX_POWER_LIMIT (looking like https://bugzilla.kernel.org/show_bug.cgi?id=204151) on 5.3 while checking for this bug.
Comment 21 Karthik 2019-11-04 09:07:19 UTC
The issue is that beacons are not received at the driver level.
Please provide the fw traces.
  
https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging#firmware_debugging
Comment 22 Denis Lisov 2019-11-04 09:20:14 UTC
I've provided one for kernel 5.2.9 in comment 14. Anything that needs to be done differently, or just do the same on 5.3.latest?
Comment 23 Nathaniel Cook 2019-11-11 17:50:53 UTC
Created attachment 285861 [details]
Core dump and syslog on Intel 7260

Here is a core dump from right after the issue occurred. Also includes a syslog from boot. 


Kernel 5.3.10 Arch Linux

Hopes this helps. Let me know if something is off with the dump as this is the first time I have done this.
Comment 24 Denis Lisov 2019-11-26 08:59:40 UTC
This bug is still present in Linux 5.4.

I'm attaching a trace recorded on 5.4 with

trace-cmd record -e iwlwifi -e mac80211 -e cfg80211 -e iwlwifi_msg

The trace includes connecting to the WiFi network, staying on it for some time (with a few visible instances of this bug) and disconnecting.
Comment 25 Denis Lisov 2019-11-26 09:00:55 UTC
Created attachment 286063 [details]
5.4.0: trace-cmd record -e iwlwifi -e mac80211 -e cfg80211 -e iwlwifi_msg
Comment 26 Denis Lisov 2019-12-15 12:22:42 UTC
Ping. Is anything else I can help with?
Comment 27 pirmin 2019-12-27 08:05:57 UTC
Same problem here on a ThinkPad T470s (intel 8265). Latest checked kernel version was 5.5-rc3 on Opensuse Tumbleweed.

The same machine has a perfectly stable Wifi connection with Debian and kernel 4.19 (0.02% Packetloss during a 24h ping test) and had a stable connection on Tumbleweed with older kernels as well.
Comment 28 Emmanuel Grumbach 2020-01-21 07:40:11 UTC
@Denis: it looks like we can't hear beacons at regular intervals.
We disconnected because we couldn't hear beacons from almost a second and then we couldn't re-connect because we couldn't hear beacons during the connection phase.
If you have the possibility to put another Intel Wifi device as an air sniffer, it'd allow us to know what's going on there.

@Nathaniel: thank you for providing the firmware dump, please re-do this with a debug version of the firmware:
https://wireless.wiki.kernel.org/_media/en/users/drivers/iwlwifi/iwlwifi-7260-17.ucode.gz

Thanks.

PS: if someone can reproduce this with an AX200 device it'd be interesting.
PS2: people who don't have AX200 can try to apply this and let me know how it goes:

diff --git a/drivers/net/wireless/intel/iwlwifi/mvm/time-event.c b/drivers/net/wireless/intel/iwlwifi/mvm/time-event.c
index 1babc4bb5194..02a38b9a3126 100644
--- a/drivers/net/wireless/intel/iwlwifi/mvm/time-event.c
+++ b/drivers/net/wireless/intel/iwlwifi/mvm/time-event.c
@@ -600,7 +600,7 @@ void iwl_mvm_protect_session(struct iwl_mvm *mvm,
        time_cmd.action = cpu_to_le32(FW_CTXT_ACTION_ADD);
        time_cmd.id_and_color =
                cpu_to_le32(FW_CMD_ID_AND_COLOR(mvmvif->id, mvmvif->color));
-       time_cmd.id = cpu_to_le32(TE_BSS_STA_AGGRESSIVE_ASSOC);
+       time_cmd.id = cpu_to_le32(TE_BSS_STA_ASSOC);
 
        time_cmd.apply_time = cpu_to_le32(0);
 
@@ -608,7 +608,7 @@ void iwl_mvm_protect_session(struct iwl_mvm *mvm,
        time_cmd.max_delay = cpu_to_le32(max_delay);
        /* TODO: why do we need to interval = bi if it is not periodic? */
        time_cmd.interval = cpu_to_le32(1);
-       time_cmd.duration = cpu_to_le32(duration);
+       time_cmd.duration = cpu_to_le32(1000);
        time_cmd.repeat = 1;
        time_cmd.policy = cpu_to_le16(TE_V2_NOTIF_HOST_EVENT_START |
                                      TE_V2_NOTIF_HOST_EVENT_END |
Comment 29 mail 2020-01-21 13:35:51 UTC
AX200 here.
I replaced my 8265ngw with an AX200ngw hoping it would solve the problem.

It happens less frequently now with the AX200, and it is always able to reconnect within a few seconds. But it still happens.



>Jan 21 08:17:08 kernel: iwlwifi 0000:04:00.0: No beacon heard and the time
>event is over already...
>Jan 21 08:17:08 kernel: wlan0: Connection to AP XXX lost
>Jan 21 08:17:08  iwd[680]: Received Deauthentication event, reason: 4,
>from_ap: XXX false
>Jan 21 08:17:08 kernel: wlan0: authenticate with XXX
>Jan 21 08:17:08 kernel: wlan0: send auth to XXX (try 1/3)
>Jan 21 08:17:08 kernel: wlan0: authenticated
>Jan 21 08:17:08 kernel: wlan0: associate with XXX (try 1/3)
>Jan 21 08:17:08 kernel: wlan0: RX AssocResp from XXX (capab=0x811 status=0
>aid=3)
>Jan 21 08:17:08 kernel: wlan0: associated
>Jan 21 08:17:09 kernel: iwlwifi 0000:04:00.0: No beacon heard and the time
>event is over already...
>Jan 21 08:17:09 kernel: wlan0: Connection to AP XXX lost
>Jan 21 08:17:09 iwd[680]: Received Deauthentication event, reason: 4, from_ap:
>false
>Jan 21 08:17:09 kernel: wlan0: authenticate with XXX
>Jan 21 08:17:09 kernel: wlan0: send auth to XXX (try 1/3)
...
...
Comment 30 Emmanuel Grumbach 2020-01-21 18:01:06 UTC
Hi,

What firmware version do you use?
Comment 31 mail 2020-01-21 18:49:22 UTC
iwlwifi 0000:04:00.0: loaded firmware version 50.3e391d3e.0 op_mode iwlmvm
iwlwifi 0000:04:00.0: Detected Intel(R) Wi-Fi 6 AX200 160MHz, REV=0x340
Comment 32 Emmanuel Grumbach 2020-01-25 18:28:16 UTC
Ok, this doesn't include our new flow in the firmware that can help.

Can you try the patch I attached to simulate this in the driver?

Thanks.
Comment 33 Emmanuel Grumbach 2020-01-29 12:04:05 UTC
Hi,

did you have a chance to test the patch?
Comment 34 mail 2020-01-29 13:48:03 UTC
I'm trying to establish some kind of baseline behaviour so that I'll know whether the patch worked or not. The bug hasn't reappeared in several days. I'm wondering if there is some kind of stress I can apply that might trigger it.

What's the simplest way to apply the patch on my Arch Linux system?
Comment 35 Emmanuel Grumbach 2020-01-29 13:53:27 UTC
I guess the easiest may be to install our backport based driver. Then you can use our latest firmware.

You can take the master branch of:
https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/core_release

Let me know if you want to do this.
Comment 36 Denis Lisov 2020-01-29 16:57:46 UTC
I don't understand the reason, but at the moment I'm unable to reliably reproduce this problem on any of the kernels that used to display it. Will stay on mainline kernels for now and report when (if) the problem reproduces itself.
Comment 37 Emmanuel Grumbach 2020-01-29 17:17:37 UTC
My bet is that this problem is more related to the AP than to our device. But I can't prove this obviously. I do would like to get some testing on the patch.
For AX200 this code has been moved to the firmware and newest firmware (not publicly available) includes this new behavior.
Comment 38 Denis Lisov 2020-01-29 17:47:30 UTC
May be the case. However, when 5.1 came, it really felt like a regression, so I'd say that this particular problem was something that 5.0 handled better than 5.1.
Comment 39 Emmanuel Grumbach 2020-02-03 20:57:35 UTC
That would be really strange..
Still no luck on reproduction ?
Comment 40 Andrey Vihrov 2020-02-03 21:21:22 UTC
Hi,

I've been observing similar error messages with "Intel Corporation Wireless 7265 (rev 59)" since Linux 5.1. The behavior is similar to comment #2, it can work for hours and then enters a sequence of rapid disconnects.

I'll try to provide firmware logs later, but for now I've found that changing "Beacon Interval" from the default 100 ms to 50 ms on my OpenWrt AP fixes the problem. Perhaps this information is useful.
Comment 41 Emmanuel Grumbach 2020-02-04 06:03:39 UTC
Hi Andrey,

do you have additional system with Linux and an Intel WiFi device?

If set, I'd like you to try to record an air sniffer capture of the connection.

You shouldn't have to reduce the beacon interval.
Have you tried my patch from comment #28?
Comment 42 pirmin 2020-02-04 14:33:13 UTC
In my case the patch helped a lot. However: the first time I've tried it I observed quite bad ping times (sometimes up to 20 seconds when pinging the AP) but as soon as I plugged-in the power supply it went down to 0.8ms...

Maybe either a PM or an electrical grounding issue, who knows... Never happened again since then.
Comment 43 Emmanuel Grumbach 2020-02-04 15:07:22 UTC
@pirmin, do you have an additional Intel Wifi device on a Linux machine?
I'd like to get an Air sniffer of the connection.
Typically, it should work properly without my patch. I'd like to understand *why* it is needed.
Comment 44 pirmin 2020-02-05 10:09:46 UTC
@Emmanuel

unfortunatelly not, no.
Comment 45 Mike Wilson 2020-02-09 17:57:12 UTC
@Emmanuel I believe I'm experiencing this issue and I have multiple linux machines with Intel Wifi adapters. I'm happy to help if you can walk me through sniffing traffic to collect the data you're looking for.
Comment 46 Emmanuel Grumbach 2020-02-09 21:08:49 UTC
@Mike, great.
First we need to determine the channel of your AP, for this, do:
iw <iface name> link.

Then you can refer to https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging#air_sniffing
Try to collect a pcap file on your AP's channel just to see that things are working.
You can send me the file or attach it here.

Next step will be to do an air capture while the bug is happening.
Comment 47 Andrey Vihrov 2020-02-10 18:59:46 UTC
(In reply to Emmanuel Grumbach from comment #41)

Unfortunately, I have only one such machine.

I restored Beacon Interval to 100 ms on my AP, and successfully reproduced the problem with Linux 5.5.2 and firmware version 29.163394017.0 on Intel 7265 card.

Then I applied the patch from comment #28, and within a day could still reproduce the problem. In the logs I noticed also a few new "Time Event [start|end] notification failure" messages, which sometimes occurred between "No beacon heard" messages.

[  +0,820074] wlan0: authenticate with b0:48:7a:c2:7f:16
[  +0,005267] wlan0: send auth to b0:48:7a:c2:7f:16 (try 1/3)
[  +0,003479] wlan0: authenticated
[  +0,004186] wlan0: associate with b0:48:7a:c2:7f:16 (try 1/3)
[  +0,003872] wlan0: RX AssocResp from b0:48:7a:c2:7f:16 (capab=0x431 status=0 aid=2)
[  +0,001171] wlan0: associated
[  +0,008698] IPv6: ADDRCONF(NETDEV_CHANGE): wlan0: link becomes ready
[  +0,282995] iwlwifi 0000:02:00.0: Time Event end notification failure
[  +0,000026] wlan0: Connection to AP b0:48:7a:c2:7f:16 lost
[  +0,120978] iwlwifi 0000:02:00.0: Time Event start notification failure
[  +0,158990] wlan0: Connection to AP 00:00:00:00:00:00 lost
[  +1,032228] wlan0: authenticate with b0:48:7a:c2:7f:16
[  +0,004169] wlan0: send auth to b0:48:7a:c2:7f:16 (try 1/3)
[  +0,001163] iwlwifi 0000:02:00.0: Time Event end notification failure
[  +0,000023] wlan0: Connection to AP b0:48:7a:c2:7f:16 lost
[  +1,204168] wlan0: send auth to b0:48:7a:c2:7f:16 (try 2/3)
[  +0,001942] wlan0: authenticated
[  +0,002892] wlan0: associate with b0:48:7a:c2:7f:16 (try 1/3)
[  +0,001817] wlan0: RX AssocResp from b0:48:7a:c2:7f:16 (capab=0x431 status=0 aid=2)
[  +0,001030] wlan0: associated
[  +0,009735] IPv6: ADDRCONF(NETDEV_CHANGE): wlan0: link becomes ready
[  +1,006093] iwlwifi 0000:02:00.0: No beacon heard and the time event is over already...
[  +0,000083] wlan0: Connection to AP b0:48:7a:c2:7f:16 lost

Note that now the "No beacon heard" message occurs around 1 s after the initial connection, I suppose in accordance with the patch.
Comment 48 Francisco Cribari 2020-02-11 17:37:36 UTC
I have a Samsung laptop with an Intel 8260 card. I run Arch Linux. Since Arch upgraded to kernel 5.5 I have no wifi, unless I boot into the LTS kernel. 

dmesg info is available at https://pastebin.com/LY771WXg

Additional information: 

┌─[cribari @ darwin5r ~]
└─╼ uname -a
Linux darwin5r 5.5.2-arch2-2 #1 SMP PREEMPT Wed, 05 Feb 2020 22:01:13 +0000 x86_64 GNU/Linux

┌─[cribari @ darwin5r ~]
└─╼ ip a
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
       valid_lft forever preferred_lft forever
    inet6 ::1/128 scope host 
       valid_lft forever preferred_lft forever
2: enp2s0: <NO-CARRIER,BROADCAST,MULTICAST,UP> mtu 1500 qdisc fq_codel state DOWN group default qlen 1000
    link/ether 98:83:89:ca:d7:fc brd ff:ff:ff:ff:ff:ff
┌─[cribari @ darwin5r ~]
└─╼ 

┌─[cribari @ darwin5r ~]
└─╼ inxi -N
Network:
  Device-1: Intel Wireless 8260 driver: iwlwifi 
  Device-2: Realtek RTL8111/8168/8411 PCI Express Gigabit Ethernet 
  driver: r8169
Comment 49 Emmanuel Grumbach 2020-02-11 18:09:10 UTC
This is absolutely not related to this bug.
Comment 50 lukas.redlinger 2020-02-19 10:03:11 UTC
Created attachment 287507 [details]
log of AX200

Having the same here with an AX200.

Linux Helena 5.5.4-arch1-1 #1 SMP PREEMPT
05:00.0 Network controller: Intel Corporation Wi-Fi 6 AX200 (rev 1a)
iwlwifi 0000:05:00.0: loaded firmware version 50.3e391d3e.0 op_mode iwlmvm
CISCO based infrastructure, EAP-TLS with FT-EAP enabled. 5GHz

Strangely it startet at 00:00:09. System might have been a little busy as logrotate and some other things happen at 00:00? Uptime was roundabout ~14h till it happened.

Kept continuing till someone restarted the system but re-appeared 2:30 later


===

Feb 19 00:00:08 Helena kernel: wlp5s0: associate with 00:a7:42:ea:25:cf (try 1/3)
Feb 19 00:00:08 Helena kernel: wlp5s0: RX ReassocResp from 00:a7:42:ea:25:cf (capab=0x1111 status=0 aid=31)
Feb 19 00:00:08 Helena kernel: wlp5s0: associated 
Feb 19 00:00:09 Helena kernel: iwlwifi 0000:05:00.0: No beacon heard and the time event is over already...
Feb 19 00:00:09 Helena kernel: wlp5s0: Connection to AP 00:a7:42:ea:25:cf lost
Feb 19 00:00:09 Helena kernel: iwlwifi 0000:05:00.0: expected hw-decrypted unicast frame for station
Feb 19 00:00:09 Helena kernel: iwlwifi 0000:05:00.0: expected hw-decrypted unicast frame for station
Feb 19 00:00:09 Helena kernel: iwlwifi 0000:05:00.0: expected hw-decrypted unicast frame for station
Comment 51 lukas.redlinger 2020-02-19 11:43:06 UTC
I've seen some channel-freq-change stuff in wpa_supplicant debug log, while that error occured. Maybe it's somehow related:


1582110704.979768: nl80211: Ignored event 110 (NL80211_CMD_UNKNOWN) for foreign interface (ifindex 8 wdev 0x0)
1582110704.979778: nl80211: Drv Event 110 (NL80211_CMD_UNKNOWN) received for wlp5s0
1582110704.979784: nl80211: Channel switch started event
1582110704.979796: wlp5s0: Event CH_SWITCH_STARTED (39) received
1582110704.979813: wlp5s0: CTRL-EVENT-STARTED-CHANNEL-SWITCH freq=5220 ht_enabled=1 ch_offset=1 ch_width=80 MHz cf1=5210 cf2=0
1582110706.596660: nl80211: Event message available
1582110706.596714: nl80211: Ignored event 64 (NL80211_CMD_NOTIFY_CQM) for foreign interface (ifindex 8 wdev 0x0)
1582110706.596723: nl80211: Drv Event 64 (NL80211_CMD_NOTIFY_CQM) received for wlp5s0
1582110706.596729: nl80211: Beacon loss event
1582110706.596738: wlp5s0: Event BEACON_LOSS (53) received
1582110706.596750: wlp5s0: CTRL-EVENT-BEACON-LOSS 

...

1582110711.238897: BSS: 6c:8b:d3:61:7b:4f changed freq 5220 --> 5200
1582110711.238910: BSS: 6c:8b:d3:61:7b:4f has multiple entries in the scan results - select the most current one
Comment 52 Kyriakos Fytrakis 2020-02-19 12:53:33 UTC
Same here on Fedora 31.

~ $ uname -a
Linux hp-spectre-x360 5.4.19-200.fc31.x86_64 #1 SMP Wed Feb 12 15:21:24 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux


~ $ ip a
1: lo: <LOOPBACK,UP,LOWER_UP> mtu 65536 qdisc noqueue state UNKNOWN group default qlen 1000
    link/loopback 00:00:00:00:00:00 brd 00:00:00:00:00:00
    inet 127.0.0.1/8 scope host lo
       valid_lft forever preferred_lft forever
    inet6 ::1/128 scope host 
       valid_lft forever preferred_lft forever
2: wlo1: <BROADCAST,MULTICAST,UP,LOWER_UP> mtu 1500 qdisc noqueue state UP group default qlen 1000
    link/ether 4c:34:88:94:2c:f3 brd ff:ff:ff:ff:ff:ff
    inet 192.168.1.3/24 brd 192.168.1.255 scope global dynamic noprefixroute wlo1
       valid_lft 84056sec preferred_lft 84056sec
    inet6 2a02:2149:8487:8e00:be2e:dda1:1833:bdba/64 scope global dynamic noprefixroute 
       valid_lft 86151sec preferred_lft 42951sec
    inet6 fe80::dbcd:d6d3:ad84:1d6e/64 scope link noprefixroute 
       valid_lft forever preferred_lft forever


~ $ inxi -N
Network:   Device-1: Intel Wireless 7265 driver: iwlwifi 


~ $ journalctl | grep wpa_supplicant

Feb 19 11:25:14 hp-spectre-x360 audit[1]: SERVICE_START pid=1 uid=0 auid=4294967295 ses=4294967295 subj=system_u:system_r:init_t:s0 msg='unit=wpa_supplicant comm="systemd" exe="/usr/lib/systemd/systemd" hostname=? addr=? terminal=? res=success'
Feb 19 11:25:14 hp-spectre-x360 wpa_supplicant[998]: Successfully initialized wpa_supplicant
Feb 19 11:25:14 hp-spectre-x360 NetworkManager[944]: <info>  [1582104314.6416] supplicant: wpa_supplicant running
Feb 19 11:25:18 hp-spectre-x360 wpa_supplicant[998]: wlo1: CTRL-EVENT-REGDOM-CHANGE init=DRIVER type=COUNTRY alpha2=GR
Feb 19 11:25:19 hp-spectre-x360 wpa_supplicant[998]: wlo1: SME: Trying to authenticate with 34:4d:ea:95:5d:14 (SSID='Forthnet-4jPPK' freq=2412 MHz)
Feb 19 11:25:19 hp-spectre-x360 wpa_supplicant[998]: wlo1: Trying to associate with 34:4d:ea:95:5d:14 (SSID='Forthnet-4jPPK' freq=2412 MHz)
Feb 19 11:25:19 hp-spectre-x360 wpa_supplicant[998]: wlo1: Associated with 34:4d:ea:95:5d:14
Feb 19 11:25:19 hp-spectre-x360 wpa_supplicant[998]: wlo1: CTRL-EVENT-SUBNET-STATUS-UPDATE status=0
Feb 19 11:25:19 hp-spectre-x360 wpa_supplicant[998]: wlo1: WPA: Key negotiation completed with 34:4d:ea:95:5d:14 [PTK=CCMP GTK=CCMP]
Feb 19 11:25:19 hp-spectre-x360 wpa_supplicant[998]: wlo1: CTRL-EVENT-CONNECTED - Connection to 34:4d:ea:95:5d:14 completed [id=0 id_str=]
Feb 19 11:25:19 hp-spectre-x360 wpa_supplicant[998]: wlo1: CTRL-EVENT-SIGNAL-CHANGE above=0 signal=-39 noise=9999 txrate=1000
Feb 19 11:25:23 hp-spectre-x360 wpa_supplicant[998]: wlo1: WPA: Group rekeying completed with 34:4d:ea:95:5d:14 [GTK=CCMP]
Feb 19 11:28:01 hp-spectre-x360 wpa_supplicant[998]: wlo1: CTRL-EVENT-SIGNAL-CHANGE above=1 signal=-44 noise=9999 txrate=270000
Feb 19 11:35:22 hp-spectre-x360 wpa_supplicant[998]: wlo1: WPA: Group rekeying completed with 34:4d:ea:95:5d:14 [GTK=CCMP]
Feb 19 11:41:48 hp-spectre-x360 wpa_supplicant[998]: wlo1: CTRL-EVENT-BEACON-LOSS
Feb 19 11:41:50 hp-spectre-x360 wpa_supplicant[998]: wlo1: CTRL-EVENT-BEACON-LOSS
Feb 19 11:41:51 hp-spectre-x360 wpa_supplicant[998]: wlo1: CTRL-EVENT-SIGNAL-CHANGE above=0 signal=0 noise=9999 txrate=0
Feb 19 11:41:51 hp-spectre-x360 wpa_supplicant[998]: wlo1: CTRL-EVENT-DISCONNECTED bssid=34:4d:ea:95:5d:14 reason=4 locally_generated=1
Feb 19 11:41:51 hp-spectre-x360 wpa_supplicant[998]: dbus: wpa_dbus_property_changed: no property SessionLength in object /fi/w1/wpa_supplicant1/Interfaces/0
Comment 53 Danilo Cominotti Marques 2020-02-20 17:10:44 UTC
Had been facing the same disconnects (every few minutes) since I replaced a Qualcomm WiFi card with an AX200 on the other day. Using Ubuntu 19.10 and kernel 5.3, getting the same "No beacon heard and the time event is over already" messages and often failing to authenticate as well. After resuming from sleep, it was almost impossible to have working WiFi even for a few minutes, requiring a reboot but also facing new issues after a few minutes. However, I have fixed the issue by disabling ipv6 for the WiFi connection in NetworkManager, as I've seem somebody suggest in some other forum. Haven't noticed a single disconnect/issue ever since, even after resuming from sleep.
Comment 54 Danilo Cominotti Marques 2020-02-20 17:20:17 UTC
(In reply to Danilo Cominotti Marques from comment #53)
> Had been facing the same disconnects (every few minutes) since I replaced a
> Qualcomm WiFi card with an AX200 on the other day. Using Ubuntu 19.10 and
> kernel 5.3, getting the same "No beacon heard and the time event is over
> already" messages and often failing to authenticate as well. After resuming
> from sleep, it was almost impossible to have working WiFi even for a few
> minutes, requiring a reboot but also facing new issues after a few minutes.
> However, I have fixed the issue by disabling ipv6 for the WiFi connection in
> NetworkManager, as I've seem somebody suggest in some other forum. Haven't
> noticed a single disconnect/issue ever since, even after resuming from sleep.

Obs.: Have disabled ipv6 *and* set the 5GHz network in the AP to a fixed (39) channel.
Comment 55 WGH 2020-03-10 00:56:18 UTC
My 7260 on ThinkPad T440 also started to go into disconnecting rage once in a while ever since I upgraded from 4.19.x to 5.5.x kernel.
Comment 56 MIC 2020-03-11 17:55:39 UTC
(In reply to WGH from comment #55)
> My 7260 on ThinkPad T440 also started to go into disconnecting rage once in
> a while ever since I upgraded from 4.19.x to 5.5.x kernel.

Same for me
Network controller: Intel Corporation Wireless 7265 (rev 59)
on my T450s
Comment 57 Andre Jonas 2020-03-11 19:24:15 UTC
Same here for Network controller: Intel Corporation Wireless 7260 (rev 83), kernel 5.5.5, but it has been the same for months with the current kernel version. Though the disconnecting problems are not as severe and frequent as it was the case a while back.
Comment 58 iamrihan 2020-03-24 01:43:39 UTC
Been seeing the same issue since kernel 5.1 on Arch Linux.

~ lspci | rg Wireless
02:00.0 Network controller: Intel Corporation Wireless 3165 (rev 79)
~ uname -a
Linux ph 5.5.4-arch1-1 #1 SMP PREEMPT Sat, 15 Feb 2020 00:36:29 +0000 x86_64 GNU/Linux

The frequency of this occurring reduced a lot after I switched to channel 11 (from 9). I don't have an air sniffing log, but I don't think this is an AP issue as my Android phone stays connected to the same AP while this happens. I do however note that this happens only when my laptop is charging (may just be a coincidence, though, as the problem persists for a while even if I remove the charging cable).

Restarting wpa_supplicant a bunch of times usually fixes it for me. Happy to provide any other info I can.
Comment 59 Francesco Giudici 2020-03-27 09:29:06 UTC
Same issue here starting from 5.5.* kernels onward.

WiFi card: Intel(R) Dual Band Wireless AC 8265, REV=0x230

Description of the observed issue:
- random instability of the WiFi connection: from time to time bursts of ping to the AP are lost or have high delay (> 1000msec)
- noticeable regression (wrt 5.4.* kernels) in the transfer speed (expecially tx) with low signal (< -73dBM, as reported by iw). This is easily reproducible for me.

Bisected 5.5.1 kernel and found the commit causing the regression:
39c1a9728f93 iwlwifi: refactor the SAR tables from mvm to acpi

Recompiled kernel 5.5.9 reverting the commit (and conflicting ones).
Reverted:
8f5c78a456e2 Revert "iwlwifi: fw: make pos static in iwl_sar_get_ewrd_table() loop"
2f0a482f683d Revert "iwlwifi: mvm: fix non-ACPI function"
39c1a9728f93 iwlwifi: refactor the SAR tables from mvm to acpi

Issue is fixed. With low signal (<= -72) transfer speed is comparable to 5.4.x kernels. Loading the iwlwifi and iwlmvm modules with and without the patch, immediately shows the difference with low signal (< 73 dBM).

Tried to investigate 39c1a9728f93, spent much time: cannot real spot the problem, seems just a migration of the functions from mvm/fw.c to the acpi.c.

Please, help! :-)
Comment 60 Francesco Giudici 2020-03-27 09:35:35 UTC
Created attachment 288091 [details]
Reverted SAR refactor patch for kernel 5.5.9

Applying this patch (which just reverts SAR refactor patch) to a 5.5.z kernel series (tried with 5.5.1 and 5.5.9) brings back stability and performance in low signal conditions as available in 5.4.z kernels.
Comment 61 Andre Jonas 2020-03-28 22:17:24 UTC
(In reply to Francesco Giudici from comment #60)
> Created attachment 288091 [details]
> Reverted SAR refactor patch for kernel 5.5.9
> 
> Applying this patch (which just reverts SAR refactor patch) to a 5.5.z
> kernel series (tried with 5.5.1 and 5.5.9) brings back stability and
> performance in low signal conditions as available in 5.4.z kernels.

Hi, I applied the patch to kernel 5.5.9 (Arch Linux) but I’m still getting random disconnects with the “No beacon heard” message with Intel Corporation Wireless 7260 (rev 83).
Comment 62 Francesco Giudici 2020-03-29 08:22:11 UTC
(In reply to Andre Jonas from comment #61)
> (In reply to Francesco Giudici from comment #60)
> > Created attachment 288091 [details]
> > Reverted SAR refactor patch for kernel 5.5.9
> > 
> > Applying this patch (which just reverts SAR refactor patch) to a 5.5.z
> > kernel series (tried with 5.5.1 and 5.5.9) brings back stability and
> > performance in low signal conditions as available in 5.4.z kernels.
> 
> Hi, I applied the patch to kernel 5.5.9 (Arch Linux) but I’m still getting
> random disconnects with the “No beacon heard” message with Intel Corporation
> Wireless 7260 (rev 83).

Thanks for giving it a try. I can confirm the patch fixes the issues with my wifi card (Intel(R) Dual Band Wireless AC 8265, REV=0x230)... but clearly this is another issue then. Sorry, didn't wanted to bring more confusion here. I will open another bug.
Comment 63 Fitap 2020-04-24 01:30:06 UTC
Same issue here on gentoo-NB 5.4.28-gentoo.
I have two box with iwlwifi with different wifi cards and the error is the same in both cards.

iwlwifi 0000:03:00.0: No beacon heard and the time event is over already...
wlp3s0: Connection to AP e4:8d:8c:e2:ff:08 lost
wlp3s0: authenticate with e4:8d:8c:e2:ff:08
wlp3s0: send auth to e4:8d:8c:e2:ff:08 (try 1/3)
wlp3s0: authenticated
wlp3s0: associate with e4:8d:8c:e2:ff:08 (try 1/3)
wlp3s0: RX AssocResp from e4:8d:8c:e2:ff:08 (capab=0x431 status=0 aid=2)
wlp3s0: associated
wlp3s0: Connection to AP e4:8d:8c:e2:ff:08 lost
wlp3s0: authenticate with e4:8d:8c:e2:ff:08
wlp3s0: send auth to e4:8d:8c:e2:ff:08 (try 1/3)
wlp3s0: authenticated
wlp3s0: associate with e4:8d:8c:e2:ff:08 (try 1/3)
wlp3s0: RX AssocResp from e4:8d:8c:e2:ff:08 (capab=0x431 status=0 aid=2)
wlp3s0: associated
iwlwifi 0000:03:00.0: No beacon heard and the time event is over already...
wlp3s0: Connection to AP e4:8d:8c:e2:ff:08 lost
wlp3s0: authenticate with e4:8d:8c:e2:ff:08
wlp3s0: send auth to e4:8d:8c:e2:ff:08 (try 1/3)
wlp3s0: authenticated
wlp3s0: associate with e4:8d:8c:e2:ff:08 (try 1/3)
wlp3s0: RX AssocResp from e4:8d:8c:e2:ff:08 (capab=0x431 status=0 aid=2)
wlp3s0: associated
iwlwifi 0000:03:00.0: No beacon heard and the time event is over already...
wlp3s0: Connection to AP e4:8d:8c:e2:ff:08 lost
wlp3s0: authenticate with e4:8d:8c:e2:ff:08
wlp3s0: send auth to e4:8d:8c:e2:ff:08 (try 1/3)
wlp3s0: authenticated
wlp3s0: associate with e4:8d:8c:e2:ff:08 (try 1/3)
wlp3s0: RX AssocResp from e4:8d:8c:e2:ff:08 (capab=0x431 status=0 aid=2)
wlp3s0: associated
wlp3s0: Connection to AP e4:8d:8c:e2:ff:08 lost
wlp3s0: authenticate with e4:8d:8c:e2:ff:08
wlp3s0: send auth to e4:8d:8c:e2:ff:08 (try 1/3)
wlp3s0: authenticated
wlp3s0: associate with e4:8d:8c:e2:ff:08 (try 1/3)
wlp3s0: RX AssocResp from e4:8d:8c:e2:ff:08 (capab=0x431 status=0 aid=2)
wlp3s0: associated
wlp3s0: Connection to AP e4:8d:8c:e2:ff:08 lost
wlp3s0: authenticate with e4:8d:8c:e2:ff:08
wlp3s0: send auth to e4:8d:8c:e2:ff:08 (try 1/3)
wlp3s0: authenticated

Any workaround?
Regards.
Comment 64 kotrfa 2020-04-26 13:06:31 UTC
Exactly the same here: https://bbs.archlinux.org/viewtopic.php?pid=1901025#p1901025, I really don't know what to do next.

I am on Arch Linux, linux 5.6.6, wpa_supplicant 2.9-7.
Comment 65 Konstantin 2020-06-04 12:08:56 UTC
I have the same issue - it was working fine with 4.19 but no good with 5.4. Also I could give you a hint what might triggers this - I was using wifi analyzer app on my other device (Android) to scan for the channels and it appears it triggers the issue easily.

[328060.858996] iwlwifi 0000:04:00.0: No beacon heard and the time event is over already...
[328060.859005] wlp4s0: Connection to AP f4:f2:6d:c2:01:16 lost
[328064.334373] wlp4s0: authenticate with f4:f2:6d:c2:01:16
[328064.336691] wlp4s0: send auth to f4:f2:6d:c2:01:16 (try 1/3)
[328064.340536] wlp4s0: authenticated
[328064.344626] wlp4s0: associate with f4:f2:6d:c2:01:16 (try 1/3)
[328064.349165] wlp4s0: RX AssocResp from f4:f2:6d:c2:01:16 (capab=0x431 status=0 aid=1)
[328064.349993] wlp4s0: associated
[328113.788236] wlp4s0: Connection to AP f4:f2:6d:c2:01:16 lost
[328117.219951] wlp4s0: authenticate with f4:f2:6d:c2:01:16
[328117.221941] wlp4s0: send auth to f4:f2:6d:c2:01:16 (try 1/3)
[328117.225417] wlp4s0: authenticated
[328117.232304] wlp4s0: associate with f4:f2:6d:c2:01:16 (try 1/3)
[328117.236907] wlp4s0: RX AssocResp from f4:f2:6d:c2:01:16 (capab=0x431 status=0 aid=1)
[328117.237751] wlp4s0: associated
[328117.836053] iwlwifi 0000:04:00.0: No beacon heard and the time event is over already...
[328117.836064] wlp4s0: Connection to AP f4:f2:6d:c2:01:16 lost
[328121.302118] wlp4s0: authenticate with f4:f2:6d:c2:01:16
[328121.304557] wlp4s0: send auth to f4:f2:6d:c2:01:16 (try 1/3)
[328121.308252] wlp4s0: authenticated
[328121.312279] wlp4s0: associate with f4:f2:6d:c2:01:16 (try 1/3)
[328121.316874] wlp4s0: RX AssocResp from f4:f2:6d:c2:01:16 (capab=0x431 status=0 aid=1)
[328121.329829] wlp4s0: associated
Comment 66 Naruto windy 2020-06-06 15:00:14 UTC
anybody has a fix for this?? Its been there for last 6months.I had to switch to windows to be productive,Couldn't use my Linux system more than 10min without being disconneted from WIFI.
Comment 67 kotrfa 2020-06-17 14:10:00 UTC
I don't think there is any fix yet :-( . I am now on 5.7.2 (Arch Linux) and the problem is still there.
Comment 68 kotrfa 2020-06-17 14:15:29 UTC
It has `NEEDSINFO` status - what info is needed please?
Comment 69 mikezackles 2020-06-17 14:24:13 UTC
For the people on Arch, obviously it's not a permanent fix, but I downgraded to the Arch Linux LTS kernel (5.4.41) around a month ago and haven't observed the issue since.
Comment 70 brian 2020-06-17 14:48:19 UTC
Using Arch linux-hardened (5.6.18.a-1-hardened); The issue has not been 
observed since my AP's beacon interval was increased to 50ms. It was 
mentioned by someone here as a workaround.

On 6/17/20 10:24 AM, bugzilla-daemon@bugzilla.kernel.org wrote:
> https://bugzilla.kernel.org/show_bug.cgi?id=203709
>
> --- Comment #69 from mikezackles@gmail.com ---
> For the people on Arch, obviously it's not a permanent fix, but I downgraded
> to
> the Arch Linux LTS kernel (5.4.41) around a month ago and haven't observed
> the
> issue since.
>
Comment 71 Naruto windy 2020-06-17 15:17:14 UTC
It's observed on Debian , Ubuntu all other distros not only  on Arch Linux.
Comment 72 kotrfa 2020-06-18 13:42:09 UTC
I downgraded to 5.4.46-1 on Archlinux too and it seems it doesn't happen there indeed.
Comment 73 mikezackles 2020-06-18 15:29:09 UTC
I actually observed the issue happening last night (again on Arch LTS 5.4.41-1), but it eventually managed to connect after a minute or two. So it seems like the issue is still present with this kernel, but perhaps less severe, as my wifi became unusable for long periods of time with the default Arch kernel (independent of reboots).
Comment 74 Naruto windy 2020-06-18 19:09:23 UTC
If you're using networkmanager , it tries to autoconnect to the wireless AP, but if you're using iwd or wpa supplicant youre out of luck.
Comment 75 kotrfa 2020-06-18 19:13:24 UTC
OK, sorry for the confusion, I confirm it happens on 5.4 too :-(
Comment 76 Mathias Bavay 2020-07-06 08:27:44 UTC
Same thing happening on my Dell XPS 7590 using Killer Wi-Fi 6 AX1650 (see Bug 208425):
[52529.973261] iwlwifi 0000:3b:00.0: No beacon heard and the session protection is over already...
[52529.973354] wlp59s0: Connection to AP ac:22:05:83:cd:8d lost
[52530.330451] wlp59s0: send auth to ac:22:05:83:cd:8d (try 2/3)
[52531.226916] wlp59s0: send auth to ac:22:05:83:cd:8d (try 3/3)
[52532.125912] iwlwifi 0000:3b:00.0: No beacon heard and the session protection is over already...
[52532.125955] wlp59s0: Connection to AP ac:22:05:83:cd:8d lost
[52532.217812] wlp59s0: authentication with ac:22:05:83:cd:8d timed out
[52542.252729] wlp59s0: authenticate with ac:22:05:83:ce:3c
[52542.258736] wlp59s0: send auth to ac:22:05:83:ce:3c (try 1/3)
[52543.158015] iwlwifi 0000:3b:00.0: No beacon heard and the session protection is over already...
[52543.158106] wlp59s0: Connection to AP ac:22:05:83:ce:3c lost
[52543.210675] wlp59s0: send auth to ac:22:05:83:ce:3c (try 2/3)
[52544.109377] iwlwifi 0000:3b:00.0: No beacon heard and the session protection is over already...
[52544.109477] wlp59s0: Connection to AP ac:22:05:83:ce:3c lost
[52544.218379] wlp59s0: send auth to ac:22:05:83:ce:3c (try 3/3)
[52545.117366] iwlwifi 0000:3b:00.0: No beacon heard and the session protection is over already...
[52545.117443] wlp59s0: Connection to AP ac:22:05:83:ce:3c lost
[52545.209765] wlp59s0: authentication with ac:22:05:83:ce:3c timed out
[52556.087441] wlp59s0: authenticate with ac:22:05:83:ce:3c
[52556.092715] wlp59s0: send auth to ac:22:05:83:ce:3c (try 1/3)
[52556.991951] iwlwifi 0000:3b:00.0: No beacon heard and the session protection is over already...
[52556.992044] wlp59s0: Connection to AP ac:22:05:83:ce:3c lost
[52557.210817] wlp59s0: send auth to ac:22:05:83:ce:3c (try 2/3)
[52558.109601] iwlwifi 0000:3b:00.0: No beacon heard and the session protection is over already...
[52558.109688] wlp59s0: Connection to AP ac:22:05:83:ce:3c lost

It seems that this also depends on the access point that I use (on some, it happens very often, on some others almost never).
Comment 77 Naruto windy 2020-07-06 12:31:33 UTC
Temp solution is to use networkmanager , it automatically reconnects to the AP.

Still I want this bug to be disappear.
Comment 78 how 2020-07-09 09:24:28 UTC
I can confirm this bug happens in kernel 5.4.0-40 with Intel 8260. The No beacon keeps happening in a short time. Revert to kernel 4.15 and back to normal.
Comment 79 how 2020-07-09 09:30:28 UTC
when the No beacon problem happens, the wifi disconnects and you no longer see the AP at that frequency. I had to switch wifi off then on to detect the AP again.
Comment 80 sebalinux 2020-07-25 11:14:01 UTC
Hello, I'm suffering the same problem with my Thinkpad P52

Fedora 32

$ uname -a
Linux mariuccio-P52 5.7.9-200.fc32.x86_64 #1 SMP Fri Jul 17 16:23:37 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux


$ lspci -kvnn | sed -n '/Network/,/^$/ p'
00:14.3 Network controller [0280]: Intel Corporation Wireless-AC 9560 [Jefferson Peak] [8086:a370] (rev 10)
	Subsystem: Intel Corporation Device [8086:0030]
	Flags: bus master, fast devsel, latency 0, IRQ 16
	Memory at 604b10c000 (64-bit, non-prefetchable) [size=16K]
	Capabilities: <access denied>
	Kernel driver in use: iwlwifi
	Kernel modules: iwlwifi


Often I'm getting disconnected for few seconds


wpa_supplicant[16018]: nl80211: Event message available
wpa_supplicant[16018]: nl80211: Drv Event 64 (NL80211_CMD_NOTIFY_CQM) received for wlp0s20f3
wpa_supplicant[16018]: nl80211: Beacon loss event
wpa_supplicant[16018]: wlp0s20f3: Event BEACON_LOSS (53) received
wpa_supplicant[16018]: wlp0s20f3: CTRL-EVENT-BEACON-LOSS
wpa_supplicant[16018]: RTM_NEWLINK: ifi_index=3 ifname=wlp0s20f3 operstate=2 linkmode=1 ifi_family=0 ifi_flags=0x1003 ([UP])
wpa_supplicant[16018]: nl80211: Event message available
wpa_supplicant[16018]: nl80211: Drv Event 20 (NL80211_CMD_DEL_STATION) received for wlp0s20f3
wpa_supplicant[16018]: nl80211: Delete station 2c:30:33:26:5d:1c
wpa_supplicant[16018]: nl80211: Event message available
wpa_supplicant[16018]: nl80211: Drv Event 39 (NL80211_CMD_DEAUTHENTICATE) received for wlp0s20f3
wpa_supplicant[16018]: nl80211: MLME event 39 (NL80211_CMD_DEAUTHENTICATE) on wlp0s20f3(d4:3b:04:c3:59:ec) A1=2c:30:33:26:5d:1c A2=d4:3b:04:c3:59:ec
wpa_supplicant[16018]: nl80211: MLME event frame - hexdump(len=26): c0 00 00 00 2c 30 33 26 5d 1c d4 3b 04 c3 59 ec 2c 30 33 26 5d 1c 00 00 04 00
wpa_supplicant[16018]: nl80211: Deauthenticate event
wpa_supplicant[16018]: wlp0s20f3: Event DEAUTH (11) received
wpa_supplicant[16018]: wlp0s20f3: Deauthentication notification
wpa_supplicant[16018]: wlp0s20f3:  * reason 4 (DISASSOC_DUE_TO_INACTIVITY) locally_generated=1
wpa_supplicant[16018]: wlp0s20f3:  * address 2c:30:33:26:5d:1c
wpa_supplicant[16018]: Deauthentication frame IE(s) - hexdump(len=0): [NULL]
wpa_supplicant[16018]: wlp0s20f3: CTRL-EVENT-DISCONNECTED bssid=2c:30:33:26:5d:1c reason=4 locally_generated=1



The problem seems the same to me.

May I provide other useful info? This issue it's very annoying
Thanks,
Mario
Comment 81 dagthree7 2020-07-28 10:26:51 UTC
Hey guys so I am on arch linux and I wanted to provide some details. I use iwd for dhcp + wireless connections, and randomly in heavy network throughput the wireless connection will drop and I cannot ping my gateway or anything, so for all intents and purposes I am offline. I have GT-AX11000 asus router and the intel ax-200 wireless card. This is key, when I set a limit on my download or upload clients  at 100kbps I can stay connected for weeks at a time. If I run a very intensive (50MBps+) samba connection it will drop after an hour or so. If I attempt to saturate my ISP's uplink with a upload application such as bittorrent, it will disconnect me in seconds. (single peer, TCP only, so its not number of connections but maybe packets per second?)

Kernel information.
I have tried the latest kernel and the lts kernel, both have identical issues.
Currently on LTS, downgraded from current today.
```uname -r
5.4.53-1-lts```

I believe this is a bug in iwd, so unrelated, but 
``` cat /etc/iwd/main.conf
[General]
EnableNetworkConfiguration=true
[Network]
NameResolvingService=systemd
```

This will not reconnect when it drops. I am going to try network manager and see if it reconnects, but the goal would be for it to maintain a good connection when the signal is good at all times. IWD says I have 5/5 signal strength if that helps.

My routers default beacon was 100ms, and I changed this to 50ms. I have not noticed a disconnection yet. Will post back later in 48 hours, this is usually my disconnection frequency. When it disconnects, only a reboot of the iwd service will bring it back. (Might be a bug with iwd, NetworkManager apparently will just keep re-connecting?)

Also, I apologuise for my ignorance but I am unsure how to apply kernel patches to be of help in this matter. 

Thanks for your time.
Comment 82 Naruto windy 2020-07-28 12:49:30 UTC
I tried with wpa_supplicant , networkmanager, iwd all will disconnects the network. Only networkmanaher will able to reconnect. But this is not the point. Wifi connection is so essential these days should run without a drop for weeks.
Comment 83 sebalinux 2020-07-30 03:42:50 UTC
(In reply to sebalinux from comment #80)
> Hello, I'm suffering the same problem with my Thinkpad P52
> 
> Fedora 32
> 
> $ uname -a
> Linux mariuccio-P52 5.7.9-200.fc32.x86_64 #1 SMP Fri Jul 17 16:23:37 UTC
> 2020 x86_64 x86_64 x86_64 GNU/Linux
> 
> 
> $ lspci -kvnn | sed -n '/Network/,/^$/ p'
> 00:14.3 Network controller [0280]: Intel Corporation Wireless-AC 9560
> [Jefferson Peak] [8086:a370] (rev 10)
>       Subsystem: Intel Corporation Device [8086:0030]
>       Flags: bus master, fast devsel, latency 0, IRQ 16
>       Memory at 604b10c000 (64-bit, non-prefetchable) [size=16K]
>       Capabilities: <access denied>
>       Kernel driver in use: iwlwifi
>       Kernel modules: iwlwifi
> 
> 
> Often I'm getting disconnected for few seconds
> 
> 
> wpa_supplicant[16018]: nl80211: Event message available
> wpa_supplicant[16018]: nl80211: Drv Event 64 (NL80211_CMD_NOTIFY_CQM)
> received for wlp0s20f3
> wpa_supplicant[16018]: nl80211: Beacon loss event
> wpa_supplicant[16018]: wlp0s20f3: Event BEACON_LOSS (53) received
> wpa_supplicant[16018]: wlp0s20f3: CTRL-EVENT-BEACON-LOSS
> wpa_supplicant[16018]: RTM_NEWLINK: ifi_index=3 ifname=wlp0s20f3 operstate=2
> linkmode=1 ifi_family=0 ifi_flags=0x1003 ([UP])
> wpa_supplicant[16018]: nl80211: Event message available
> wpa_supplicant[16018]: nl80211: Drv Event 20 (NL80211_CMD_DEL_STATION)
> received for wlp0s20f3
> wpa_supplicant[16018]: nl80211: Delete station 2c:30:33:26:5d:1c
> wpa_supplicant[16018]: nl80211: Event message available
> wpa_supplicant[16018]: nl80211: Drv Event 39 (NL80211_CMD_DEAUTHENTICATE)
> received for wlp0s20f3
> wpa_supplicant[16018]: nl80211: MLME event 39 (NL80211_CMD_DEAUTHENTICATE)
> on wlp0s20f3(d4:3b:04:c3:59:ec) A1=2c:30:33:26:5d:1c A2=d4:3b:04:c3:59:ec
> wpa_supplicant[16018]: nl80211: MLME event frame - hexdump(len=26): c0 00 00
> 00 2c 30 33 26 5d 1c d4 3b 04 c3 59 ec 2c 30 33 26 5d 1c 00 00 04 00
> wpa_supplicant[16018]: nl80211: Deauthenticate event
> wpa_supplicant[16018]: wlp0s20f3: Event DEAUTH (11) received
> wpa_supplicant[16018]: wlp0s20f3: Deauthentication notification
> wpa_supplicant[16018]: wlp0s20f3:  * reason 4 (DISASSOC_DUE_TO_INACTIVITY)
> locally_generated=1
> wpa_supplicant[16018]: wlp0s20f3:  * address 2c:30:33:26:5d:1c
> wpa_supplicant[16018]: Deauthentication frame IE(s) - hexdump(len=0): [NULL]
> wpa_supplicant[16018]: wlp0s20f3: CTRL-EVENT-DISCONNECTED
> bssid=2c:30:33:26:5d:1c reason=4 locally_generated=1
> 
> 
> 
> The problem seems the same to me.
> 
> May I provide other useful info? This issue it's very annoying
> Thanks,
> Mario


Hello,
just another little piece of information: yesterday morning I activated the 5Ghz net in my router and I switched to it. Results: about 24H without no drop, no disconnection, no bug. The router is the same, the laptop is the same, what changed is only the wifi network frequency. I tried this but I have to use 2.4Ghz network, so I would like to see the bug solved.
Comment 84 dagthree7 2020-07-30 06:21:52 UTC
Alright here is what I think is going on.

Setting beacon to 50ms vs 100ms keeps the card from going into sleep mode. I think the sleep mode is the issue.... Maybe too aggressive power save?

sebalinux 
This may work for you, but I was on a 5GHz channel 160MHz and I still have major issues. Setting beacon helped, but it will still eventually disconnect (am on latest arch kernel). Since I was on iwd, every 2 days I would need to manually restart the service as it was a nas. NetworkManager service will reboot the connection on its own, so it helped. But disconnecting randomly is not ideal.
Comment 85 sebalinux 2020-07-30 19:07:17 UTC
(In reply to dagthree7 from comment #84)
> Alright here is what I think is going on.
> 
> Setting beacon to 50ms vs 100ms keeps the card from going into sleep mode. I
> think the sleep mode is the issue.... Maybe too aggressive power save?
> 
> sebalinux 
> This may work for you, but I was on a 5GHz channel 160MHz and I still have
> major issues. Setting beacon helped, but it will still eventually disconnect
> (am on latest arch kernel). Since I was on iwd, every 2 days I would need to
> manually restart the service as it was a nas. NetworkManager service will
> reboot the connection on its own, so it helped. But disconnecting randomly
> is not ideal.

I completely agree that this behavior is not ideal nor acceptable. I only shared my experience (update: it was working for two days with no disconnection) just to add some (I hope useful) informations to people that are analyzing the issue.
I have to switch down the 5Ghz wifi for other reason, so it was only a test to verify if the problem was related to the different frequency.
Comment 86 kotrfa 2020-07-31 07:13:22 UTC
I was experiencing this problem on both 2 and 5GHz. Reducing beacon interval from 100 to 50 didn't eliminate the issue.
Comment 87 Naruto windy 2020-07-31 13:26:05 UTC
Can we just take a break and admire that this bug is here for a more than a year. Thank you Intel.
Comment 88 Sai Sindhur Malleni 2020-08-01 22:57:33 UTC
Not to muddy the waters, but this is still an issue on my Lenovo Thinkpad P1Gen2 notebook with Fedora 32

[root@localhost conf.d]# uname -a
Linux localhost.localdomain 5.7.10-201.fc32.x86_64 #1 SMP Thu Jul 23 00:58:39 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux

[root@localhost conf.d]# ethtool -i wlp82s0
driver: iwlwifi
version: 5.7.10-201.fc32.x86_64
firmware-version: 53.c31ac674.0 cc-a0-53.ucode
expansion-rom-version: 
bus-info: 0000:52:00.0
supports-statistics: yes
supports-test: no
supports-eeprom-access: no
supports-register-dump: no
supports-priv-flags: no

[root@localhost conf.d]# lspci | grep -i wi-fi
52:00.0 Network controller: Intel Corporation Wi-Fi 6 AX200 (rev 1a)

Wifi disconnects every 20ish minutes,and in the journal I see messages like
Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0: CTRL-EVENT-BEACON-LOSS
Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0: CTRL-EVENT-BEACON-LOSS
Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0: CTRL-EVENT-BEACON-LOSS
Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0: CTRL-EVENT-BEACON-LOSS
Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0: CTRL-EVENT-BEACON-LOSS
Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0: CTRL-EVENT-BEACON-LOSS
Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0: CTRL-EVENT-BEACON-LOSS
Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0: CTRL-EVENT-BEACON-LOSS
Aug 01 17:53:36 localhost.localdomain kernel: wlp82s0: Connection to AP 20:c0:47:e1:69:1c lost
Aug 01 17:53:36 localhost.localdomain wpa_supplicant[1113]: wlp82s0: CTRL-EVENT-SIGNAL-CHANGE above=0 signal=0 noise=9999 txrate=0
Aug 01 17:53:36 localhost.localdomain wpa_supplicant[1113]: wlp82s0: CTRL-EVENT-DISCONNECTED bssid=20:c0:47:e1:69:1c reason=4 locally_generated=1
Aug 01 17:53:36 localhost.localdomain wpa_supplicant[1113]: dbus: wpa_dbus_property_changed: no property SessionLength in object /fi/w1/wpa_supplicant1/Interfaces/13

While we "wait" for this bug to be solved, I figured I'd just share what helped make my connection more stable.

1. I disabled IPv6 (not sure this step is really needed)
[root@localhost conf.d]# cat /etc/sysctl.conf 
net.ipv6.conf.all.disable_ipv6 = 1
net.ipv6.conf.default.disable_ipv6 = 1
net.ipv6.conf.lo.disable_ipv6 = 1

2. Then I disabled powersaving on the Wi-Fi card (this probably helped the most)

[root@localhost conf.d]# cat /etc/NetworkManager/conf.d/wifi-power-save-off.conf 
[connection]
# Values are 0 (use default), 1 (ignore/don't touch), 2 (disable) or 3 (enable).
wifi.powersave = 2

3. Following https://forum.manjaro.org/t/solved-intel-ax200-iwlwifi-based-adapter-slow-wifi/148708 I also did lsmod | grep -o -e ^iwlmvm -e ^iwldvm -e ^iwlwifi | xargs sudo rmmod && sleep 3 && sudo modprobe iwlwifi swcrypto=1 11n_disable=1 and added 
[root@localhost conf.d]# cat /etc/modprobe.d/iwlwifi.conf 
options iwlwi fi swcrypto=1 11n_disable=1
for persistance.

While I realize that I probably tried a few different things and probably (2) helped the most, I have a lot more stable Wi-Fi connection now.
Comment 89 Naruto windy 2020-08-01 23:36:45 UTC
@sai(In reply to Sai Sindhur Malleni from comment #88)
> Not to muddy the waters, but this is still an issue on my Lenovo Thinkpad
> P1Gen2 notebook with Fedora 32
> 
> [root@localhost conf.d]# uname -a
> Linux localhost.localdomain 5.7.10-201.fc32.x86_64 #1 SMP Thu Jul 23
> 00:58:39 UTC 2020 x86_64 x86_64 x86_64 GNU/Linux
> 
> [root@localhost conf.d]# ethtool -i wlp82s0
> driver: iwlwifi
> version: 5.7.10-201.fc32.x86_64
> firmware-version: 53.c31ac674.0 cc-a0-53.ucode
> expansion-rom-version: 
> bus-info: 0000:52:00.0
> supports-statistics: yes
> supports-test: no
> supports-eeprom-access: no
> supports-register-dump: no
> supports-priv-flags: no
> 
> [root@localhost conf.d]# lspci | grep -i wi-fi
> 52:00.0 Network controller: Intel Corporation Wi-Fi 6 AX200 (rev 1a)
> 
> Wifi disconnects every 20ish minutes,and in the journal I see messages like
> Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0:
> CTRL-EVENT-BEACON-LOSS
> Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0:
> CTRL-EVENT-BEACON-LOSS
> Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0:
> CTRL-EVENT-BEACON-LOSS
> Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0:
> CTRL-EVENT-BEACON-LOSS
> Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0:
> CTRL-EVENT-BEACON-LOSS
> Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0:
> CTRL-EVENT-BEACON-LOSS
> Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0:
> CTRL-EVENT-BEACON-LOSS
> Aug 01 17:53:35 localhost.localdomain wpa_supplicant[1113]: wlp82s0:
> CTRL-EVENT-BEACON-LOSS
> Aug 01 17:53:36 localhost.localdomain kernel: wlp82s0: Connection to AP
> 20:c0:47:e1:69:1c lost
> Aug 01 17:53:36 localhost.localdomain wpa_supplicant[1113]: wlp82s0:
> CTRL-EVENT-SIGNAL-CHANGE above=0 signal=0 noise=9999 txrate=0
> Aug 01 17:53:36 localhost.localdomain wpa_supplicant[1113]: wlp82s0:
> CTRL-EVENT-DISCONNECTED bssid=20:c0:47:e1:69:1c reason=4 locally_generated=1
> Aug 01 17:53:36 localhost.localdomain wpa_supplicant[1113]: dbus:
> wpa_dbus_property_changed: no property SessionLength in object
> /fi/w1/wpa_supplicant1/Interfaces/13
> 
> While we "wait" for this bug to be solved, I figured I'd just share what
> helped make my connection more stable.
> 
> 1. I disabled IPv6 (not sure this step is really needed)
> [root@localhost conf.d]# cat /etc/sysctl.conf 
> net.ipv6.conf.all.disable_ipv6 = 1
> net.ipv6.conf.default.disable_ipv6 = 1
> net.ipv6.conf.lo.disable_ipv6 = 1
> 
> 2. Then I disabled powersaving on the Wi-Fi card (this probably helped the
> most)
> 
> [root@localhost conf.d]# cat
> /etc/NetworkManager/conf.d/wifi-power-save-off.conf 
> [connection]
> # Values are 0 (use default), 1 (ignore/don't touch), 2 (disable) or 3
> (enable).
> wifi.powersave = 2
> 
> 3. Following
> https://forum.manjaro.org/t/solved-intel-ax200-iwlwifi-based-adapter-slow-
> wifi/148708 I also did lsmod | grep -o -e ^iwlmvm -e ^iwldvm -e ^iwlwifi |
> xargs sudo rmmod && sleep 3 && sudo modprobe iwlwifi swcrypto=1
> 11n_disable=1 and added 
> [root@localhost conf.d]# cat /etc/modprobe.d/iwlwifi.conf 
> options iwlwi fi swcrypto=1 11n_disable=1
> for persistance.
> 
> While I realize that I probably tried a few different things and probably
> (2) helped the most, I have a lot more stable Wi-Fi connection now.


I believe i tried all of the things you mentioning in all the kernel firmware updates since aug 2019, probably more than the Intel engineers.none of them works. Can't argue the fact that they are just a Titanic, they are doomed for sinking. No wonder they are failing. RIP
Comment 90 Alexander Tsoy 2020-08-04 15:03:41 UTC
Confirming this issue with ath9k-based APs. My router is Turris Omnia with Qualcomm Atheros AR9287 and QCA9880 cards. Turris OS 3.x.x (based on OpenWRT 15.05) had some issues with ath10k, so my laptop was often connected to 2.4GHz network. I had multiple disconnections per day, sometimes to the point that I need to manually disconnect and connect again:

$ sudo LC_ALL=en_US.UTF-8 journalctl | fgrep 'Connection to AP 04:f0:21:XX:XX:XX lost' | awk '{print $1" "$2}' | uniq -c 
     33 Jun 23
      2 Jun 25
     60 Jun 26
      1 Jun 27
     80 Jun 28
     23 Jun 29
      2 Jul 02
     26 Jul 03
     71 Jul 08
    167 Jul 09
     56 Jul 10
     67 Jul 11
     56 Jul 14
     25 Jul 15

After upgrade to the Turris OS 5.x.x (based on OpenWRT 19.07) my laptop is always connected to the 5GHz network and the issue is completely gone. I didn't change anything in my laptop and/or router configuration.


My laptop wifi card:

[  171.990009] iwlwifi 0000:00:14.3: Detected Intel(R) Dual Band Wireless AC 9560, REV=0x318

00:14.3 Network controller [0280]: Intel Corporation Cannon Point-LP CNVi [Wireless-AC] [8086:9df0] (rev 11)
	Subsystem: Intel Corporation Cannon Point-LP CNVi [Wireless-AC] [8086:0034]
	Flags: bus master, fast devsel, latency 0, IRQ 16
	Memory at f1210000 (64-bit, non-prefetchable) [size=16K]
	Capabilities: [c8] Power Management version 3
	Capabilities: [d0] MSI: Enable- Count=1/1 Maskable- 64bit+
	Capabilities: [40] Express Root Complex Integrated Endpoint, MSI 00
	Capabilities: [80] MSI-X: Enable+ Count=16 Masked-
	Capabilities: [100] Null
	Capabilities: [14c] Latency Tolerance Reporting
	Capabilities: [164] Vendor Specific Information: ID=0010 Rev=0 Len=014 <?>
	Kernel driver in use: iwlwifi
	Kernel modules: iwlwifi
Comment 91 Levi Harris-Browning 2020-08-05 17:10:55 UTC
AX200 on Arch Linux here been having this problem since I purchased the card a few weeks ago.

I updated to kernel 5.8.0.1 via linux-mainline in the aur today and so far have been connected for nearly four hours of solidly hitting the connection without any beacon loss (or any connection loss). This is much longer than I normally get. It could be a coincidence but it hasn't lasted this long before.

For a full list of things I've also changed or have:

AX200 (more specifically this one: https://www.amazon.co.uk/gp/product/B07YZ7J8QD)

Computer is a Dell OptiPlex 990 SFF, using second PCIe slot, first used by an Nvidia NVS 510

Arch Linux x86-64

linux-mainline 5.8-1 from the aur

Disconnects frequently (hour max) and has trouble connecting to TalkTalk's ISP router, Sagemcom FAST5364 3.0. Stays connected to Galaxy Note 10+ as a hotspot to the same router for a few hours at a time before disconnecting. Neither has had their beacon interval changed (not changeable on both). I have yet to try it on my home Ubiquiti Unifi UAP-AC-LITE as I won't be there for a while.

Using NetworkManager

Disabled ipv6

Disabled power saving via modprobe with:
options iwlmvm power_scheme=1
options iwlwifi power_save=0

Tried both latest Arch kernel (5.7.9) and LTS and issue persists on both

Would be great if someone else could have a go and see if they're having a consistent connection on 5.8
Comment 92 Levi Harris-Browning 2020-08-05 17:45:26 UTC
I'm not sure how to edit comments, I forgot this thing I changed as well:

Disabled mac spoofing randomisation, instead using the actual mac for the card. I found this made reconnections easier and possibly allowed it to stay connected a bit longer, although that may have been a placebo.
Comment 93 Levi Harris-Browning 2020-08-06 20:47:54 UTC
Just to say it has lasted over 30 hours without a disconnection now. I had the following a few times around 1pm today:

    iwlwifi 0000:04:00.0: Unhandled alg: 0x71b

But that didn't actually cause any connection loss or any visible effects (I had a download going at the time and it didn't even slow down), and no beacon errors or anything else.
Comment 94 Mathias Bavay 2020-08-10 09:06:05 UTC
So far, keeping ipv6 but turning off power saving via modprobe seems to be a great workaround (no more firmware crashes in the last few days when it used to crash many times a day with every heavy web usage). 

So, just add to /etc/modprobe.d/iwlwifi.conf the following:
options iwlmvm power_scheme=1
options iwlwifi power_save=0
Comment 95 Sai Sindhur Malleni 2020-08-10 13:40:48 UTC
Unfortunately, disabling power saving in the driver alone is not solving my problem. The only thing that has solved my problem so far is 
 lsmod | grep -o -e ^iwlmvm -e ^iwldvm -e ^iwlwifi |xargs sudo rmmod && sleep 3 && sudo modprobe iwlwifi 11n_disable=1

At which point I have horrible throughput and can't even connect to 5GHz networks anymore.
Comment 96 Sai Sindhur Malleni 2020-08-10 13:41:09 UTC
Unfortunately, disabling power saving in the driver alone is not solving my problem. The only thing that has solved my problem so far is 
 lsmod | grep -o -e ^iwlmvm -e ^iwldvm -e ^iwlwifi |xargs sudo rmmod && sleep 3 && sudo modprobe iwlwifi 11n_disable=1

At which point I have horrible throughput and can't even connect to 5GHz networks anymore.
Comment 97 kurmikon 2020-08-10 15:00:42 UTC
I resolved this issue blacklisting iwlwifi and buying a Chinese USB wifi adapter.

Thank you Intel.
Comment 98 kotrfa 2020-08-11 17:01:04 UTC
Mathias Bavay solution actually solved that for me (THANKS!). Specifically I have:

* TP-Link Archer C6 router, 5GHz, beacon interval set to 50. 

* Intel Corporation Wireless 8260 (rev 3a) (with Thinkpad T460s)

* Linux 5.7.12-arch1-1 #1 SMP PREEMPT Fri, 31 Jul 2020 17:38:22 +0000 x86_64 GNU/Linux (previously I was on lts419 which didn't have this bug)

* modprobe like:
# cat /etc/modprobe.d/iwlwifi.conf
options iwlmvm power_scheme=1
options iwlwifi power_save=0

I have been stable for a day and a half. 

I have not disabled ipv6 (although I don't use it). I also didn't fixed channel on the router. See my early attempts earlier.
Comment 99 kotrfa 2020-08-19 06:47:58 UTC
Update to my last comment: it only makes the problem less likely apparently - today I got in the "No beacon heard..." after a week of stable connection (two days ago, I upgraded to 5.8.1 though).
Comment 100 kotrfa 2020-08-24 20:18:40 UTC
Sorry, another update - it got to the same frequency as before, therefore my last two comments were just a random noise, sorry for that.

A friend of mine has exactly the same hardware as I do, Thinkpad T460s with Intel Corporation Wireless 8260 (rev 3a), but with Linux Manjaro (I have Arch). He doesn't experience the problems at all. Is there anyone who's _not_ on Arch and experiencing this?
Comment 101 MIC 2020-08-24 20:25:22 UTC
Manjaro and Kernel 5.8.1 here on my T450s. Card is Intel Corporation Wireless 7265 (rev 59).
Had this issue with every recent Kernel.
The comment from Mathias Bavay fixes the issue for me:
https://bugzilla.kernel.org/show_bug.cgi?id=203709#c94
Comment 102 GRbit 2020-08-28 09:50:27 UTC
Debian 5.8.1 toshiba portege x30-f. Never had this issue on 5.7.x kernels, but since 5.8.1 had this issue twice, but can't reproduce on purpose. Since I've got it on AC power, it doesn't seems like switching off powersave can help.
Comment 103 Mathias Bavay 2020-08-28 11:15:18 UTC
@GRbit I forgot to mention, I was always on AC power.  It's just that the wifi card would by default try to use a power save mode that seems quite buggy...
Comment 104 khalil.io+ubuntu 2020-08-31 14:33:04 UTC
I am experiencing what seems to be the same issue on the intel 8265 / 8275. Performance drops, and only picks up speed when WiFi is turned on and off

$ uname -r
5.4.0-42-generic

$ lspci
00:1f.4 SMBus: Intel Corporation 100 Series/C230 Series Chipset Family SMBus (rev 31)
01:00.0 Display controller: Advanced Micro Devices, Inc. [AMD/ATI] Polaris 22 MGL XL [Radeon Pro WX Vega M GL] (rev c0)
02:00.0 Network controller: Intel Corporation Wireless 8265 / 8275 (rev 78)
03:00.0 Unassigned class [ff00]: Realtek Semiconductor Co., Ltd. RTS525A PCI Express Card Reader (rev 01)
Comment 105 khalil.io+ubuntu 2020-09-01 16:59:25 UTC
It seems that the error is with some IPv6 configuration. In Ubuntu, I disabled DNS and Auto Routes and the error went almost unnoticed
Comment 106 Sergey Slizovskiy 2020-09-02 19:21:31 UTC
I want to mention that the below advice has helped me with AC 8265 adaptor.  Iw worked fine on Windows, but dropped packets in Kernel 5.4 and 5.8
Thank you,
Sergey

(In reply to Mathias Bavay from comment #94)
> So far, keeping ipv6 but turning off power saving via modprobe seems to be a
> great workaround (no more firmware crashes in the last few days when it used
> to crash many times a day with every heavy web usage). 
> 
> So, just add to /etc/modprobe.d/iwlwifi.conf the following:
> options iwlmvm power_scheme=1
> options iwlwifi power_save=0
Comment 107 tomasz.belina 2020-09-15 17:57:36 UTC
I've notice this isse on my Lenovo L490. Wifi went crazy - computer started to loose connection every few minutes. Restart helped.  
# uname -r
5.4.0-47-generic

# lspci 
00:00.0 Host bridge: Intel Corporation Coffee Lake HOST and DRAM Controller (rev 0c)
00:02.0 VGA compatible controller: Intel Corporation UHD Graphics 620 (Whiskey Lake) (rev 02)
00:04.0 Signal processing controller: Intel Corporation Xeon E3-1200 v5/E3-1500 v5/6th Gen Core Processor Thermal Subsystem (rev 0c)
00:08.0 System peripheral: Intel Corporation Xeon E3-1200 v5/v6 / E3-1500 v5 / 6th/7th/8th Gen Core Processor Gaussian Mixture Model
00:12.0 Signal processing controller: Intel Corporation Cannon Point-LP Thermal Controller (rev 30)
00:14.0 USB controller: Intel Corporation Cannon Point-LP USB 3.1 xHCI Controller (rev 30)
00:14.2 RAM memory: Intel Corporation Cannon Point-LP Shared SRAM (rev 30)
00:15.0 Serial bus controller [0c80]: Intel Corporation Cannon Point-LP Serial IO I2C Controller #0 (rev 30)
00:15.2 Serial bus controller [0c80]: Intel Corporation Device 9dea (rev 30)
00:16.0 Communication controller: Intel Corporation Cannon Point-LP MEI Controller #1 (rev 30)
00:1d.0 PCI bridge: Intel Corporation Cannon Point-LP PCI Express Root Port #9 (rev f0)
00:1d.3 PCI bridge: Intel Corporation Device 9db3 (rev f0)
00:1d.7 PCI bridge: Intel Corporation Device 9db7 (rev f0)
00:1f.0 ISA bridge: Intel Corporation Cannon Point-LP LPC Controller (rev 30)
00:1f.3 Audio device: Intel Corporation Cannon Point-LP High Definition Audio Controller (rev 30)
00:1f.4 SMBus: Intel Corporation Cannon Point-LP SMBus Controller (rev 30)
00:1f.5 Serial bus controller [0c80]: Intel Corporation Cannon Point-LP SPI Controller (rev 30)
00:1f.6 Ethernet controller: Intel Corporation Ethernet Connection (6) I219-V (rev 30)
04:00.0 Non-Volatile memory controller: Toshiba Corporation Device 011a
05:00.0 Network controller: Intel Corporation Wireless-AC 9260 (rev 29)
07:00.0 Unassigned class [ff00]: Realtek Semiconductor Co., Ltd. RTS522A PCI Express Card Reader (rev 01)
Comment 108 a.katzander 2020-09-28 10:06:35 UTC
same here Asus Vivobook S15 

5.8.6-1-MANJARO

Network controller: Intel Corporation Wireless 8265 / 8275 (rev 78)
Comment 109 johnsoft 2020-10-07 03:53:31 UTC
I have same problem after suspend on lenovo ideapad 5 ( amd ryzen 7 )

Network controller: Intel Corporation Wi-Fi 6 AX200 (rev 1a)

Linux ubuntu-pc 5.8.13-050813-generic...
Comment 110 Petr Kracik 2020-10-11 13:58:34 UTC
Same on Arch 5.8.10-arch1-1 and 02:00.0 Network controller: Intel Corporation Wi-Fi 6 AX200 (rev 1a)
but same with other kernels and Intel based wifi.

it's really annoy, do not find solution what really helps. Sometimes littlebit help modprobe iwlwifi power_save=0 sometimes help move with laptop. Sometimes help turn off bluetooth on mobile from which using hotspot.

First I thought it's because Mikrotik's WiFi, but now I can see that on other networks too (even TPLink, UBNT, android hotspot, Turris). Mostly it's worse on 5ghz than 2.4GHz

If I can help somehow with debugging (wireshark, some dumps, sniff something by other wifi card. Anything what could help to find what it causes.


[466410.753117] wlp2s0: authenticate with 68:72:51:60:bd:7b
[466410.755615] wlp2s0: send auth to 68:72:51:60:bd:xx (try 1/3)
[466411.323928] wlp2s0: send auth to 68:72:51:60:bd:xx (try 2/3)
[466411.350849] wlp2s0: authenticated
[466411.353084] wlp2s0: associate with 68:72:51:60:bd:xx (try 1/3)
[466411.356407] wlp2s0: RX AssocResp from 68:72:51:60:bd:xx (capab=0x431 status=0 aid=4)
[466411.363796] wlp2s0: associated
[466412.222808] iwlwifi 0000:02:00.0: No beacon heard and the session protection is over already...
[466412.222836] wlp2s0: Connection to AP 68:72:51:60:bd:xx lost


and repeat every second
Comment 111 kotrfa 2020-10-15 20:33:02 UTC
I "solved" this by buying https://www.tp-link.com/cz/home-networking/adapter/archer-t2u-nano/ instead :-( . It's sad...
Comment 112 Martin Stolpe 2020-10-22 18:28:49 UTC
I think I got a pretty stable setup now but there were three problems on my system (I'm using IWD on my system):

1.There is currently a bug in the Kernel (5.9.1):
Discussion: https://www.reddit.com/r/archlinux/comments/jdlayz/iwd_broken_after_update/
Proposed patch: https://lore.kernel.org/linux-wireless/20201017230818.04896494%40mathy-work.localhost/

2. Using IWD with NetworkManager also causes disconnects
=> I have disabled NetworkManager on my system and use systemd-networkd to configure my wireless connection

3. I have disabled all power saving options and also 11ax in the driver
/etc/modprobe.d/iwlwifi.conf:
options iwlwifi power_save=0
options iwlwifi disable_11ax=1
options iwlmvm power_scheme=1
Comment 113 Jack Bamford 2020-11-02 20:47:07 UTC
Hi folks,

I have two laptops with this a Intel AC7260 the one laptop is a Sony Vaio Flip 14 and the other is a Probook 4540s. This bug only seems to happen on the Probook 4540s. Problematic Laptop is the 4540s.

I have been suffering this bug since 5.4 Kernel on Ubuntu 18.04, Kernel 4.15 on Ubuntu 18.04 is fine no problems but something i have noticed. Having Fast Boot enabled in the BIOS breaks the Bluetooth also causes WIFI to drop every 5mins, disabled Fast Boot has solved issue with Bluetooth not being seen and WIFI dropping out every 5 mins.

Here what i have found, changing DTIM to 50 on 5G with the Ubiquti AC Lite has seemed to fix the problem so far, when this problem is caused its mainly when there is no activity on the Network Card loads of traffic and constant pings do not cause this problem.

When SAMBA Share is mounted that is when it happens the most. Something else that is interesting it only happens on certain Routers. Ubiquti AC Lite it happens on but a BT Homehub or Smarthub this doesn't happen also tested on Sky Broadband Router and no problems neither.

Heres what i have tried before changing DTIM to 50.

Downloaded latest Driver from Intel and removed old firmware, This makes Network card not detected,

Disabled PowerManagement,
Disabled ipv6

none solved except issue has disappeared when changing DTIM to 50.

Find it strange that some Routers are fine tho. Maybe something to do with UEFI BIOS will test with Legacy and see if problem is there.

Cheers.

Jack.
Comment 114 Jack Bamford 2020-11-04 21:54:06 UTC
Update,

My HP Probook 4540s has been up for 2 days and had this No beacon heard and the time event is over already... It is not as frequent as it was tho.

I have noticed something tho, This seems to be machine specific why i say this, I have another 7260 Card which i installed on a Desktop tested with Ubuntu 18.04 and 20.04 with 5.4.0-52-generic Kernel and this Error does not happen. Output i get on both machines are,

[140029.104658] wlan0: authenticate with fc:ec:da
[140029.107120] wlan0: send auth to fc:ec:da (try 1/3)
[140029.111995] wlan0: authenticated

at least it connects on first attempt. But this issue is only on certain machines, Doesn't happen on Sony Vaio Flip 14 or a G41M-P25, Core 2 Quad Desktop but does indeed happen on mu HP Probook 4540s.
Comment 115 lukas.redlinger 2020-11-05 09:01:52 UTC
(In reply to Jack Bamford from comment #114)

you could try swapping the cards between those two devices?! Maybe it's a problem in the cards, not the notebooks?

We're having some hundred of the cards out in the wild, but still don't understand why some fail often, some sometimes and some never...
Comment 116 Jack Bamford 2020-11-05 09:08:34 UTC
(In reply to lukas.redlinger from comment #115)
> (In reply to Jack Bamford from comment #114)
> 
> you could try swapping the cards between those two devices?! Maybe it's a
> problem in the cards, not the notebooks?
> 
> We're having some hundred of the cards out in the wild, but still don't
> understand why some fail often, some sometimes and some never...

Hi,

Thanks for your reply, I have tried that that was the first thing i done but the problem still happens. 

Problem doesnt happen on Ubuntu 16.04 or Windows 10 on the HP Probook 4540s.

Thanks.

Jack.
Comment 117 mail 2020-11-05 22:04:41 UTC
Moving off of iwlwifi with an external adapter (awus036ach) using realtek-rtl8812 driver "solved" it for me.
Comment 118 Ozan Caglayan 2020-11-08 12:02:45 UTC
Created attachment 293565 [details]
Sample output from journalctl (Ubuntu 20.04, 8265wifi, kernel 5.4.0)
Comment 119 Ozan Caglayan 2020-11-08 12:05:30 UTC
Created attachment 293567 [details]
trace_cmd output (8265wifi, Kernel 5.4.0)

The issue happened multiple times during this trace_cmd.
Comment 120 Ozan Caglayan 2020-11-08 12:26:55 UTC
Created attachment 293569 [details]
trace_cmd output (8265wifi, kernel 5.4.0, debug-fw 36.9f0a2d68.0 op_mode iwlmvm)
Comment 121 Ozan Caglayan 2020-11-08 12:28:40 UTC
Created attachment 293571 [details]
fw_dbg_collect dump (8265wifi, kernel 5.4.0, debug-fw 36.9f0a2d68.0 op_mode iwlmvm)
Comment 122 Ozan Caglayan 2020-11-08 12:31:24 UTC
Hello,

I have this problem for quite a long time, both with Ubuntu 18.04 and 20.04. The issue occurs rarely but consistently. My laptop is a Dell Inc. Latitude 7490 with Intel 8265 wireless controller. I am on Kernel 5.4.0 shipped with Ubuntu 20.04. I replaced the FW with the debug version and verified that the version changed in the dmesg output. I don't observe the "No beacon heard and the time event is over already" message anywhere, though my error seems to be related to a beacon loss as well according to the system logs (see the sample_syslog.txt) attachment. Lastly, the error occurs with 2 different APs. I have a macbook air and multiple other wifi devices hanging around that does not have such connectivity issues.

I previously attached (I couldn't encrypt the files because the keys seem outdated?):
 - a sample log from journalctl
 - as well as a trace_cmd dump during which the issue occurred multiple times. 
 - An fw_dbg_collect coredump file during which the issue was there as well

I am willing to help more to resolve this issue, @devs please do use me :) Thanks!
Comment 123 a.katzander 2020-11-16 14:43:02 UTC
I tried now with fedora (5.9.8-200.fc33.x86_64) and is the same issue.Now i "solve" this wirh TP-Link TL-WN823N v2/v3 [Realtek RTL8192EU].
Comment 124 Jack Bamford 2020-11-16 16:01:31 UTC
Hi folks,

So after a few days i installed PopOS on my 4540s and this problem doesn't happen as much but the actual connection doesn't drop when No beacon heard and the time event is over already... is under dmesg live TV stream or Radio Streams doesn't stop playing im guessing this is just cosmetic but in Ubuntu the connection actually drops and live streams drops which causes Kodi to use Timeshift which doesn't happen in PopOS its actually usable again.

Output of dmesg,

[983327.928806] iwlwifi 0000:03:00.0: No beacon heard and the time event is over already...
[983327.928866] wlo1: Connection to AP fc:ec:da: lost
[983331.227857] wlo1: authenticate with fc:ec:da:
[983331.230331] wlo1: send auth to fc:ec:da:1e:a3 (try 1/3)
[983331.235718] wlo1: authenticated
[983331.237283] wlo1: associate with fc:ec:da: (try 1/3)
[983331.241497] wlo1: RX AssocResp from fc:ec:da: (capab=0x411 status=0 aid=1)
[983331.242967] wlo1: associated
[983339.959839] wlo1: Connection to AP fc:ec:da: lost
[983343.211156] wlo1: authenticate with fc:ec:da:
[983343.214274] wlo1: send auth to fc:ec:da: (try 1/3)
[983343.216853] wlo1: authenticated
[983343.221001] wlo1: associate with fc:ec:da: (try 1/3)
[983343.225177] wlo1: RX AssocResp from fc:ec:da: (capab=0x411 status=0 aid=1)
[983343.226556] wlo1: associated

Kernel Version in PopOS,
5.8.0-7625-generic

Kernel Version in Ubuntu 18.04
5.4.0-53-generic

Update of laptop on PopOS no disconnect from WIFI happens. LiveStreams still plays when dmesg shows No beacon heard and the time event is over already...
15:51:16 up 11 days, 12:21,  1 user,  load average: 1.22, 1.10, 1.08


Hope this is a bit more information.
Comment 125 Sven Köhler 2020-11-18 00:29:06 UTC
I fitted my laptop with an AX200 to get stable Wifi. But it's getting worse recently.

I cannot connect to the Hotspot of the Android phone anymore (A520F with all updates). It just failed to authenticate over and over again, each time with the infamous "No beacon heard and the session protection is over already". The phone is right next to the laptop.

Now I also get the message when I'm connected to my Router (AVM FritzBox 7490, all updates). Connection is somewhat stable though.
Comment 126 how 2020-11-27 04:58:12 UTC
any progress? No beacon and beacon lost happens everyday. sometimes can't connect anymore because no AP detected.
Comment 127 a.katzander 2020-11-27 09:47:41 UTC
Update:with latest update in Fedora i have now a very stable Wifi connection,
no ping lag,no lost packet.(tested 2 days with no issue)
Comment 128 how 2020-11-27 10:20:13 UTC
what latest update is Fedora? Which kernel version?

I just managed to get a Mint Kernel 4.15-124 working. it is LTS to 2023. So I will use this instead, is much more stable.
Comment 129 a.katzander 2020-11-27 10:27:33 UTC
(In reply to how from comment #128)
> what latest update is Fedora? Which kernel version?
> 
> I just managed to get a Mint Kernel 4.15-124 working. it is LTS to 2023. So
> I will use this instead, is much more stable.

5.9.10-200.fc33.x86_64
Comment 130 how 2020-11-27 10:46:12 UTC
thanks, but the only version I can get with the repo is 5.8 and is reported as buggy as well. So just use 4.15 I am happy with for now.
Comment 131 Ozan Caglayan 2020-12-02 08:58:56 UTC
Just installed 5.9.10 on Ubuntu 20.04. Let's see what happens.
Comment 132 Ozan Caglayan 2020-12-02 12:55:18 UTC
The issue still persists with the mainline kernel 5.9.10 on Ubuntu 20.04.

iwlwifi 0000:01:00.0: No beacon heard and the time event is over already...
wlp1s0: Connection to AP 18:31:bf:62:09:ac lost
wlp1s0: authenticate with 18:31:bf:62:09:ac
wlp1s0: send auth to 18:31:bf:62:09:ac (try 1/3)
wlp1s0: authenticated
wlp1s0: associate with 18:31:bf:62:09:ac (try 1/3)
wlp1s0: RX AssocResp from 18:31:bf:62:09:ac (capab=0x31 status=0 aid=4)
wlp1s0: associated
wlp1s0: Connection to AP 18:31:bf:62:09:ac lost
wlp1s0: Connection to AP 00:00:00:00:00:00 lost
wlp1s0: authenticate with 18:31:bf:62:09:ac
wlp1s0: send auth to 18:31:bf:62:09:ac (try 1/3)
wlp1s0: authenticated
wlp1s0: associate with 18:31:bf:62:09:ac (try 1/3)
wlp1s0: RX AssocResp from 18:31:bf:62:09:ac (capab=0x31 status=0 aid=4)
wlp1s0: associated
iwlwifi 0000:01:00.0: No beacon heard and the time event is over already...
wlp1s0: Connection to AP 18:31:bf:62:09:ac lost
wlp1s0: authenticate with 18:31:bf:62:09:ac
wlp1s0: send auth to 18:31:bf:62:09:ac (try 1/3)
wlp1s0: authenticated
wlp1s0: associate with 18:31:bf:62:09:ac (try 1/3)
wlp1s0: RX AssocResp from 18:31:bf:62:09:ac (capab=0x31 status=0 aid=4)
wlp1s0: associated
iwlwifi 0000:01:00.0: No beacon heard and the time event is over already...
wlp1s0: Connection to AP 18:31:bf:62:09:ac lost
wlp1s0: authenticate with 18:31:bf:62:09:ac
wlp1s0: send auth to 18:31:bf:62:09:ac (try 1/3)
wlp1s0: authenticated
wlp1s0: associate with 18:31:bf:62:09:ac (try 1/3)
wlp1s0: RX AssocResp from 18:31:bf:62:09:ac (capab=0x31 status=0 aid=4)
wlp1s0: associated
Comment 133 how 2020-12-03 05:44:49 UTC
well, The only kernel that works is 4.15 or may be 5.0 as well. If you need to know how to get those kernel working I can provide the details
Comment 134 Rob 2020-12-04 08:50:12 UTC
I must say, that I don't understand why this bug have so low priority. It is known for 1,5 year and it concerns all (or almost all) Intel WIFI cards. All distros forums are full of topics regarding this issue.
Some people will not even notice 3 lost pings until their wifi will not break definitely, but the experience with online meetings (kids learning, work) in time of covid is terrible.

My laptop with arch have become unusable (I need to buy external USB wifi card without Intel chipset). All "workarounds" do not work.

Kernel: Linux arch 5.9.11-arch2-1
Network controller: Intel Corporation Centrino Wireless-N 1000

gru 04 08:28:42 arch iwd[441]: Received Deauthentication event, reason: 4, from_ap: false
gru 04 08:29:46 arch iwd[441]: Received Deauthentication event, reason: 4, from_ap: false
gru 04 08:33:16 arch iwd[441]: Received Deauthentication event, reason: 4, from_ap: false
gru 04 08:35:10 arch iwd[441]: Received Deauthentication event, reason: 4, from_ap: false
gru 04 08:36:50 arch iwd[441]: Received Deauthentication event, reason: 4, from_ap: false
gru 04 08:38:57 arch iwd[441]: Received Deauthentication event, reason: 4, from_ap: false
gru 04 08:45:59 arch iwd[441]: Received Deauthentication event, reason: 4, from_ap: false
gru 04 09:01:53 arch iwd[441]: Received Deauthentication event, reason: 4, from_ap: false
gru 04 09:04:45 arch iwd[441]: Received Deauthentication event, reason: 4, from_ap: false
gru 04 09:15:24 arch iwd[441]: Received Deauthentication event, reason: 4, from_ap: false

What other info do we need to change the status of this bug? I think that all info was provided.
Comment 135 d 2020-12-05 15:56:05 UTC
Same issue on Ubuntu 20.04 and been happening since I started using linux on a Lenovo P76.

$ lspci -kvnn | sed -n '/Network/,/^$/ p'

52:00.0 Network controller [0280]: Intel Corporation Wi-Fi 6 AX200 [8086:2723] (rev 1a)
	Subsystem: Intel Corporation Wi-Fi 6 AX200 [8086:0080]
	Flags: bus master, fast devsel, latency 0, IRQ 17
	Memory at ee300000 (64-bit, non-prefetchable) [size=16K]
	Capabilities: <access denied>
	Kernel driver in use: iwlwifi
	Kernel modules: iwlwifi
 
Multiple entries in dmesg:
[60383.709055] wlp82s0: Connection to AP 74:83:c2:19:df:9e lost
[60384.149855] wlp82s0: authentication with 74:83:c2:19:df:9e timed out
[60395.563700] wlp82s0: authenticate with 74:83:c2:19:df:9d
[60395.568179] wlp82s0: send auth to 74:83:c2:19:df:9d (try 1/3)
[60396.082119] wlp82s0: send auth to 74:83:c2:19:df:9d (try 2/3)
[60396.696463] iwlwifi 0000:52:00.0: No beacon heard and the time event is over already...
[60396.696541] wlp82s0: Connection to AP 74:83:c2:19:df:9d lost
[60397.142290] wlp82s0: send auth to 74:83:c2:19:df:9d (try 3/3)
[60397.756606] iwlwifi 0000:52:00.0: No beacon heard and the time event is over already...


Syslog details:
Dec  5 03:01:11 D wpa_supplicant[802]: wlp82s0: CTRL-EVENT-STARTED-CHANNEL-SWITCH freq=5220 ht_enabled=1 ch_offset=1 ch_width=80 MHz cf1=5210 cf2=0
Dec  5 03:01:13 D wpa_supplicant[802]: wlp82s0: CTRL-EVENT-BEACON-LOSS
Dec  5 03:01:14 D wpa_supplicant[802]: message repeated 5 times: [ wlp82s0: CTRL-EVENT-BEACON-LOSS ]
Dec  5 03:01:14 D wpa_supplicant[802]: wlp82s0: CTRL-EVENT-SIGNAL-CHANGE above=0 signal=0 noise=9999 txrate=0
Dec  5 03:01:14 D wpa_supplicant[802]: wlp82s0: CTRL-EVENT-DISCONNECTED bssid=74:83:c2:19:df:9d reason=4 locally_generated=1
Dec  5 03:01:14 D NetworkManager[772]: <warn>  [1607166074.5520] sup-iface[0x55ce06446120,wlp82s0]: connection disconnected (reason -4)
Dec  5 03:01:14 D NetworkManager[772]: <info>  [1607166074.5576] device (wlp82s0): supplicant interface state: completed -> disconnected
Dec  5 03:01:14 D NetworkManager[772]: <info>  [1607166074.5577] device (p2p-dev-wlp82s0): supplicant management interface state: completed -> disconnected
Dec  5 03:01:14 D wpa_supplicant[802]: wlp82s0: Reject scan trigger since one is already pending
Dec  5 03:01:15 D wpa_supplicant[802]: wlp82s0: SME: Trying to authenticate with 74:83:c2:19:df:9d (SSID='A&D' freq=5220 MHz)
Dec  5 03:01:15 D kernel: [44577.801756] wlp82s0: authenticate with 74:83:c2:19:df:9d
Dec  5 03:01:15 D NetworkManager[772]: <info>  [1607166075.8417] device (wlp82s0): supplicant interface state: disconnected -> authenticating
Dec  5 03:01:15 D NetworkManager[772]: <info>  [1607166075.8417] device (p2p-dev-wlp82s0): supplicant management interface state: disconnected -> authenticating
Dec  5 03:01:15 D kernel: [44577.806415] wlp82s0: send auth to 74:83:c2:19:df:9d (try 1/3)
Dec  5 03:01:16 D kernel: [44578.421045] iwlwifi 0000:52:00.0: No beacon heard and the time event is over already...
Dec  5 03:01:16 D kernel: [44578.421084] wlp82s0: Connection to AP 74:83:c2:19:df:9d lost
Dec  5 03:01:17 D kernel: [44579.161075] wlp82s0: send auth to 74:83:c2:19:df:9d (try 2/3)
Dec  5 03:01:17 D kernel: [44579.775600] iwlwifi 0000:52:00.0: No beacon heard and the time event is over already...
Dec  5 03:01:17 D kernel: [44579.775665] wlp82s0: Connection to AP 74:83:c2:19:df:9d lost
Dec  5 03:01:18 D kernel: [44580.057137] wlp82s0: send auth to 74:83:c2:19:df:9d (try 3/3)
Dec  5 03:01:18 D kernel: [44580.671557] iwlwifi 0000:52:00.0: No beacon heard and the time event is over already...
Dec  5 03:01:18 D kernel: [44580.671624] wlp82s0: Connection to AP 74:83:c2:19:df:9d lost
Dec  5 03:01:19 D kernel: [44581.080601] wlp82s0: authentication with 74:83:c2:19:df:9d timed out
Dec  5 03:01:19 D NetworkManager[772]: <info>  [1607166079.1438] device (wlp82s0): supplicant interface state: authenticating -> disconnected



My Asus laptops, using the Broadcom drivers, have no such issues and stay connected until a reboot.

Using the GUI to go to airplane mode and back to wireless mode fixes it.
I suspect rmmod(1) and insmod(1) would also fix this issue. But why does it happen for an Intel driver and not Broadcom?

It happens about 3 hours after I stop being active on the laptop but I don't have an exact time.
Comment 136 d 2020-12-05 16:07:35 UTC
I forgot to add that I'm also seeing if wifi-powersave is the issue. It was set to 3 and I'm trying 2.

See 
1. https://askubuntu.com/questions/1230140/wifi-keeps-dropping-out-ubuntu-20-04-and-broadcom-wireless-adaptor
2. https://gist.github.com/jcberthon/ea8cfe278998968ba7c5a95344bc8b55 

And I'll report back in a day or two to see if this fixes the issue.
Comment 137 Jack Bamford 2020-12-05 16:38:51 UTC
(In reply to d from comment #136)
> I forgot to add that I'm also seeing if wifi-powersave is the issue. It was
> set to 3 and I'm trying 2.
> 
> See 
> 1.
> https://askubuntu.com/questions/1230140/wifi-keeps-dropping-out-ubuntu-20-04-
> and-broadcom-wireless-adaptor
> 2. https://gist.github.com/jcberthon/ea8cfe278998968ba7c5a95344bc8b55 
> 
> And I'll report back in a day or two to see if this fixes the issue.

Disabling Power Management doesn't make much difference. I changed DTIM to 3 on my UAP which makes this problem less frequent but WIFI never disconnects, when this happens i can still ping anything.
Comment 138 d 2020-12-06 16:02:22 UTC
Jack,

You are correct, no change for me. The kernel dropped the connection despite the change, just like it has. 

It looks like the entry  https://bugzilla.kernel.org/show_bug.cgi?id=205299 is a duplicate of this. I don't have fast boot set so it's not that issue.

https://bugzilla.kernel.org/show_bug.cgi?id=210491 may be related.

I was just reading earlier entries and saw the ref to the wiki for debugging wifi. Most of the data is included but I did miss some items.

$ ethtool -i wlp82s0 | grep firmware
firmware-version: 48.4fa0041f.0

But it does seem closely related to hibernate/sleep function that I believe I have turned off but looks like it's triggering anyway. And I can't figure out where else to check.
Comment 139 Dirk Jonker 2020-12-14 10:24:11 UTC
Has anyone tried the advice from the iwlwifi module? https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi#about_platform_noise

> Disable Wi-Fi's power save (prevent the PCIe link to go to power save):
> power_scheme=1 module parameter for iwlmvm

I have added this to /etc/modprobe.d/wifi-powersave.conf

    options iwlmvm power_scheme=1

And reloaded the wifi kernel modules (I'm on Debian testing with kernel 5.9). I did not run this for long enough to check whether it permanently fixes the issue though, but given that I cannot reproduce it, it might take some time.
Comment 140 Ozan Caglayan 2020-12-14 10:26:42 UTC
I'm already on 5GHz so I think this is unlikely to be relevant..
Comment 141 how 2020-12-14 10:30:10 UTC
the power option does not work for me. It also reported not to work by some users. You shouldn't need to use the power option. It is just a really bad bug. 
Just cannot use the internet with this bug.
Comment 142 Ozan Caglayan 2020-12-14 10:35:12 UTC
If only we had a way to trigger it, we could do git bisect and find the relevant commit. But it happens very sporadically.
Comment 143 Sergey Slizovskiy 2020-12-14 11:06:32 UTC
As I commented some time ago, disabling powersave  has worked for me.  Still, it's a shameful bug to be fixed.
Comment 144 dagthree7 2020-12-14 11:09:34 UTC
Created attachment 294123 [details]
attachment-28808-0.html

Any updates on this

On Mon, Dec 14, 2020, 2:06 AM <bugzilla-daemon@bugzilla.kernel.org> wrote:

> https://bugzilla.kernel.org/show_bug.cgi?id=203709
>
> --- Comment #143 from Sergey Slizovskiy (sereza@gmail.com) ---
> As I commented some time ago, disabling powersave  has worked for me.
> Still,
> it's a shameful bug to be fixed.
>
> --
> You are receiving this mail because:
> You are on the CC list for the bug.
Comment 145 Dirk Jonker 2020-12-14 13:12:30 UTC
(In reply to how from comment #141)
> the power option does not work for me. It also reported not to work by some
> users. You shouldn't need to use the power option. It is just a really bad
> bug. 
> Just cannot use the internet with this bug.

I agree that the power option should not be needed. But if it "fixes" the problem, it may give us a better idea of what might be causing the bug.
Comment 146 Dirk Jonker 2020-12-15 08:29:35 UTC
The iwlmvm power_scheme=1 option also does not seems to help the issue for my intel 8265.
Comment 147 dagthree7 2020-12-15 10:44:06 UTC
Created attachment 294137 [details]
attachment-15444-0.html

Any updates on this

On Mon, Dec 14, 2020, 11:29 PM <bugzilla-daemon@bugzilla.kernel.org> wrote:

> https://bugzilla.kernel.org/show_bug.cgi?id=203709
>
> --- Comment #146 from Dirk Jonker (dirkjonker@gmail.com) ---
> The iwlmvm power_scheme=1 option also does not seems to help the issue for
> my
> intel 8265.
>
> --
> You are receiving this mail because:
> You are on the CC list for the bug.
Comment 148 Ozan Caglayan 2020-12-15 14:30:26 UTC
I just compiled and started to use the 4.19.163 stable kernel for Ubuntu 20.04. I assumed from the discussions that this version did not have the issue. Let's see what happens.
Comment 149 how 2020-12-15 22:54:17 UTC
4.15 definitely works. I think up to kernel 5.0 will work. Problem starts with 5.1 as reported.

So 4.19 will work.
Comment 150 Jack Bamford 2020-12-16 05:26:09 UTC
(In reply to how from comment #149)
> 4.15 definitely works. I think up to kernel 5.0 will work. Problem starts
> with 5.1 as reported.
> 
> So 4.19 will work.

Kernel version 4.15+ will work with no problems. The problem is with kernel version 5+ since its been patched for new vulnerabilities for WIFI this is where the problem starts. What ever the security patch has done this is where the issue relies.
Comment 151 how 2020-12-16 09:44:47 UTC
4.15 is lts for several years more. So can't be less secure than 5+
Comment 152 Ozan Caglayan 2020-12-16 10:42:18 UTC
OK, between 5.0 and 5.1, I see a commit that may be related as people previously also mentioned that this may be due to buggy APs

I'm currently running v5.0-rc1 to see if it occurred between 4.9 and 5.


commit babea2d4fe76c6515da6231e8d940044bad686b1
Author: Andrei Otcheretianski <andrei.otcheretianski@intel.com>
Date:   Sun Nov 4 21:56:31 2018 +0200

    iwlwifi: mvm: Disconnect on large beacon loss
    
    Some buggy APs stop sending beacons, but continue to ack our null data
    packets or even run some traffic. It's better not to stick connected to
    such an AP forever, so disconnect after some larger beacon loss
    threshold is crossed.
    
    Signed-off-by: Andrei Otcheretianski <andrei.otcheretianski@intel.com>
    Signed-off-by: Luca Coelho <luciano.coelho@intel.com>
Comment 153 p.g.richardson 2020-12-16 11:02:17 UTC
Running Fedora and recently upgraded to Fedora 32 (kernel 5.9.13-100-fc32) so have been experiencing this problem. Have an old Fedora 30 kernel 5.0.9-301-fc30 still installed so rebooted to using that instead and this still suffers BEACON-LOSS events.

> 04:00.0 Network controller [0280]: Intel Corporation Wireless 8265 / 8275
> [8086:24fd] (rev 78)

log of NetworkManger:
> Dec 16 10:31:41 pj NetworkManager[943]: <info>  [1608114701.0423] device
> (wlp4s0): supplicant interface state: 4-way handshake -> completed
> Dec 16 10:31:41 pj NetworkManager[943]: <info>  [1608114701.0437] device
> (p2p-dev-wlp4s0): supplicant management interface state: 4-way handshake ->
> completed
> Dec 16 10:41:40 pj NetworkManager[943]: <warn>  [1608115300.5121]
> sup-iface[0x5570298a2920,wlp4s0]: connection disconnected (reason -4)
> Dec 16 10:41:40 pj NetworkManager[943]: <info>  [1608115300.5260] device
> (wlp4s0): supplicant interface state: completed -> disconnected
> Dec 16 10:41:40 pj NetworkManager[943]: <info>  [1608115300.5261] device
> (p2p-dev-wlp4s0): supplicant management interface state: completed ->
> disconnected
> Dec 16 10:41:40 pj NetworkManager[943]: <info>  [1608115300.6175] device
> (wlp4s0): supplicant interface state: disconnected -> scanning


pinging my router:
> 64 bytes from os (<ip>): icmp_seq=584 ttl=64 time=0.772 ms
> 64 bytes from os (<ip>): icmp_seq=585 ttl=64 time=0.813 ms
> 64 bytes from os (<ip>): icmp_seq=586 ttl=64 time=0.934 ms
> From pj (<ip>) icmp_seq=588 Destination unreachable: Address unreachable
> From pj (<ip>) icmp_seq=589 Destination unreachable: Address unreachable
> From pj (<ip>) icmp_seq=590 Destination unreachable: Address unreachable
> From pj (<ip>) icmp_seq=591 Destination unreachable: Address unreachable
> From pj (<ip>) icmp_seq=592 Destination unreachable: Address unreachable
> From pj (<ip>) icmp_seq=593 Destination unreachable: Address unreachable
> From pj (<ip>) icmp_seq=594 Destination unreachable: Address unreachable
> From pj (<ip>) icmp_seq=595 Destination unreachable: Address unreachable
> From pj (<ip>) icmp_seq=596 Destination unreachable: Address unreachable
> 64 bytes from os (<ip>): icmp_seq=598 ttl=64 time=2.51 ms
> 64 bytes from os (<ip>): icmp_seq=599 ttl=64 time=0.612 ms

log of wpa_supplicant:
> Dec 16 10:31:32 pj wpa_supplicant[1274]: nl80211: Event message available
> Dec 16 10:31:32 pj wpa_supplicant[1274]: nl80211: Ignored event 64
> (NL80211_CMD_NOTIFY_CQM) for foreign interface (ifindex 3 wdev 0x0)
> Dec 16 10:31:32 pj wpa_supplicant[1274]: nl80211: Drv Event 64
> (NL80211_CMD_NOTIFY_CQM) received for wlp4s0
> Dec 16 10:31:32 pj wpa_supplicant[1274]: nl80211: Beacon loss event
> Dec 16 10:31:32 pj wpa_supplicant[1274]: wlp4s0: Event BEACON_LOSS (53)
> received
> Dec 16 10:31:32 pj wpa_supplicant[1274]: wlp4s0: CTRL-EVENT-BEACON-LOSS
> Dec 16 10:31:32 pj wpa_supplicant[1274]: RTM_NEWLINK: ifi_index=3
> ifname=wlp4s0 operstate=2 linkmode=1 ifi_family=0 ifi_flags=0x1003 ([UP])
> Dec 16 10:31:32 pj wpa_supplicant[1274]: nl80211: Event message available
> Dec 16 10:31:32 pj wpa_supplicant[1274]: nl80211: Ignored event 20
> (NL80211_CMD_DEL_STATION) for foreign interface (ifindex 3 wdev 0x0)
> Dec 16 10:31:32 pj wpa_supplicant[1274]: nl80211: Drv Event 20
> (NL80211_CMD_DEL_STATION) received for wlp4s0
> Dec 16 10:31:32 pj wpa_supplicant[1274]: nl80211: Delete station
> 02:50:7f:c1:b8:16
> Dec 16 10:31:32 pj wpa_supplicant[1274]: nl80211: Event message available
> Dec 16 10:31:32 pj wpa_supplicant[1274]: nl80211: Ignored event 39
> (NL80211_CMD_DEAUTHENTICATE) for foreign interface (ifindex 3 wdev 0x0)
> Dec 16 10:31:32 pj wpa_supplicant[1274]: nl80211: Drv Event 39
> (NL80211_CMD_DEAUTHENTICATE) received for wlp4s0
> Dec 16 10:31:32 pj wpa_supplicant[1274]: nl80211: MLME event 39
> (NL80211_CMD_DEAUTHENTICATE) on wlp4s0(ac:ed:5c:73:a5:28)
> A1=02:50:7f:c1:b8:16 A2=ac:ed:5c:73:a5:28
> Dec 16 10:31:32 pj wpa_supplicant[1274]: nl80211: MLME event frame -
> hexdump(len=26): c0 00 00 00 02 50 7f c1 b8 16 ac ed 5c 73 a5 28 02 50 7f c1
> b8 16 00 00 04 00
> Dec 16 10:31:32 pj wpa_supplicant[1274]: nl80211: Deauthenticate event
> Dec 16 10:31:32 pj wpa_supplicant[1274]: wlp4s0: Event DEAUTH (11) received
> Dec 16 10:31:32 pj wpa_supplicant[1274]: wlp4s0: Deauthentication
> notification
> Dec 16 10:31:32 pj wpa_supplicant[1274]: wlp4s0:  * reason 4
> (DISASSOC_DUE_TO_INACTIVITY) locally_generated=1
> Dec 16 10:31:32 pj wpa_supplicant[1274]: wlp4s0:  * address 02:50:7f:c1:b8:16
> Dec 16 10:31:32 pj wpa_supplicant[1274]: Deauthentication frame IE(s) -
> hexdump(len=0): [NULL]
> Dec 16 10:31:32 pj wpa_supplicant[1274]: wlp4s0: CTRL-EVENT-DISCONNECTED
> bssid=02:50:7f:c1:b8:16 reason=4 locally_generated=1
> Dec 16 10:31:32 pj wpa_supplicant[1274]: wlp4s0: Auto connect enabled: try to
> reconnect (wps=0/0 wpa_state=9)
Comment 154 p.g.richardson 2020-12-16 15:10:56 UTC
Installed & booted the following kernels and still saw the same log entries:
  * 4.20.16-200.fc29
  * 4.15.9-300.fc27

I have updated the firmware of my access point and router so will see if that makes a difference.
Comment 155 p.g.richardson 2020-12-16 19:30:45 UTC
(In reply to p.g.richardson from comment #154)
> Installed & booted the following kernels and still saw the same log entries:
>   * 4.20.16-200.fc29
>   * 4.15.9-300.fc27
> 
> I have updated the firmware of my access point and router so will see if
> that makes a difference.

After upgrade of the firmware of both my router (draytek vigor 2862ac) and access-point (draytek vigor AP903) to the latest recommended [1][2], I've experienced none of the symptoms while running either kernel 4.20-16 or 5.9.13-100.

So, in my case, a bug in the router firmware has been the culprit. 

[1] https://www.draytek.co.uk/support/downloads/vigor-2862
[2] https://www.draytek.co.uk/support/downloads/vigorap-903
Comment 156 lubomir.sery 2020-12-16 20:31:44 UTC
Hi,
I tested sever kernels and can say (very likely), this bug is not in 5.0x kernel (never experienced this issue), but in kernel 5.1 always got this bug.
Lubo
Comment 157 Jack Bamford 2020-12-27 02:02:42 UTC
Hi,

Just an update, I have fixed the issue with No Beacon, it seems to be related to Ubiqutis Firmware i had a issue with my Access Point which i had to fix after a firmware update. Spent 16 hours straight trying to solve the issue i had with the AP. UniFi firmware 4.3.21 and UniFi firmware 4.3.24 both has issues i downgraded to 4.3.20 and i havent had this No Beacon heard for the past 3 days now.

If anyone has a Ubiquti Access Point i would recommend downgrade to 4.3.20 version, the past two firmware updates are known to have problems.

Hope this helps.

Cheers.

Jack.
Comment 158 d 2020-12-27 03:03:05 UTC
Created attachment 294355 [details]
attachment-10169-0.html

How would I tell what my access point is running? 

And why this isn't an issue on the two machines with NON-Intel chips. The Broadcom chipset doesn't have this issue.



So I'm not convinced, at least not yet.



David





---- On Sun, 27 Dec 2020 02:02:42 +0000  <bugzilla-daemon@bugzilla.kernel.org> wrote ----


https://bugzilla.kernel.org/show_bug.cgi?id=203709 
 
--- Comment #157 from Jack Bamford (mailto:jack@violetdragonsnetwork.co.uk) --- 
Hi, 
 
Just an update, I have fixed the issue with No Beacon, it seems to be related 
to Ubiqutis Firmware i had a issue with my Access Point which i had to fix 
after a firmware update. Spent 16 hours straight trying to solve the issue i 
had with the AP. UniFi firmware 4.3.21 and UniFi firmware 4.3.24 both has 
issues i downgraded to 4.3.20 and i havent had this No Beacon heard for the 
past 3 days now. 
 
If anyone has a Ubiquti Access Point i would recommend downgrade to 4.3.20 
version, the past two firmware updates are known to have problems. 
 
Hope this helps. 
 
Cheers. 
 
Jack.
Comment 159 Jack Bamford 2020-12-27 03:13:54 UTC
(In reply to d from comment #158)
> Created attachment 294355 [details]
> attachment-10169-0.html
> 
> How would I tell what my access point is running? 
> 
> And why this isn't an issue on the two machines with NON-Intel chips. The
> Broadcom chipset doesn't have this issue.
> 
> 
> 
> So I'm not convinced, at least not yet.
> 
> 
> 
> David
> 
> 
> 
> 
> 
> ---- On Sun, 27 Dec 2020 02:02:42 +0000 
> <bugzilla-daemon@bugzilla.kernel.org> wrote ----
> 
> 
> https://bugzilla.kernel.org/show_bug.cgi?id=203709 
>  
> --- Comment #157 from Jack Bamford (mailto:jack@violetdragonsnetwork.co.uk)
> --- 
> Hi, 
>  
> Just an update, I have fixed the issue with No Beacon, it seems to be
> related 
> to Ubiqutis Firmware i had a issue with my Access Point which i had to fix 
> after a firmware update. Spent 16 hours straight trying to solve the issue i 
> had with the AP. UniFi firmware 4.3.21 and UniFi firmware 4.3.24 both has 
> issues i downgraded to 4.3.20 and i havent had this No Beacon heard for the 
> past 3 days now. 
>  
> If anyone has a Ubiquti Access Point i would recommend downgrade to 4.3.20 
> version, the past two firmware updates are known to have problems. 
>  
> Hope this helps. 
>  
> Cheers. 
>  
> Jack.


No idea but anything is possible. What Access Point do you have? if its Ubiquti you can retrieve it by clicking on your AP and looking down Overview in the controller.
Comment 160 NHonda 2020-12-27 13:50:10 UTC
(In reply to Ozan Caglayan from comment #152)
> OK, between 5.0 and 5.1, I see a commit that may be related as people
> previously also mentioned that this may be due to buggy APs
> 
> I'm currently running v5.0-rc1 to see if it occurred between 4.9 and 5.
> 
> 
> commit babea2d4fe76c6515da6231e8d940044bad686b1
> Author: Andrei Otcheretianski <andrei.otcheretianski@intel.com>
> Date:   Sun Nov 4 21:56:31 2018 +0200
> 

According to the above commit, 
I changed the timeout value
```
#define IWL_MVM_MISSED_BEACONS_THRESHOLD_LONG 16
```
in drivers/net/wireless/intel/iwlwifi/mvm/mvm.h to
```
#define IWL_MVM_MISSED_BEACONS_THRESHOLD_LONG 128
```

Now everything goes well without any change of module parameters.
It seems the timeout value is too small in my case.
Comment 161 Brian Litzinger 2020-12-30 06:52:03 UTC
I have run into this bug for a few months, but only now started looking for a solution.

I am running Debian 10 on a Lenovo Thinkpad E570 with an AX200 Wifi card talking to a ASUS AX6100, though this same problem manifests itself with other access points.  

My kernel is currently Linux x501b 5.9.0-0.bpo.2-amd64 #1 SMP Debian 5.9.6-1~bpo10+1 (2020-11-19) x86_64 GNU/Linux

Changing the beacon interval to 50ms has during the last 12 hours eliminated the problem.  I generally experienced the issue 3 to 4 times per day, so I am keeping my fingers crossed.

By the way, I can trigger BEACON LOSS on demand by connecting via 2.4GHz and running my microwave oven.
Comment 162 Eaglet 2021-01-03 11:24:16 UTC
Salute, comrades! 
In order to avoid the BEACON LOSS problem, you need to set BEACON intervals in your access points: for 5.8 GHz 100, for 2.4 GHz 200. After setting these values, the problem will disappear !!!
Comment 163 Sergey Slizovskiy 2021-01-03 12:41:03 UTC
Sounds reasonable, but not a proper solution for the laptop to work with any access point.  Can the expected BEACON intervals be raised in the WLAN driver?

(In reply to Eaglet from comment #162)
> Salute, comrades! 
> In order to avoid the BEACON LOSS problem, you need to set BEACON intervals
> in your access points: for 5.8 GHz 100, for 2.4 GHz 200. After setting these
> values, the problem will disappear !!!
Comment 164 d 2021-01-03 16:22:40 UTC
And not everyone has access to the change the beacon settings in the access point. The real fix is the change the NHonda committed (3 posts away, on 27 Dec).

And not everyone has the knowledge to build their own kernels.
Comment 165 Sven Köhler 2021-01-03 19:23:46 UTC
Which Wifi-driver would say "I'm receiving too many beacons, let's disconnect"?
100 is pretty much the standard setting for all DD-WRT and Tomato routers. Also ASUS's router firmware uses it as a default. This doesn't sound reasonable at all! (a workaround is always though)
Comment 166 Sergey Slizovskiy 2021-01-03 19:41:26 UTC
I am not an expert, but the name of the variable: "IWL_MVM_MISSED_BEACONS_THRESHOLD_LONG" sounds like these are the MISSED beacons. So I can interpret it that it needs to seem to miss 128 beacons before disconnecting (instead of a smaller amount).

(In reply to Sven Köhler from comment #165)
> Which Wifi-driver would say "I'm receiving too many beacons, let's
> disconnect"?
> 100 is pretty much the standard setting for all DD-WRT and Tomato routers.
> Also ASUS's router firmware uses it as a default. This doesn't sound
> reasonable at all! (a workaround is always though)
Comment 167 Sven Köhler 2021-01-03 20:02:50 UTC
Sorry, I was responding to comment #162 suggesting to _increase_ the time between beacons sent by the access point.

(In reply to Sergey Slizovskiy from comment #166)
> I am not an expert, but the name of the variable:
> "IWL_MVM_MISSED_BEACONS_THRESHOLD_LONG" sounds like these are the MISSED
> beacons. So I can interpret it that it needs to seem to miss 128 beacons
> before disconnecting (instead of a smaller amount).
Comment 168 kernelspace 2021-01-06 03:57:05 UTC
This bug is not solely an Intel issue, it’s the very same on a Qualcomm Atheros AR93xx Wireless Network Adapter. The latest kernel made the network very unstable, ping got worse and the network would deauthenticate a lot.

(Reason: 9=STA_REQ_ASSOC_WITHOUT_AUTH)

Please fix this asap!
Comment 169 Eaglet 2021-01-06 07:12:51 UTC
(In reply to kernelspace from comment #168)
> This bug is not solely an Intel issue, it’s the very same on a Qualcomm
> Atheros AR93xx Wireless Network Adapter. The latest kernel made the network
> very unstable, ping got worse and the network would deauthenticate a lot.
> 
> (Reason: 9=STA_REQ_ASSOC_WITHOUT_AUTH)
> 
> Please fix this asap!

I agree with you that this is not an Intel driver problem. This is a problem with the AP drivers and the NDIS EEPROM of the APs. Also, this problem can be caused by sysctl settings in access points - this is a consequence of incorrectly written NDIS EEPROM for the WiFi module and its access point drivers for the Linux kernel.
I had a similar issue with hotspot on OpenWRT. The problem was almost completely solved by more liberal sysctl settings in the OpenWRT access point.
Comment 170 Jack Bamford 2021-01-06 07:19:40 UTC
(In reply to Eaglet from comment #169)
> (In reply to kernelspace from comment #168)
> > This bug is not solely an Intel issue, it’s the very same on a Qualcomm
> > Atheros AR93xx Wireless Network Adapter. The latest kernel made the network
> > very unstable, ping got worse and the network would deauthenticate a lot.
> > 
> > (Reason: 9=STA_REQ_ASSOC_WITHOUT_AUTH)
> > 
> > Please fix this asap!
> 
> I agree with you that this is not an Intel driver problem. This is a problem
> with the AP drivers and the NDIS EEPROM of the APs. Also, this problem can
> be caused by sysctl settings in access points - this is a consequence of
> incorrectly written NDIS EEPROM for the WiFi module and its access point
> drivers for the Linux kernel.
> I had a similar issue with hotspot on OpenWRT. The problem was almost
> completely solved by more liberal sysctl settings in the OpenWRT access
> point.

Downgrading the Firmware on my UAP Lite solved the problem for me ive tried known Distros that was causing me issues, Ubuntu 18.04, 20.04, PopOS after downgrading to 4.3.20.11298 problem is solved. Ive tested with both 4.3.21, 4.3.24 are problematic i was also suffering with drops out on my Phone so not sure if it was related to the issue i was seeing on Ubuntu but 4.3.20.11298 solved the issue.
Comment 171 Jack Bamford 2021-01-06 07:49:27 UTC
In my Testing to trigger it, refreshing or reloading the DHCP Server on pfSense. Problem started back in November 1st last year.

Hope this gives more of a idea to what is happening.
Comment 172 mikezackles 2021-01-09 16:24:03 UTC
Created attachment 294585 [details]
Module parameter for beacon timeout

Building on Ozan and Nhonda's observations, I made this little patch that makes the beacon timeout a runtime-configurable kernel parameter, keeping the default of 16 missed beacons. It seems to resolve the issue for me (with a timeout of 256 missed beacons).

Even if this is an issue entirely external to iwlwifi (buggy AP etc.), it seems that other drivers both for Linux and other operating systems tolerate these conditions, so there should still be some sort of fix in iwlwifi.

Without this fix, my wifi becomes completely unusable for long stretches due to repeated disconnects every few days (likely correlated with network conditions).
Comment 173 Eaglet 2021-01-09 16:43:49 UTC
(In reply to mikezackles from comment #172)
> Created attachment 294585 [details]
> Module parameter for beacon timeout
> 
> Building on Ozan and Nhonda's observations, I made this little patch that
> makes the beacon timeout a runtime-configurable kernel parameter, keeping
> the default of 16 missed beacons. It seems to resolve the issue for me (with
> a timeout of 256 missed beacons).
> 
> Even if this is an issue entirely external to iwlwifi (buggy AP etc.), it
> seems that other drivers both for Linux and other operating systems tolerate
> these conditions, so there should still be some sort of fix in iwlwifi.
> 
> Without this fix, my wifi becomes completely unusable for long stretches due
> to repeated disconnects every few days (likely correlated with network
> conditions).

This is an access point problem only, not the iwlwifi driver!!! The source of the access point problem:
1) problems in the WiFi hotspot driver;
2) the problem is in the EEPROM of the WiFi chip;
3) the problem is in the settings of the access point software (wpasupplicant, hostapd & etc ...) incompatible with the WiFi driver of the access point.

From my experience using OpenWRT, I can definitely say that such a problem is quickly fixed by the correct software parameters (wpasupplicant, hostapd & etc ...) for the WiFI driver used by the access point.

If you patch the iwlwifi driver as you want (reduce BEACON to a critically possible level), then you will encounter degradation in the performance of the WiFi system on your laptop / computer, i.e., you will actually decrease the speed parameters of your Linux system when working with network.
I repeat for those who do not check this problem and do not find out its causes in practice: THIS IS NOT A PROBLEM of the iwlwifi driver, IT IS EXCLUSIVELY AN ACCESS POINT PROBLEM !!!
Comment 174 mikezackles 2021-01-09 17:13:56 UTC
> From my experience using OpenWRT, I can definitely say that such a problem
> is quickly fixed by the correct software parameters (wpasupplicant, hostapd
> & etc ...) for the WiFI driver used by the access point.

I'm glad you found a fix that works for you, but not everyone runs OpenWRT or is in a position to make many customizations at the access point. If other drivers are tolerating this just fine, there isn't much incentive for commercial vendors to worry about it.
 
> If you patch the iwlwifi driver as you want (reduce BEACON to a critically
> possible level), then you will encounter degradation in the performance of
> the WiFi system on your laptop / computer, i.e., you will actually decrease
> the speed parameters of your Linux system when working with network.

If you look back at my patch and what I wrote, it defaults to the existing behavior (16 missed timeouts), but it allows people who are out of options to customize the timeout so that they can actually use the network. I'm not sure why that upsets you.

Even if this is an access point bug, it seems to be a widely deployed one, making it something that (imo) iwlwifi should worry about. Something along the lines of my patch acknowledges that the existing behavior is the desired one but provides a workaround when that's not enough.
Comment 175 kotrfa 2021-01-09 17:21:11 UTC
Eh, I don't really follow the discussion anymore, but consider that:

1. I can reproduce the problem with my intel wifi (Intel 8260) on many different access points where none of other ~15+ devices has any issues at all
2. The problem does not show up on any access points if I use the exact same hardware _but_ the wifi adapter (TP-Link AC600 wireless Realtek RTL8811AU) without any obvious drawbacks.

It's really hard to see how the above could be the case if it was "EXCLUSIVELY AN ACCESS POINT PROBLEM"
Comment 176 Eaglet 2021-01-09 18:13:15 UTC
(In reply to kotrfa from comment #175)
> Eh, I don't really follow the discussion anymore, but consider that:
> 
> 1. I can reproduce the problem with my intel wifi (Intel 8260) on many
> different access points where none of other ~15+ devices has any issues at
> all
> 2. The problem does not show up on any access points if I use the exact same
> hardware _but_ the wifi adapter (TP-Link AC600 wireless Realtek RTL8811AU)
> without any obvious drawbacks.
> 
> It's really hard to see how the above could be the case if it was
> "EXCLUSIVELY AN ACCESS POINT PROBLEM"

You were told what the problem is: it's in the access points. Variants of the problem in points of access by me are given in the previous message. If something is difficult for you to understand, then you have not tested what I tested (I described the causes of the problem in access points in three points). Understanding only happens if you have relevant experience, which you do not yet have, but I do. And if so, then not a troll!
I have tested all possible causes (indicated in three points). And you could not test it and have the audacity to point out!
Comment 177 d 2021-01-09 18:31:12 UTC
Eaglet:

I'm sorry, I don't see kotrfa@ as a troll. I have the same issue. 

How long were you running your tests for? We're talking about days and if your tests only ran for a few hours you may not have triggered the issue.

When I'm running my ThinkPad P73 under Windows (god save my soul) there is no issue, for days. Under Ubuntu 20.04, I get a disconnect between 1am and 4am 90% of the time. 

So I don't believe it's a firmware or AP issue, at least not at this point.

Also, how would the average user, who can't compile a kernel access the parameter you've made available? I don't believe there is a CLI or configuration file that would allow this to be set at boot time.

I can add that my two Asus laptops and Asus PC motherboard, all running Ubuntu 20.04 but using the Broadcom chips do not exhibit this failure.

This is all with an Ubiquiti AP that I don't have access to change settings on (even if I knew the Unifi interface).

So if you can explain why, when Windows works on this laptop, but linux fails I'd be interested. If you can tell me what I should ask the network people to look for in their logs or settings it may help get to the bottom of this.
Comment 178 Eaglet 2021-01-09 18:54:52 UTC
(In reply to d from comment #177)
> Eaglet:
> 
> I'm sorry, I don't see kotrfa@ as a troll. I have the same issue. 
> 
> How long were you running your tests for? We're talking about days and if
> your tests only ran for a few hours you may not have triggered the issue.
> 
> When I'm running my ThinkPad P73 under Windows (god save my soul) there is
> no issue, for days. Under Ubuntu 20.04, I get a disconnect between 1am and
> 4am 90% of the time. 
> 
> So I don't believe it's a firmware or AP issue, at least not at this point.
> 
> Also, how would the average user, who can't compile a kernel access the
> parameter you've made available? I don't believe there is a CLI or
> configuration file that would allow this to be set at boot time.
> 
> I can add that my two Asus laptops and Asus PC motherboard, all running
> Ubuntu 20.04 but using the Broadcom chips do not exhibit this failure.
> 
> This is all with an Ubiquiti AP that I don't have access to change settings
> on (even if I knew the Unifi interface).
> 
> So if you can explain why, when Windows works on this laptop, but linux
> fails I'd be interested. If you can tell me what I should ask the network
> people to look for in their logs or settings it may help get to the bottom
> of this.

The problem is actually much more interesting. I was able to reproduce this issue when my laptop with Intel Link 5100 chip was connected to an OpenWRT router based on MTK7620A chip. With the standard WiFi chip EEPROM, my laptop had no connection problems even when trying to use 80211n mode, which is not recommended by the OpenWRT developers. After downgrade the current version of the WiFi EEPROM chip on my router to the EEPROM version from 2016 and using the 80211n mode, problems began with BEACON-LOST: a lot of this appeared in the logs of my laptop. After disabling 80211n support and enabling only 80211g mode in OpenWRT (version 19.07.5 /latest/) with 2016 EEPROM firmware, BEACON-LOST messages no longer appeared. Then I updated the EEPROM of my WiFi chip (MTK7620A) on the router again to the 2019 version (without changing the installed version and OpenWRT settings). BEACON-LOST messages also stopped appearing. From what I concluded that the problem is/or/and: 1) EEPROM; 2) WiFi router driver; 3) settings of the 80211 operating mode in software AP. Or in a combination of these reasons.
As you can see, the problem is not in the Intel driver for Linux, but in the access point.
I hope I have satisfied your interest on this issue.
Comment 179 mikezackles 2021-01-09 19:11:30 UTC
> Also, how would the average user, who can't compile a kernel access the
> parameter you've made available? I don't believe there is a CLI or
> configuration file that would allow this to be set at boot time.

There is no way for a user to change this timeout without recompiling the kernel. My patch makes it a module parameter configurable at boot in the standard ways (https://wiki.archlinux.org/index.php/Kernel_module#Setting_module_options) or by writing to the file /sys/module/iwlwifi/parameters/beacon_timeout as root. Absent a more complete fix, it would be nice to upstream something along the lines of my patch to the mainline kernel seeing as how this has been an issue for so long, and many users don't have a workaround. That way users could configure the timeout without needing to compile a kernel. But I'm just a frustrated user, not (at all) an expert in this area, and I don't have a ton of time to go digging.
Comment 180 Sergey Slizovskiy 2021-01-09 19:14:24 UTC
Let us stop blaming WiFi router. When it works for other Operating Systems, you can imagine a situation that you will not be allowed to change the driver of the router.  
What makes me feel the problem is not yet  fully understood is that there are at least two workarounds: one with BEACON wait interval,  another with "add to /etc/modprobe.d/iwlwifi.conf the following:
> options iwlmvm power_scheme=1
> options iwlwifi power_save=0  "

Are these two "solutions" related?  Are the beacons missed due to adaptor going to sleep? Or, is it vice versa: WiFi Adaptor goes to sleep due to missed beacons?
Comment 181 mikezackles 2021-01-09 19:31:43 UTC
> > options iwlmvm power_scheme=1
> > options iwlwifi power_save=0  "

For what it's worth, these did not resolve the issue for me.

> Are these two "solutions" related?  Are the beacons missed due to adaptor
> going to sleep? Or, is it vice versa: WiFi Adaptor goes to sleep due to
> missed beacons?

This might be a clue: https://github.com/torvalds/linux/blob/d635a69dd4981cc51f90293f5f64268620ed1565/include/net/mac80211.h#L2869-L2928

I am in agreement that the problem isn't fully understood.
Comment 182 kotrfa 2021-01-09 19:44:10 UTC
(In reply to Eaglet from comment #176)

> You were told what the problem is: it's in the access points. Variants of
> the problem in points of access by me are given in the previous message. If
> something is difficult for you to understand, then you have not tested what
> I tested (I described the causes of the problem in access points in three
> points). Understanding only happens if you have relevant experience, which
> you do not yet have, but I do. And if so, then not a troll!
> I have tested all possible causes (indicated in three points). And you could
> not test it and have the audacity to point out!

I am not an expert, I am a regular user who barely understands what's going on here. I have been trying to provide relevant information and support as you can see in the earlier comments. In my message you reacted to, I only asked considering (I used exactly this word) the observations and that the conclusion seems unlikely to me. 

Your answer could be simply "not relevant" and I would take it, but instead I am asked to stop trolling and that I am rude? I am grateful for your contribution, but your answer is fairly offensive and simply makes me less likely to contribute the next time. 


> options iwlmvm power_scheme=1
> options iwlwifi power_save=0

did not solve it for me either
Comment 183 Eaglet 2021-01-09 20:13:06 UTC
(In reply to kotrfa from comment #182)
> (In reply to Eaglet from comment #176)
> 
> > You were told what the problem is: it's in the access points. Variants of
> > the problem in points of access by me are given in the previous message. If
> > something is difficult for you to understand, then you have not tested what
> > I tested (I described the causes of the problem in access points in three
> > points). Understanding only happens if you have relevant experience, which
> > you do not yet have, but I do. And if so, then not a troll!
> > I have tested all possible causes (indicated in three points). And you
> could
> > not test it and have the audacity to point out!
> 
> I am not an expert, I am a regular user who barely understands what's going
> on here. I have been trying to provide relevant information and support as
> you can see in the earlier comments. In my message you reacted to, I only
> asked considering (I used exactly this word) the observations and that the
> conclusion seems unlikely to me. 
> 
> Your answer could be simply "not relevant" and I would take it, but instead
> I am asked to stop trolling and that I am rude? I am grateful for your
> contribution, but your answer is fairly offensive and simply makes me less
> likely to contribute the next time. 
> 
> 
> > options iwlmvm power_scheme=1
> > options iwlwifi power_save=0
> 
> did not solve it for me either

You have inflicted no less insult upon me and received what you deserve. From now on, learn to be courteous and this will not happen to you again.
Comment 184 d 2021-01-09 20:33:51 UTC
Eaglet,

Thanks for the URL. I'm having issues getting it the parameters to show-up under /sys/module/<anything> but that is my issue to learn.

When I have it working I'll post what I've set and how the results worked. This may work around the several possible causes of different interactions of firmware, drivers and APs.

FYI for people coming in late I did try using /etc/NetworkManager/conf.d/default-wifi-powersave-on.conf with:
wifi.powersave = 2 

for several weeks but that didn't change anything.
Comment 185 Sergey Slizovskiy 2021-01-09 21:05:21 UTC
Disabling power saving has worked for me, as I reported above. Following the clue of mikezackles, it now looks reasonable that either we increase the beacon wait time,  or, we disable the beacon filtering support (which will increase the beacon frequency reported by hardware).  It is also logically possible that a beacon filtering support is not working as expected in the driver. 

(In reply to mikezackles from comment #181)

> 
> This might be a clue:
> https://github.com/torvalds/linux/blob/
> d635a69dd4981cc51f90293f5f64268620ed1565/include/net/mac80211.h#L2869-L2928
> 
> I am in agreement that the problem isn't fully understood.
Comment 186 how 2021-01-10 01:20:08 UTC
It is not an Access Point bug.  Why all other devices and Windows work as expected except the one with iwlwifi on Linux Kernel 5.1 and above? You revert the Kernel to 4.15 and works as normal as other devices.
Comment 187 d 2021-01-10 01:53:39 UTC
While I'm not on the dev team the revision to a 4.15 kernel is interesting.

Have you tried the 4.15 kernel with the latest driver? 

I don't think this is a trival problem for the people working on this.

Unfortunately I'm not in position to revert to earlier kernels but it may be an issue for some specific Intel chip sets.
Comment 188 how 2021-01-10 02:06:54 UTC
I am using the Kernel 4.15.0-124 LTS. I managed to revert to 4.15 by booting to USB Mint 19.3 install the kernel,then copy the files to Mint 20. So the drivers are from 19.3. 

I cannot try the new drivers since the kernel 4.15 is compiled with gcc7. Mint 20 uses gcc9.
Comment 189 mikezackles 2021-01-10 05:41:48 UTC
The commit that Ozan pointed out (between 5.0 and 5.1) is where the disconnect/timeout is introduced, so it would make sense that the problem doesn't happen before that point: https://github.com/torvalds/linux/commit/babea2d4fe76c6515da6231e8d940044bad686b1#diff-87e4eb6cde9b874e4303cc17a793ac5e5fc194f6b22d9d3ae3f74d2fffc71463

> Why all other devices and Windows work as expected except the one with
> iwlwifi on Linux Kernel 5.1 and above?

One possibility is that other drivers don't intentionally disconnect after too many missed beacons. Note that the commit above mentions buggy APs specifically. It seems like the author was hoping that reconnecting would get the AP to start sending beacons again, but this seems like it may not be the case. This would be in line with what Emmanuel said:

> We disconnected because we couldn't hear beacons from almost a second and
> then we couldn't re-connect because we couldn't hear beacons during the
> connection phase.

If this is true and there is nothing else fishy going on in iwlwifi it might even make sense to just revert the commit because it is not accomplishing its intent of getting the AP to start sending beacons again. Or something like my patch would be an alternative.

Is anyone in a position to set up a second machine as an air sniffer? (https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging) As best I can tell this is where the devs left off asking for information, so if we get this to them maybe they will be able to make some progress.
Comment 190 how 2021-01-10 23:21:04 UTC
It is not just disconnects or miss beacons. At times when it is still connected, the performance is very slow and the response is very slow. You can use speedtest, the page actually loads and took a long time to appear. When the test started you only get a few megabits. Other drivers don't have performance or connection issues. All APs works with other wifi hardware except iwlifi, even Windows Intel wifi works. Other Linux drivers uses the same wifi stack. 

The commit suggest to patch all APs? APs are not buggy. There is no AP firmware to suggest connection issues.
Comment 191 how 2021-01-10 23:25:27 UTC
The commit is only for iwlwifi. If it is a buggy AP all other wifi drivers then should have the same code, but is it?
Comment 192 d 2021-01-11 05:58:21 UTC
@how: I'm not a dev but I think it's more complex. If it was just the driver then all Intel chip sets would be affected. And they are not.

It's weird and I think it's interaction with the kernel for some cases. I've used the modules configuration to set both variables to 256 and did not loose connection for the first time for 3 weeks. So I'm still testing.

Look at my earlier comments, initially I blamed the driver but on testing it's an interaction that the devs are having a hard time to clearly identify what's happening. 

My best guess is a kernel/driver edge case that is not handled cleanly but works when the missed beacon setting is set very high.
Comment 193 kernelspace 2021-01-11 06:43:00 UTC
(In reply to d from comment #192)
> My best guess is a kernel/driver edge case that is not handled cleanly but
> works when the missed beacon setting is set very high.

Signal level might affect it as well, so the further away the AP is the more likely the bug will occur.
Comment 194 Mathias Bavay 2021-01-11 07:57:20 UTC
(In reply to kernelspace from comment #193)
> Signal level might affect it as well, so the further away the AP is the more
> likely the bug will occur.

Exactly! Disabling power savings fixed it for me, until I moved my desk into another room where the signal is weaker and now, it is unbearable. I had to bring Ethernet into this room in order to still be able to work (but my wife's mac works perfectly fine with the wifi in this new room). 

And regarding AP bugs: this is beyond the point in the sense that if all other drivers / adapters work fine, this one should implement a workaround if this is truly an AP bug (see the robustness principle: "Be conservative in what you send, be liberal in what you accept"). When traveling, I often have this problem and it is not reasonable to expect that I could patch the firmware of the AP of the hotel / office / conference center where I am. And my wifi getting lost so often gives a bad name to Linux and Intel...
Comment 195 Sergey Slizovskiy 2021-01-11 13:51:26 UTC
The observation of Mathias  about distance-dependence may explain why some people (like me) claim that disabling power-saving  sort of helps. Indeed, I have not tried it further away from the router.
Here I quote an email reply from Andrei and Luca from Intel who worked on the commits https://github.com/torvalds/linux/commit/babea2d4fe76c6515da6231e8d940044bad686b1#diff-87e4eb6cde9b874e4303cc17a793ac5e5fc194f6b22d9d3ae3f74d2fffc71463
quoted above:

I believe that the patch you're referring to is not the root cause for the issue, but rather a real beacon loss that is detected instead of being ignored.
Actually this patch was done as a customer bugfix due to our station not roaming correctly between APs when we stop hearing the beacons of one of them - so it is totally intended.
There might be several reasons for such a large beacon loss - first it may be a really buggy AP that stops transmitting beacons. In this case we have nothing to do.
Keeping connection to an AP that we can't hear its beacons is unhealthy and after some short period of time we will be completely out of sync - loosing multicast traffic, 
not being able to perform any power saving procedures etc.. If this issue is not seen for some larger values of IWL_MVM_MISSED_BEACONS_THRESHOLD_LONG,
we may probably consider making it configurable through some module param. Luca, what do you think?
Anyway, though it is hard to follow the bug thread a bit as it is a mix of multiple different issues, I've noticed that people claim that disabling power save "solves" the issue.
Since disabling power save shouldn't change anything about real over the air beacon loss, I suspect here a fw issue.
Please follow the procedures here: https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging to collect the trace-cmd and core dump as well.
If you have another machine with a wireless NIC that you can use as an air sniffer, this would be very helpful as well.
Comment 196 Luca Coelho 2021-01-11 14:15:46 UTC
I don't think introducing a mod parameters is the right way to go.  As Andrei said, this is probably a FW bug.  Can you try to replace the FW with the ones from this commit (the previous version) instead?

https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/commit/?id=5ee1c7d65c26b90b796362ac1b5715435e2a1384
Comment 197 Ozan Caglayan 2021-01-11 15:02:41 UTC
Switched to it, let's see what happens. Nice to hear from some people involved finally.
Comment 198 Alejandro Alvarez 2021-01-12 10:46:41 UTC
Hi folks!
I just join the discussion and try to read all the comments above.

I'm having the same issue with an **Intel Corporation Wi-Fi 6 AX201**

I tried the following module's configuration but it didn't work as well.

```
options cfg80211 cfg80211_disable_40mhz_24ghz=1
options iwlmvm power_scheme=1
options iwlwifi swcrypto=0
options iwlwifi bt_coex_active=1
options iwlwifi power_save=0
options iwlwifi uapsd_disable=1
```

I set Beacons intervals in my [TP-Link AX50](https://www.tp-link.com/us/home-networking/wifi-router/archer-ax50/) but didn't work as well. I also tried with 2 other routers and I still had the same problem.

My only way to connect in this room for the moment is by using tethering with my mobile. As others have mentioned, all my other devices have no problem connecting in the same zone.

I'm glad to help to find the root of this issue.
Comment 199 d 2021-01-16 15:42:43 UTC
In chasing a moving target I have not had a dropout for 3 days with:

options iwlwifi IWL_MVM_MISSED_BEACONS_THRESHOLD=256
options iwlwifi IWL_MVM_MISSED_BEACONS_THRESHOLD_LONG=256

On Ubuntu 20.04 using the kernel 5.4.0-62-generic.

However there is a firmware update for the EUFI: Version 12.0.70.1652 (LVFS: 192.70.1652)

So this invalidates my current assumptions and will remove my settings and see what happens with a reboot and default values.
Comment 200 Sergey Slizovskiy 2021-01-19 19:34:25 UTC
I have checked the below commands on my 8275 card and these did NOT help.
(In reply to d from comment #199)
> options iwlwifi IWL_MVM_MISSED_BEACONS_THRESHOLD=256
> options iwlwifi IWL_MVM_MISSED_BEACONS_THRESHOLD_LONG=256
Namely, the command ping google.com  gives 8% packet loss.
On the other hand, the option to disable power saving stops the package loss. 

Probably, these above commands had no effect since my kernel have not been patched.
Comment 201 Boris Hollas 2021-01-20 11:10:35 UTC
I can confirm that

options iwlwifi IWL_MVM_MISSED_BEACONS_THRESHOLD=256
options iwlwifi IWL_MVM_MISSED_BEACONS_THRESHOLD_LONG=256

does not solve the problem.

$ nmcli
wlp3s0: nicht verbunden
        "Intel 7265"
        1 Verbindung verfügbar
        wifi (iwlwifi), 94:65:9C:91:36:AB, hw, mtu 1500

The syslog shows

wpa_supplicant[644]: wlp3s0: CTRL-EVENT-BEACON-LOSS
Comment 202 Boris Hollas 2021-01-20 11:16:54 UTC
Created attachment 294779 [details]
complete syslog output

My WLAN lost connection at around 11:57. Then I reconnected and the connection was lost again.
Comment 203 Boris Hollas 2021-01-20 11:26:29 UTC
I use Ubuntu 20.10 with kernel 5.8.0-38-generic. I've had this problem since Ubuntu 20.04. I didn't have this problem with earlier version of Ubuntu.
Comment 204 Boris Hollas 2021-01-20 17:28:41 UTC
Created attachment 294787 [details]
syslog

Another excerpt from syslog.

Also, 

options iwlmvm power_scheme=1
options iwlwifi power_save=0

didn't help.
Comment 205 a.katzander 2021-01-20 17:46:30 UTC
With the upgrade to kernel 5.10 in fedora my wifi work now without any issue.
Comment 206 Dirk Jonker 2021-01-21 07:38:08 UTC
I have not experienced any issues since the upgrade to 5.10 on Debian.
Comment 207 Boris Hollas 2021-01-22 18:10:41 UTC
Created attachment 294811 [details]
dmesg output

After upgrading to 5.10 on Ubuntu 20.10, the messages are still in the syslog. However, the system succesfully reconnected instead of dropping the connection as before.
Comment 208 Boris Hollas 2021-01-25 10:29:20 UTC
Problem persists with kernels 5.10 and 5.11-RC4.
Comment 209 Dirk Jonker 2021-01-25 10:43:13 UTC
It seems I was too fast in reporting the lack of issues with kernel 5.10 as in the past 2 days the logs show beacon loss again, after a period of 10 days without any issues, which was a lot more than before.
Comment 210 mikezackles 2021-01-25 18:44:00 UTC
I was just able to run the following command from a mac connected to the same AP while the problem is happening:

sudo tcpdump -l --interface=en0 -I type mgt subtype beacon and wlan[20:2] == 0xABCD | pv --line-mode -F "Total Beacons: %b Beacons/sec: %r" > /dev/null

(where ABCD is the final two bytes of the BSSID)

This command reports ~10 beacons per second both when the issue is occuring and when it is not.

@Luca To me this seems like pretty strong evidence that this is a firmware bug as opposed to the access point not sending beacons, but maybe I'm misunderstanding as I'm ignorant of the details of this process.

Absent any breakthroughs here I will try to get a dump with more information next time (may take a few weeks for me to get to it).

I have not yet tried the old firmware as requested.
Comment 211 Abhishek Naik 2021-02-10 05:04:35 UTC
Hello @Dirk Jonker 

This looks like a FW issue. So, can you please share FW logs. Following link will assist you to collect FW logs.

https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging#firmware_debugging

Hello mikezackles,
Thanks for the information. Please try old firmware.
If you are still facing the issue. Please share FW logs.
Comment 212 Benjamin Redelings 2021-02-10 16:37:01 UTC
Hi,

Just a note that I'm experiencing this problem on 5.10.  I have a Lenovo X1 Carbon v7 and firmware version 46.4d093a30.0 9000-pu-b0-jf-b0.

Unlike some other people, I experience this problem multiple times per hour.  This might be caused by placing my laptop lid near an external monitor.  We have two APs (one older, one new), and I only experience it with one of them.

Debian kernels don't have CONFIG_IWLWIFI_DEBUG set, so I need to recompile the kernel before I can follow the firmware debugging instructions posted above.  However, I plan to do this in the next week or so.  I plan to try to old firmware as well.  Thanks @Abishek Naik!

-BenRI
Comment 213 Ozan Caglayan 2021-02-10 16:46:22 UTC
So I've been using my laptop with the updated firmware suggested in
https://bugzilla.kernel.org/show_bug.cgi?id=203709#c196

and the issue did not happen again for me in a period of 15 days. I just noticed that the firmware file was replaced back with the Ubuntu's shipped one during a package update.. I just replaced it back with the suggested one (36.79ff3ccf.0) to observe more.
Comment 214 hg 2021-02-13 20:21:05 UTC
Hello. I made an account just to tell what worked for me.

Context: on Ubuntu 20.04, constantly dropped connections, especially apparent with bluetooth enabled. "dmesg | grep iwl" gave the same, repeating "No beacon heard and the time event is over already..." message.

Solution: Changed from WPA&WPA2 to WPA2. I left the encryption mode to AES. I also changed the name of the SSID to all lowercase and no spaces.
Comment 215 GoodMirek 2021-02-21 23:26:30 UTC
Hello, let me share what helped me.  

My Access Point (AP) is Mikrotik, OS Fedora 33, kernel 5.8.18-300.fc33.x86_64, Network controller: Intel Corporation Wireless 8265 / 8275 (rev 78).  

I was getting repeated connectivity losses under load, which were starting with error:
```
wpa_supplicant[1523]: wlp2s0: CTRL-EVENT-BEACON-LOSS
```
followed by:
```
wlp2s0: CTRL-EVENT-DISCONNECTED bssid=74:4d:28:b3:e6:73 reason=4 locally_generated=1
```

The issue started to happen when I played with the AP settings and set "HW. protection mode" to value "CTS to self". It took me a while to realize, but now it works for 3 days already.
Comment 216 Yury Pakin 2021-02-24 13:28:07 UTC
Created attachment 295421 [details]
dmesg after exiting the last suspend

After a short time, after exiting suspend, a single loss of connection to the access point. The frequency and intensity of connection losses with the access point is random, from a single loss to an intense one several times in a row. After the connection to the access point is restored, the connection is stable. Until the next connection loss. The laptop is in the same place and does not move.
Comment 217 Yury Pakin 2021-02-24 13:30:54 UTC
# lspci -k | grep -A2 Net
03:00.0 Network controller: Intel Corporation Wireless 3160 (rev 93)
	Subsystem: Intel Corporation Dual Band Wireless AC 3160
	Kernel driver in use: iwlwifi


# uname -rm
5.4.62-std-def-alt1 x86_64


# iwconfig wlan0 
wlan0     IEEE 802.11  ESSID:"AP"  
          Mode:Managed  Frequency:2.462 GHz  Access Point: xx:xx:xx:xx:xx:xx   
          Bit Rate=65 Mb/s   Tx-Power=22 dBm   
          Retry short limit:7   RTS thr:off   Fragment thr:off
          Encryption key:off
          Power Management:off
          Link Quality=62/70  Signal level=-48 dBm  
          Rx invalid nwid:0  Rx invalid crypt:0  Rx invalid frag:0
          Tx excessive retries:0  Invalid misc:7   Missed beacon:0
Comment 218 Benjamin Redelings 2021-03-03 22:21:14 UTC
As another data point, I used to have my connection dropped multiple times per day with "No beacon heard and the time event is over already..." messages.

However, on Feb 23rd this inexplicably stopped --- without a reboot! 

I'm running Debian unstable, with kernel 5.10.0-3-amd64.  Firmware version is iwlwifi-9000-pu-b0-jf-b0-46.ucode.

I searched for software that was upgraded on that day, but didn't find anything that looked relevant.  The router firmware seems to be unchanged.  So, I can't explain why the problem stopped.

-BenRI
Comment 219 Abhishek Naik 2021-03-04 04:16:13 UTC
Hello @Yury Pakin,
      Thanks for sharing dmesg logs. But this looks like a FW issue. So can you please share FW logs.
      Please refer to Firmware Debugging section in below link for steps to collect firmware logs.
      https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging

Thanks.
Comment 220 testillox 2021-03-04 15:19:19 UTC
Created attachment 295645 [details]
Firmware dump crash

Hello,

This is the card model

00:14.3 Network controller: Intel Corporation Wireless-AC 9560 


Here the error log:
https://pastebin.com/LY771WXg
Comment 221 Victor P 2021-03-08 05:49:41 UTC
I have had this problem on the 7260 ever since upgrading Ubuntu from 18.04 LTS (4.15 kernel / 17.948900127.0 7260-17.ucode) to 20.04 LTS (5.4 kernel / 17.3216344376.0 7260-17.ucode). It manifests as reported above, frequent disconnects, with numerous "No beacon heard and the time event is over already..." and CTRL-EVENT-BEACON-LOSS trace from wpa_supplicant. 

After downgrading the kernel to 4.15 (keeping FW at version 17.3216344376.0), I no longer have the problem with disconnects. Of note, CTRL-EVENT-BEACON-LOSS traces persist, even though there are no disconnects. Seems like a spurious beacon loss reported by the FW. I have not tried downgrading the FW since seeing the bug, but I have never had a single CTRL-EVENT-BEACON-LOSS while running the 17.948900127.0 FW. Other devices on the same AP do not have disconnect issues.

My bet is this is a FW bug, likely dating back to: https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/commit/?id=58265e0def90e8f1b6c1987fbf70af983e84d514.

It did not start manifesting in disconnects until the patch mentioned in earlier comments, to disconnect in case of large beacon loss, which landed in the 5.1 kernel. (https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/commit/drivers/net/wireless/intel/iwlwifi/mvm/mac-ctxt.c?id=babea2d4fe76c6515da6231e8d940044bad686b1).

If it helps, I will collect FW debug logs.
Comment 222 how 2021-03-08 07:22:07 UTC
This beacon problem is not on a particular firmware, not on a particular Intel wifi device. You are using a 7260, I am using a 8260 on a 8000C firmware and I still have the same problem. Kernel 4.15 works fine for everybody.
Comment 223 Abhishek Naik 2021-03-08 09:18:33 UTC
Hello Victor P

Can you please share FW logs. Following link will assist you to collect FW logs.

https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging#firmware_debugging.

Also can you please collect trace-cmd logs too.
https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging#tracing



Hello mikezackles,
Thanks for the information. Please try old firmware.
If you are still facing the issue. Please share FW logs.(In reply to Victor P from comment #221)
> I have had this problem on the 7260 ever since upgrading Ubuntu from 18.04
> LTS (4.15 kernel / 17.948900127.0 7260-17.ucode) to 20.04 LTS (5.4 kernel /
> 17.3216344376.0 7260-17.ucode). It manifests as reported above, frequent
> disconnects, with numerous "No beacon heard and the time event is over
> already..." and CTRL-EVENT-BEACON-LOSS trace from wpa_supplicant. 
> 
> After downgrading the kernel to 4.15 (keeping FW at version
> 17.3216344376.0), I no longer have the problem with disconnects. Of note,
> CTRL-EVENT-BEACON-LOSS traces persist, even though there are no disconnects.
> Seems like a spurious beacon loss reported by the FW. I have not tried
> downgrading the FW since seeing the bug, but I have never had a single
> CTRL-EVENT-BEACON-LOSS while running the 17.948900127.0 FW. Other devices on
> the same AP do not have disconnect issues.
> 
> My bet is this is a FW bug, likely dating back to:
> https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git/
> commit/?id=58265e0def90e8f1b6c1987fbf70af983e84d514.
> 
> It did not start manifesting in disconnects until the patch mentioned in
> earlier comments, to disconnect in case of large beacon loss, which landed
> in the 5.1 kernel.
> (https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/commit/
> drivers/net/wireless/intel/iwlwifi/mvm/mac-ctxt.
> c?id=babea2d4fe76c6515da6231e8d940044bad686b1).
> 
> If it helps, I will collect FW debug logs.
Comment 224 Victor P 2021-03-09 01:25:14 UTC
Created attachment 295757 [details]
7260 system logs, trace and fw core dump

Attached iwlwifi_debug_vp.7z containing the logs and trace on the 7260. The FW did not automatically create any dumps, so I manually obtained a few while the problem was happening. Hope this helps.
Comment 225 Abhishek Naik 2021-03-09 10:12:53 UTC
Hello Victor P,
          Thanks for sharing logs. We will look into it.

Thanks,
Abhishek Naik.
Comment 226 Yury Pakin 2021-03-09 16:44:29 UTC
On the core

# uname -r
5.4.98-std-def-alt1

with an adapter

# lspci -nn | grep Netw
03:00.0 Network controller [0280]: Intel Corporation Wireless 3160 [8086:08b4] (rev 93)

the beacon was lost 44 times,

# iw wlan0 station dump | grep 'beacon\|signal\|failed'
	tx failed:	1
	beacon loss:	44
	beacon rx:	43010
	signal:  	-46 [-46] dBm
	signal avg:	-46 [-46] dBm
	beacon signal avg:	-39 dBm
	beacon interval:100

but not all 44 times the connection is lost

# dmesg | grep 'lost\|No beacon heard'
[23758.984770] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[23763.142388] iwlwifi 0000:03:00.0: No beacon heard and the time event is over already...
[23763.142404] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24029.315008] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24041.503706] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24052.879331] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24056.478335] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24060.625166] iwlwifi 0000:03:00.0: No beacon heard and the time event is over already...
[24060.625209] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24064.260956] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24067.845555] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24185.386849] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24354.445211] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24558.333156] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24561.932078] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24565.515109] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24569.100665] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24575.224355] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[24582.821172] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[32587.612876] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost


When booting the same system without updates with a major kernel version lower than kernel 5.4.98,

# uname -r
4.19.102-std-def-alt1

for 13 hours of the test,

# LC_ALL=C date
Tue Mar  9 05:58:54 +03 2021

# LC_ALL=C date
Tue Mar  9 19:03:11 +03 2021

with the same adapter, on the same system,

# lspci -nn | grep Netw
03:00.0 Network controller [0280]: Intel Corporation Wireless 3160 [8086:08b4] (rev 93)

there was no loss of the beacon and no loss of the connection

# iw wlan0 station dump | grep 'beacon\|signal\|failed'
	tx failed:	0
	beacon loss:	0
	beacon rx:	452364
	signal:  	-42 [-42] dBm
	signal avg:	-42 [-42] dBm
	beacon signal avg:	-39 dBm
	beacon interval:100

# dmesg | grep 'lost\|No beacon heard'
# _


If the problem is in the Intel firmware, then why is it that with the same firmware in the system without updates, when changing the kernel with a downgrade of the kernel version from version 5.{4.10} to kernel 4.19, the problem with the loss of beacons and the loss of the connection is not reproduced?
Comment 227 Victor P 2021-03-09 17:14:26 UTC
(In reply to Yury Pakin from comment #226)
 
> If the problem is in the Intel firmware, then why is it that with the same
> firmware in the system without updates, when changing the kernel with a
> downgrade of the kernel version from version 5.{4.10} to kernel 4.19, the
> problem with the loss of beacons and the loss of the connection is not
> reproduced?

What firmware version are you running? (ethtool -i wlan0)

Could you check the system journal (not just dmesg) for CTRL-EVENT-BEACON-LOSS reported by wpa_supplicant even when running on 4.19? I still see CTRL-EVENT-BEACON-LOSS on 4.15, when running with the latest 7260 firmware (17.3216344376.0). No disconnects on 4.15, the trace there is "harmless".

My hunch is that some change related to reporting beacon loss was introduced in the FW at some point. The iwlwifi driver in kernels before around 5.1 does not disconnect when a huge beacon loss is reported, whereas in newer kernels it does.
Comment 228 Yury Pakin 2021-03-10 05:16:33 UTC
(In reply to Victor P from comment #227)
> (In reply to Yury Pakin from comment #226)
>  
> > If the problem is in the Intel firmware, then why is it that with the same
> > firmware in the system without updates, when changing the kernel with a
> > downgrade of the kernel version from version 5.{4.10} to kernel 4.19, the
> > problem with the loss of beacons and the loss of the connection is not
> > reproduced?
> 
> What firmware version are you running? (ethtool -i wlan0)
> 

# ethtool -i wlan0
driver: iwlwifi
version: 5.4.98-std-def-alt1
firmware-version: 17.3216344376.0
expansion-rom-version: 
bus-info: 0000:03:00.0
supports-statistics: yes
supports-test: no
supports-eeprom-access: no
supports-register-dump: no
supports-priv-flags: no


> Could you check the system journal (not just dmesg) for
> CTRL-EVENT-BEACON-LOSS reported by wpa_supplicant even when running on 4.19?

My logs do not contain this message, neither on kernel 5.4.98 nor on kernel 4.19.102 (rsyslog logging):

# grep -ri CTRL-EVENT-BEACON-LOSS /var/log
#_


> I still see CTRL-EVENT-BEACON-LOSS on 4.15, when running with the latest
> 7260 firmware (17.3216344376.0). No disconnects on 4.15, the trace there is
> "harmless".
> 
> My hunch is that some change related to reporting beacon loss was introduced
> in the FW at some point. The iwlwifi driver in kernels before around 5.1
> does not disconnect when a huge beacon loss is reported, whereas in newer
> kernels it does.

On the core 5.4 and higher, the loss of beacons and connection, I have random and in time and intensity. The command can display lost beacons even when there is no connection loss yet and nothing goes to the log:

# iw wlan0 station dump | grep 'beacon\|signal\|failed'
	tx failed:	2
	beacon loss:	9
	beacon rx:	65423
	signal:  	-46 [-46] dBm
	signal avg:	-46 [-46] dBm
	beacon signal avg:	-40 dBm
	beacon interval:100

But it's worth going back to kernel 4.19 (or any 4.X) and with the same firmware 3160 (17.3216344376.0) the connection is exceptionally stable and there is no loss of any beacon.
Comment 229 a.katzander 2021-03-13 05:27:38 UTC
with the latest update from Fedora 33 
  
  kernel version: 5.10.21-200.fc33.x86_64
  iwwifi firmware-version: 36.ad812ee0.0 8265-36.ucode

I have now:

        tx failed:	167
	beacon loss:	0
	beacon rx:	37551
	signal:  	-63 [-63, -93] dBm
	signal avg:	-63 [-63, -91] dBm
	beacon signal avg:	-74 dBm
	beacon interval:100
Comment 230 Yury Pakin 2021-03-20 14:12:09 UTC
Test week passed:
Forced beacon interval:50
There were no disconnections from the access point

# uname -r
5.4.98-std-def-alt1

# uptime 
 16:09:15 up 7 days, 20:08,  4 users,  load average: 0,57, 0,61, 0,66

# grep -H . /etc/modprobe.d/iwlwifi.conf*
/etc/modprobe.d/iwlwifi.conf.bak:options iwlmvm power_scheme=1
/etc/modprobe.d/iwlwifi.conf.bak:options iwlwifi power_save=0


Despite the fact that iw shows 23 lost access point beacons,

# iw wlan0 station dump | grep 'beacon\|signal\|failed'
	tx failed:	0
	beacon loss:	23
	beacon rx:	178981
	signal:  	-49 [-49] dBm
	signal avg:	-47 [-47] dBm
	beacon signal avg:	-43 dBm
	beacon interval:50

# dmesg | grep 'lost\|No beacon heard' 
[160392.189974] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[351269.779722] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost

iwconfig says there were no beacon losses

# iwconfig wlan0 | grep 'Management\|Signal\|beacon'
          Power Management:off
          Link Quality=60/70  Signal level=-50 dBm  
          Tx excessive retries:0  Invalid misc:15   Missed beacon:0

But the same iwconfig says about fifteen other lost packets

# man iwconfig | grep -A1 'Invalid misc'
       Invalid misc
              Other packets lost in relation with specific wireless operations.


It probably makes sense to return the Beacon Interval to 100 and drive for another week with a lock of 11n

# modinfo iwlwifi | grep '11n\|power_save'
parm:           11n_disable:disable 11n functionality, bitmap: 1: full, 2: disable agg TX, 4: disable agg RX, 8 enable agg TX (uint)
parm:           power_save:enable WiFi power management (default: disable) (bool)

https://bbs.archlinux.org/viewtopic.php?pid=1060792#p1060792

#/etc/modprobe.d/iwlwifi.conf
options iwlwifi 11n_disable=1 swcrypto=1
Comment 231 Martin 2021-04-03 05:26:37 UTC
(In reply to Abhishek Naik from comment #225)
> Hello Victor P,
>           Thanks for sharing logs. We will look into it.
> 
> Thanks,
> Abhishek Naik.

Anything new on that topic?
I am on Fedora 33 with Kernel 5.11 and my 7260 is also affected.
Comment 232 Mathias 2021-04-20 09:42:56 UTC
I downgraded my kernel to 4.19.0-16-amd64 around 2 weeks ago. No more disconnects since then.
Comment 233 triffid.hunter 2021-04-23 06:46:22 UTC
This has started happening to me just recently, with 5.10 series kernel and linux-firmware-20210315 (17.3216344376.0 3160-17.ucode) on an intel 3160 r83 card.
Comment 234 Treah Blade 2021-05-01 18:08:24 UTC
Hello, 
I have this same issue on the card listed below. But I have noticed something odd about the problem. If I connect into a network using standard wireless G 2.4ghz then its fine. If I connect in using 5gz then I get this random disconnection problem. The problem is vastly worse with the new wireless 6 systems however. My work has these wireless routers and its almost impossible to stay connected to them. Its always reason 4 when it disconnects. 

Network controller: Intel Corporation Wireless 8260 (rev 3a)


I hope this is helpful to someone. 

I am running on a Gentoo install using kernel 5.10.27 kernel.
Comment 235 Yury Pakin 2021-05-06 10:43:30 UTC
(In reply to Sai Sindhur Malleni from comment #96)
> Unfortunately, disabling power saving in the driver alone is not solving my
> problem. The only thing that has solved my problem so far is 
>  lsmod | grep -o -e ^iwlmvm -e ^iwldvm -e ^iwlwifi |xargs sudo rmmod &&
> sleep 3 && sudo modprobe iwlwifi 11n_disable=1
> 


Returned 100 ms at the access point for the Beacon Interval.
Set 11n_disable to active with the iwlwifi module reload:

# lsmod | grep -o -e ^iwlmvm -e ^iwldvm -e ^iwlwifi | \
#xargs rmmod && sleep 3 && modprobe iwlwifi 11n_disable=1


Start of the test:

# sh test-beacon-loss.sh

# lspci -knn | grep Net
03:00.0 Network controller [0280]: Intel Corporation Wireless 3160 [8086:08b4] (rev 93)

# ethtool -i wlan0 | grep 'driver\|^version\|firmware\|bus'
driver: iwlwifi
version: 5.4.111-std-def-alt1
firmware-version: 17.3216344376.0
bus-info: 0000:03:00.0

# iw dev wlan0 info | grep 'Interface\|type\|channel\|txpower'
Interface wlan0
	type managed
	channel 1 (2412 MHz), width: 20 MHz (no HT), center1: 2412 MHz
	txpower 20.00 dBm

# iw wlan0 station dump | grep 'beacon\|signal\|failed\|bitrate\|connected'
	tx failed:	0
	beacon loss:	0
	beacon rx:	10608
	signal:  	-49 [-49] dBm
	signal avg:	-49 [-49] dBm
	beacon signal avg:	-44 dBm
	tx bitrate:	54.0 MBit/s
	rx bitrate:	54.0 MBit/s
	beacon interval:100
	connected time:	1113 seconds

# iwconfig wlan0
wlan0     IEEE 802.11  ESSID:"AP0"  
          Mode:Managed  Frequency:2.412 GHz  Access Point: XX:XX:XX:XX:XX:XX   
          Bit Rate=54 Mb/s   Tx-Power=20 dBm   
          Retry short limit:7   RTS thr:off   Fragment thr:off
          Encryption key:off
          Power Management:on
          Link Quality=61/70  Signal level=-49 dBm  
          Rx invalid nwid:0  Rx invalid crypt:0  Rx invalid frag:0
          Tx excessive retries:0  Invalid misc:5   Missed beacon:0


# cat /etc/modprobe.d/iwlwifi.conf
options iwlwifi 11n_disable=1 swcrypto=1


After 2 hours and 49 minutes from the start of the test, the rx bitrate dropped to 1 MBit/s:

# sh test-beacon-loss.sh

# lspci -knn | grep Net
03:00.0 Network controller [0280]: Intel Corporation Wireless 3160 [8086:08b4] (rev 93)

# ethtool -i wlan0 | grep 'driver\|^version\|firmware\|bus'
driver: iwlwifi
version: 5.4.111-std-def-alt1
firmware-version: 17.3216344376.0
bus-info: 0000:03:00.0

# cat /etc/modprobe.d/iwlwifi.conf
options iwlwifi 11n_disable=1 swcrypto=1

# iw dev wlan0 info | grep 'Interface\|type\|channel\|txpower'
Interface wlan0
	type managed
	channel 1 (2412 MHz), width: 20 MHz (no HT), center1: 2412 MHz
	txpower 20.00 dBm

# iw wlan0 station dump | grep 'beacon\|signal\|failed\|bitrate\|connected'
	tx failed:	3
	beacon loss:	0
	beacon rx:	98439
	signal:  	-46 [-46] dBm
	signal avg:	-46 [-46] dBm
	beacon signal avg:	-45 dBm
	tx bitrate:	54.0 MBit/s
	rx bitrate:	1.0 MBit/s
	beacon interval:100
	connected time:	10150 seconds

# iwconfig wlan0 | grep -v 'Retry\|Encryption'
wlan0     IEEE 802.11  ESSID:"AP0"  
          Mode:Managed  Frequency:2.412 GHz  Access Point: XX:XX:XX:XX:XX:XX   
          Bit Rate=54 Mb/s   Tx-Power=20 dBm   
          Power Management:on
          Link Quality=64/70  Signal level=-46 dBm  
          Rx invalid nwid:0  Rx invalid crypt:0  Rx invalid frag:0
          Tx excessive retries:3  Invalid misc:62   Missed beacon:0


And in dmesg:

# dmesg | grep -A1000 '561122.366868' 
[561122.366868] wlan0: deauthenticating from xx:xx:xx:xx:xx:xx by local choice (Reason: 3=DEAUTH_LEAVING)
[561125.460397] Intel(R) Wireless WiFi driver for Linux
[561125.460398] Copyright(c) 2003- 2015 Intel Corporation
[561125.462129] iwlwifi 0000:03:00.0: loaded firmware version 17.3216344376.0 op_mode iwlmvm
[561125.471405] iwlwifi 0000:03:00.0: Detected Intel(R) Dual Band Wireless AC 3160, REV=0x164
[561125.496122] iwlwifi 0000:03:00.0: base HW address: zz:zz:zz:zz:zz:zz
[561125.606517] ieee80211 phy3: Selected rate control algorithm 'iwl-mvm-rs'
[561131.433852] wlan0: authenticate with xx:xx:xx:xx:xx:xx
[561131.438828] wlan0: send auth to xx:xx:xx:xx:xx:xx (try 1/3)
[561131.441713] wlan0: authenticated
[561131.442364] wlan0: associate with xx:xx:xx:xx:xx:xx (try 1/3)
[561131.446169] wlan0: RX AssocResp from xx:xx:xx:xx:xx:xx (capab=0x411 status=0 aid=2)
[561131.459617] wlan0: associated
[561131.481369] IPv6: ADDRCONF(NETDEV_CHANGE): wlan0: link becomes ready
[563756.334465] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[563759.881280] wlan0: authenticate with xx:xx:xx:xx:xx:xx
[563759.886050] wlan0: send auth to xx:xx:xx:xx:xx:xx (try 1/3)
[563759.889178] wlan0: authenticated
[563759.890137] wlan0: associate with xx:xx:xx:xx:xx:xx (try 1/3)
[563759.893824] wlan0: RX AssocResp from xx:xx:xx:xx:xx:xx (capab=0x411 status=0 aid=2)
[563759.906462] wlan0: associated
[563760.500179] iwlwifi 0000:03:00.0: No beacon heard and the time event is over already...
[563760.500230] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[563764.033418] wlan0: authenticate with xx:xx:xx:xx:xx:xx
[563764.038493] wlan0: send auth to xx:xx:xx:xx:xx:xx (try 1/3)
[563764.041530] wlan0: authenticated
[563764.042166] wlan0: associate with xx:xx:xx:xx:xx:xx (try 1/3)
[563764.046286] wlan0: RX AssocResp from xx:xx:xx:xx:xx:xx (capab=0x411 status=0 aid=2)
[563764.057577] wlan0: associated
[563786.330867] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[563789.865888] wlan0: authenticate with xx:xx:xx:xx:xx:xx
[563789.871459] wlan0: send auth to xx:xx:xx:xx:xx:xx (try 1/3)
[563789.874912] wlan0: authenticated
[563789.875310] wlan0: associate with xx:xx:xx:xx:xx:xx (try 1/3)
[563789.880372] wlan0: RX AssocResp from xx:xx:xx:xx:xx:xx (capab=0x411 status=0 aid=2)
[563789.881097] wlan0: associated
[563815.317608] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[563827.259950] wlan0: authenticate with xx:xx:xx:xx:xx:xx
[563827.265818] wlan0: send auth to xx:xx:xx:xx:xx:xx (try 1/3)
[563827.268722] wlan0: authenticated
[563827.269533] wlan0: associate with xx:xx:xx:xx:xx:xx (try 1/3)
[563827.273344] wlan0: RX AssocResp from xx:xx:xx:xx:xx:xx (capab=0x411 status=0 aid=2)
[563827.284320] wlan0: associated
[563827.879816] iwlwifi 0000:03:00.0: No beacon heard and the time event is over already...
[563827.879867] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[563831.410640] wlan0: authenticate with xx:xx:xx:xx:xx:xx
[563831.416500] wlan0: send auth to xx:xx:xx:xx:xx:xx (try 1/3)
[563831.419741] wlan0: authenticated
[563831.420563] wlan0: associate with xx:xx:xx:xx:xx:xx (try 1/3)
[563831.424928] wlan0: RX AssocResp from xx:xx:xx:xx:xx:xx (capab=0x411 status=0 aid=2)
[563831.445276] wlan0: associated
[563938.411108] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[563941.944973] wlan0: authenticate with xx:xx:xx:xx:xx:xx
[563941.949913] wlan0: send auth to xx:xx:xx:xx:xx:xx (try 1/3)
[563941.952748] wlan0: authenticated
[563941.953233] wlan0: associate with xx:xx:xx:xx:xx:xx (try 1/3)
[563941.956896] wlan0: RX AssocResp from xx:xx:xx:xx:xx:xx (capab=0x411 status=0 aid=2)
[563941.959181] wlan0: associated
[563945.161711] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[563948.698314] wlan0: authenticate with xx:xx:xx:xx:xx:xx
[563948.703342] wlan0: send auth to xx:xx:xx:xx:xx:xx (try 1/3)
[563948.707781] wlan0: authenticated
[563948.708264] wlan0: associate with xx:xx:xx:xx:xx:xx (try 1/3)
[563948.711856] wlan0: RX AssocResp from xx:xx:xx:xx:xx:xx (capab=0x411 status=0 aid=2)
[563948.722906] wlan0: associated
[563949.317177] iwlwifi 0000:03:00.0: No beacon heard and the time event is over already...
[563949.317203] wlan0: Connection to AP xx:xx:xx:xx:xx:xx lost
[563952.847377] wlan0: authenticate with xx:xx:xx:xx:xx:xx
[563952.853077] wlan0: send auth to xx:xx:xx:xx:xx:xx (try 1/3)
[563952.855840] wlan0: authenticated
[563952.856283] wlan0: associate with xx:xx:xx:xx:xx:xx (try 1/3)
[563952.859953] wlan0: RX AssocResp from xx:xx:xx:xx:xx:xx (capab=0x411 status=0 aid=2)
[563952.871994] wlan0: associated
[564388.957148] wlan0: deauthenticating from xx:xx:xx:xx:xx:xx by local choice (Reason: 3=DEAUTH_LEAVING)
[564392.056491] Intel(R) Wireless WiFi driver for Linux
[564392.056492] Copyright(c) 2003- 2015 Intel Corporation
[564392.058336] iwlwifi 0000:03:00.0: loaded firmware version 17.3216344376.0 op_mode iwlmvm
[564392.067695] iwlwifi 0000:03:00.0: Detected Intel(R) Dual Band Wireless AC 3160, REV=0x164
[564392.090556] iwlwifi 0000:03:00.0: base HW address: zz:zz:zz:zz:zz:zz
[564392.203001] ieee80211 phy4: Selected rate control algorithm 'iwl-mvm-rs'
[564398.025794] wlan0: authenticate with xx:xx:xx:xx:xx:xx
[564398.032179] wlan0: send auth to xx:xx:xx:xx:xx:xx (try 1/3)
[564398.040025] wlan0: authenticated
[564398.040929] wlan0: associate with xx:xx:xx:xx:xx:xx (try 1/3)
[564398.044861] wlan0: RX AssocResp from xx:xx:xx:xx:xx:xx (capab=0x411 status=0 aid=2)
[564398.046605] wlan0: associated
[564398.079560] IPv6: ADDRCONF(NETDEV_CHANGE): wlan0: link becomes ready


At the same time, there was no network loss:

# sh test-beacon-loss.sh

# lspci -knn | grep Net
03:00.0 Network controller [0280]: Intel Corporation Wireless 3160 [8086:08b4] (rev 93)

# ethtool -i wlan0 | grep 'driver\|^version\|firmware\|bus'
driver: iwlwifi
version: 5.4.111-std-def-alt1
firmware-version: 17.3216344376.0
bus-info: 0000:03:00.0

# cat /etc/modprobe.d/iwlwifi.conf
options iwlwifi 11n_disable=1 swcrypto=1

# iw dev wlan0 info | grep 'Interface\|type\|channel\|txpower'
Interface wlan0
	type managed
	channel 1 (2412 MHz), width: 20 MHz (no HT), center1: 2412 MHz
	txpower 20.00 dBm

# iw wlan0 station dump | grep 'beacon\|signal\|failed\|bitrate\|connected'
	tx failed:	3
	beacon loss:	0
	beacon rx:	103574
	signal:  	-52 [-52] dBm
	signal avg:	-49 [-49] dBm
	beacon signal avg:	-46 dBm
	tx bitrate:	54.0 MBit/s
	rx bitrate:	54.0 MBit/s
	beacon interval:100
	connected time:	10689 seconds

# iwconfig wlan0 | grep -v 'Retry\|Encryption'
wlan0     IEEE 802.11  ESSID:"AP0"  
          Mode:Managed  Frequency:2.412 GHz  Access Point: XX:XX:XX:XX:XX:XX   
          Bit Rate=54 Mb/s   Tx-Power=20 dBm   
          Power Management:on
          Link Quality=58/70  Signal level=-52 dBm  
          Rx invalid nwid:0  Rx invalid crypt:0  Rx invalid frag:0
          Tx excessive retries:3  Invalid misc:62   Missed beacon:0
Comment 236 Jaime T 2021-05-10 18:20:32 UTC
This bug's status is currently showing as "NEEDINFO". Is this correct (or should it say "IN_PROGRESS")? If it is correct, what INFO do you NEED?
Comment 237 RussianNeuroMancer 2021-05-16 14:29:52 UTC
Comment 190: 

> It is not just disconnects or miss beacons. At times when it is still
> connected, the performance is very slow and the response is very slow. You
> can use speedtest, the page actually loads and took a long time to appear.
> When the test started you only get a few megabits.

Comment 193:

> Signal level might affect it as well, so the further away the AP is the more
> likely the bug will occur.

I can confirm both observations. 

In my case, AP signal is weak (and there is nothing I can do about it) and there is also a Bluetooth keyboard and mouse added to the mix (see Comment 214). For a first few hours after boot (or reconnect, as it seems) I getting just a few megabits per second (iperf3). However, by the next morning, I always get around 0-1 megabits per second and have to reboot/reconnect. So from my point of view, a weak signal does trigger bug with higher probability, and performance does degrade over time (at least with older kernel and firmware).

After upgrade from Linux 5.8 to 5.12 (and also updating the firmware package) I still getting disconnects due to beacon loss (probably due to weak signal) but then connection is re-established so speed degradation does not happen.

driver: iwlwifi
version: 5.12.4-051204-generic
firmware-version: 29.4063824552.0 7265D-29.ucode
expansion-rom-version: 
bus-info: 0000:01:00.0
supports-statistics: yes
supports-test: no
supports-eeprom-access: no
supports-register-dump: no
supports-priv-flags: no

I will try to decrease beacon_int on AP from 100 to 50 and see how it goes in the next few days.
Comment 238 oleal 2021-05-19 01:17:34 UTC
Hi,

I'm also having the same problem using Intel 8260, rev 3a.
It seems that the problem is only happening on 5GHz, and it's worse when trying to connect to Wi-Fi 6 AP.

On dmesg I can see the following errors:
...
iwlwifi 0000:01:00.0: No beacon heard and the time event is over already...
...
wlan0: authenticate with 7x:xx:xx:xx:xx:xc
[Tue May 18 23:16:38 2021] wlan0: send auth to 7x:xx:xx:xx:xx:xc (try 1/3)
[Tue May 18 23:16:38 2021] wlan0: send auth to 7x:xx:xx:xx:xx:xc (try 2/3)
[Tue May 18 23:16:38 2021] wlan0: send auth to 7x:xx:xx:xx:xx:xc (try 3/3)
[Tue May 18 23:16:38 2021] wlan0: authentication with 7x:xx:xx:xx:xx:xc timed out
...

Information from the system:

cat /proc/version
Linux version 5.10.18- (gcc (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU Binutils for Debian) 2.35.1) #1 SMP PREEMPT Wed Feb 24 13:53:52 WET 2021

lshw -c network
  *-network
       description: Wireless interface
       product: Wireless 8260
       vendor: Intel Corporation
       physical id: 0
       bus info: pci@0000:01:00.0
       logical name: wlan0
       version: 3a
       serial: b8:xx:xx:xx:dd:ad
       width: 64 bits
       clock: 33MHz
       capabilities: pm msi pciexpress bus_master cap_list ethernet physical wireless
       configuration: broadcast=yes driver=iwlwifi driverversion=5.10.18 firmware=36.ad812ee0.0 8000C-36.ucode latency=0 link=no multicast=yes wireless=IEEE 802.11
       resources: irq:137 memory:c1200000-c1201fff

lspci -knn | grep Net
01:00.0 Network controller [0280]: Intel Corporation Wireless 8260 [8086:24f3] (rev 3a)

ethtool -i wlan0 | grep 'driver\|^version\|firmware\|bus'
driver: iwlwifi
version: 5.10.18-catf
firmware-version: 36.ad812ee0.0 8000C-36.ucode
bus-info: 0000:01:00.0

lsb_release -a
No LSB modules are available.
Distributor ID: Debian
Description:    Debian GNU/Linux 9.13 (stretch)
Release:        9.13
Codename:       stretch

I can replicate this easily. How can I help with this?
Thanks.
Comment 239 Decent Mix 2021-05-21 06:42:43 UTC
(In reply to oleal from comment #238)
> Hi,
> 
> I'm also having the same problem using Intel 8260, rev 3a.
> It seems that the problem is only happening on 5GHz, and it's worse when
> trying to connect to Wi-Fi 6 AP.
> 
> On dmesg I can see the following errors:
> ...
> iwlwifi 0000:01:00.0: No beacon heard and the time event is over already...
> ...
> wlan0: authenticate with 7x:xx:xx:xx:xx:xc
> [Tue May 18 23:16:38 2021] wlan0: send auth to 7x:xx:xx:xx:xx:xc (try 1/3)
> [Tue May 18 23:16:38 2021] wlan0: send auth to 7x:xx:xx:xx:xx:xc (try 2/3)
> [Tue May 18 23:16:38 2021] wlan0: send auth to 7x:xx:xx:xx:xx:xc (try 3/3)
> [Tue May 18 23:16:38 2021] wlan0: authentication with 7x:xx:xx:xx:xx:xc
> timed out
> ...
> 
> Information from the system:
> 
> cat /proc/version
> Linux version 5.10.18- (gcc (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU
> Binutils for Debian) 2.35.1) #1 SMP PREEMPT Wed Feb 24 13:53:52 WET 2021
> 
> lshw -c network
>   *-network
>        description: Wireless interface
>        product: Wireless 8260
>        vendor: Intel Corporation
>        physical id: 0
>        bus info: pci@0000:01:00.0
>        logical name: wlan0
>        version: 3a
>        serial: b8:xx:xx:xx:dd:ad
>        width: 64 bits
>        clock: 33MHz
>        capabilities: pm msi pciexpress bus_master cap_list ethernet physical
> wireless
>        configuration: broadcast=yes driver=iwlwifi driverversion=5.10.18
> firmware=36.ad812ee0.0 8000C-36.ucode latency=0 link=no multicast=yes
> wireless=IEEE 802.11
>        resources: irq:137 memory:c1200000-c1201fff
> 
> lspci -knn | grep Net
> 01:00.0 Network controller [0280]: Intel Corporation Wireless 8260
> [8086:24f3] (rev 3a)
> 
> ethtool -i wlan0 | grep 'driver\|^version\|firmware\|bus'
> driver: iwlwifi
> version: 5.10.18-catf
> firmware-version: 36.ad812ee0.0 8000C-36.ucode
> bus-info: 0000:01:00.0
> 
> lsb_release -a
> No LSB modules are available.
> Distributor ID: Debian
> Description:    Debian GNU/Linux 9.13 (stretch)
> Release:        9.13
> Codename:       stretch
> 
> I can replicate this easily. How can I help with this?
> Thanks.

Holy cow, limiting the frequency to the 2.4 ghz range seems to have made my problems go away completely. Thank you for that, oleal.

I could not stay connected to save my life until trying that (I've also got power save turned off, but I've been doing that for months with no luck).

I wonder if I limit it to just the 5Ghz band, if it comes back or not. I will experiment over the next few days and report back
Comment 240 oleal 2021-05-21 19:57:15 UTC
(In reply to Decent Mix from comment #239)
> (In reply to oleal from comment #238)
> > Hi,
> > 
> > I'm also having the same problem using Intel 8260, rev 3a.
> > It seems that the problem is only happening on 5GHz, and it's worse when
> > trying to connect to Wi-Fi 6 AP.
> > 
> > On dmesg I can see the following errors:
> > ...
> > iwlwifi 0000:01:00.0: No beacon heard and the time event is over already...
> > ...
> > wlan0: authenticate with 7x:xx:xx:xx:xx:xc
> > [Tue May 18 23:16:38 2021] wlan0: send auth to 7x:xx:xx:xx:xx:xc (try 1/3)
> > [Tue May 18 23:16:38 2021] wlan0: send auth to 7x:xx:xx:xx:xx:xc (try 2/3)
> > [Tue May 18 23:16:38 2021] wlan0: send auth to 7x:xx:xx:xx:xx:xc (try 3/3)
> > [Tue May 18 23:16:38 2021] wlan0: authentication with 7x:xx:xx:xx:xx:xc
> > timed out
> > ...
> > 
> > Information from the system:
> > 
> > cat /proc/version
> > Linux version 5.10.18- (gcc (Debian 10.2.1-6) 10.2.1 20210110, GNU ld (GNU
> > Binutils for Debian) 2.35.1) #1 SMP PREEMPT Wed Feb 24 13:53:52 WET 2021
> > 
> > lshw -c network
> >   *-network
> >        description: Wireless interface
> >        product: Wireless 8260
> >        vendor: Intel Corporation
> >        physical id: 0
> >        bus info: pci@0000:01:00.0
> >        logical name: wlan0
> >        version: 3a
> >        serial: b8:xx:xx:xx:dd:ad
> >        width: 64 bits
> >        clock: 33MHz
> >        capabilities: pm msi pciexpress bus_master cap_list ethernet
> physical
> > wireless
> >        configuration: broadcast=yes driver=iwlwifi driverversion=5.10.18
> > firmware=36.ad812ee0.0 8000C-36.ucode latency=0 link=no multicast=yes
> > wireless=IEEE 802.11
> >        resources: irq:137 memory:c1200000-c1201fff
> > 
> > lspci -knn | grep Net
> > 01:00.0 Network controller [0280]: Intel Corporation Wireless 8260
> > [8086:24f3] (rev 3a)
> > 
> > ethtool -i wlan0 | grep 'driver\|^version\|firmware\|bus'
> > driver: iwlwifi
> > version: 5.10.18-catf
> > firmware-version: 36.ad812ee0.0 8000C-36.ucode
> > bus-info: 0000:01:00.0
> > 
> > lsb_release -a
> > No LSB modules are available.
> > Distributor ID: Debian
> > Description:    Debian GNU/Linux 9.13 (stretch)
> > Release:        9.13
> > Codename:       stretch
> > 
> > I can replicate this easily. How can I help with this?
> > Thanks.
> 
> Holy cow, limiting the frequency to the 2.4 ghz range seems to have made my
> problems go away completely. Thank you for that, oleal.
> 
> I could not stay connected to save my life until trying that (I've also got
> power save turned off, but I've been doing that for months with no luck).
> 
> I wonder if I limit it to just the 5Ghz band, if it comes back or not. I
> will experiment over the next few days and report back

Glad that I'm able to help :)
Unfortunately, no one from intel is answering to these issues. This is a critical issue and will become even worst when people start having Wi-Fi 6 in their homes.
Comment 241 Treah Blade 2021-05-25 01:46:18 UTC
On Fri, May 21, 2021 at 07:57:15PM +0000, bugzilla-daemon@bugzilla.kernel.org wrote:
Yes that was my experiance too with my card as well. Limiting it to
2.4ghz was the way I was able to fix the problem as well. My wifi at
work however uses the newer standard and it was the only way I could get
it to stay connected. 

> https://bugzilla.kernel.org/show_bug.cgi?id=203709
> 
> --- Comment #240 from oleal (oscarleal.network@gmail.com) ---
> (In reply to Decent Mix from comment #239)
> > (In reply to oleal from comment #238)
> > > Hi,
> > > 
> > > I'm also having the same problem using Intel 8260, rev 3a.
> > > It seems that the problem is only happening on 5GHz, and it's worse when
> > > trying to connect to Wi-Fi 6 AP.
> > > 
> > > On dmesg I can see the following errors:
> > > ...
> > > iwlwifi 0000:01:00.0: No beacon heard and the time event is over
> already...
> > > ...
> > > wlan0: authenticate with 7x:xx:xx:xx:xx:xc
> > > [Tue May 18 23:16:38 2021] wlan0: send auth to 7x:xx:xx:xx:xx:xc (try
> 1/3)
> > > [Tue May 18 23:16:38 2021] wlan0: send auth to 7x:xx:xx:xx:xx:xc (try
> 2/3)
> > > [Tue May 18 23:16:38 2021] wlan0: send auth to 7x:xx:xx:xx:xx:xc (try
> 3/3)
> > > [Tue May 18 23:16:38 2021] wlan0: authentication with 7x:xx:xx:xx:xx:xc
> > > timed out
> > > ...
> > > 
> > > Information from the system:
> > > 
> > > cat /proc/version
> > > Linux version 5.10.18- (gcc (Debian 10.2.1-6) 10.2.1 20210110, GNU ld
> (GNU
> > > Binutils for Debian) 2.35.1) #1 SMP PREEMPT Wed Feb 24 13:53:52 WET 2021
> > > 
> > > lshw -c network
> > >   *-network
> > >        description: Wireless interface
> > >        product: Wireless 8260
> > >        vendor: Intel Corporation
> > >        physical id: 0
> > >        bus info: pci@0000:01:00.0
> > >        logical name: wlan0
> > >        version: 3a
> > >        serial: b8:xx:xx:xx:dd:ad
> > >        width: 64 bits
> > >        clock: 33MHz
> > >        capabilities: pm msi pciexpress bus_master cap_list ethernet
> > physical
> > > wireless
> > >        configuration: broadcast=yes driver=iwlwifi driverversion=5.10.18
> > > firmware=36.ad812ee0.0 8000C-36.ucode latency=0 link=no multicast=yes
> > > wireless=IEEE 802.11
> > >        resources: irq:137 memory:c1200000-c1201fff
> > > 
> > > lspci -knn | grep Net
> > > 01:00.0 Network controller [0280]: Intel Corporation Wireless 8260
> > > [8086:24f3] (rev 3a)
> > > 
> > > ethtool -i wlan0 | grep 'driver\|^version\|firmware\|bus'
> > > driver: iwlwifi
> > > version: 5.10.18-catf
> > > firmware-version: 36.ad812ee0.0 8000C-36.ucode
> > > bus-info: 0000:01:00.0
> > > 
> > > lsb_release -a
> > > No LSB modules are available.
> > > Distributor ID: Debian
> > > Description:    Debian GNU/Linux 9.13 (stretch)
> > > Release:        9.13
> > > Codename:       stretch
> > > 
> > > I can replicate this easily. How can I help with this?
> > > Thanks.
> > 
> > Holy cow, limiting the frequency to the 2.4 ghz range seems to have made my
> > problems go away completely. Thank you for that, oleal.
> > 
> > I could not stay connected to save my life until trying that (I've also got
> > power save turned off, but I've been doing that for months with no luck).
> > 
> > I wonder if I limit it to just the 5Ghz band, if it comes back or not. I
> > will experiment over the next few days and report back
> 
> Glad that I'm able to help :)
> Unfortunately, no one from intel is answering to these issues. This is a
> critical issue and will become even worst when people start having Wi-Fi 6 in
> their homes.
> 
> -- 
> You may reply to this email to add a comment.
> 
> You are receiving this mail because:
> You are on the CC list for the bug.
Comment 242 Abhishek Naik 2021-05-26 04:36:05 UTC
Hello All,
     Please provide FW dump from 9260 or 9560 for further analysis.

Please follow the procedures to collect the trace-cmd and FWdump.
 
https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging 

Thanks
Comment 243 triffid.hunter 2021-05-27 06:00:29 UTC
After enabling all the required debugging options from that link (same kernel version and no other changes), the issue hasn't occurred - seems like we have a heisenbug?

I'll report back if the aberrant behaviour pops up again.

PS: /sys/kernel/debug/iwlwifi doesn't exist even after following those instructions, are they outdated for 5.10-series kernels or am I missing something?
Comment 244 triffid.hunter 2021-05-27 08:45:13 UTC
Created attachment 297003 [details]
iwl-fw-error_2021-05-27_16-14-36.dump

I got a firmware crash (dump attached), after which it started dropping the WiFi connection a lot - tracing now
Comment 245 triffid.hunter 2021-05-27 09:24:58 UTC
I have posted a zstd-compressed trace at https://triffid-hunter.no-ip.info/2021-05-27T172251trace.dat.zst (23MB, 375MB uncompressed) since it's too large to attach here.

It corresponds to the following dmesg events:

[13663.985250] wlp4s0: Connection to AP 4c:77:6d:96:ae:4e lost
[13667.533496] wlp4s0: authenticate with 4c:77:6d:96:ae:4e
[13667.535518] wlp4s0: send auth to 4c:77:6d:96:ae:4e (try 1/3)
[13667.536852] wlp4s0: authenticated
[13667.537349] wlp4s0: associate with 4c:77:6d:96:ae:4e (try 1/3)
[13667.538890] wlp4s0: RX AssocResp from 4c:77:6d:96:ae:4e (capab=0x411 status=0 aid=189)
[13667.539815] wlp4s0: associated
[13667.676923] wlp4s0: Limiting TX power to 23 (23 - 0) dBm as advertised by 4c:77:6d:96:ae:4e
[14136.974041] wlp4s0: Connection to AP 4c:77:6d:96:ae:4e lost
[14140.621631] wlp4s0: authenticate with 4c:77:6d:96:ae:4e
[14140.622545] wlp4s0: send auth to 4c:77:6d:96:ae:4e (try 1/3)
[14140.625115] wlp4s0: authenticated
[14140.626017] wlp4s0: associate with 4c:77:6d:96:ae:4e (try 1/3)
[14140.630738] wlp4s0: RX AssocResp from 4c:77:6d:96:ae:4e (capab=0x411 status=0 aid=15)
[14140.632519] wlp4s0: associated
[14140.662973] wlp4s0: Limiting TX power to 23 (23 - 0) dBm as advertised by 4c:77:6d:96:ae:4e
[14229.023360] wlp4s0: Connection to AP 4c:77:6d:96:ae:4e lost
[14229.646871] wlp4s0: authenticate with 4c:77:6d:05:7f:2e
[14229.650137] wlp4s0: send auth to 4c:77:6d:05:7f:2e (try 1/3)
[14229.650987] wlp4s0: authenticated
[14229.651352] wlp4s0: associate with 4c:77:6d:05:7f:2e (try 1/3)
[14229.652796] wlp4s0: RX AssocResp from 4c:77:6d:05:7f:2e (capab=0x411 status=0 aid=28)
[14229.656151] wlp4s0: associated
[14229.722667] wlp4s0: Limiting TX power to 21 (21 - 0) dBm as advertised by 4c:77:6d:05:7f:2e
Comment 246 Abhishek Naik 2021-05-31 09:22:53 UTC
Hello triffid.hunter

Thanks for the fw dump. But looks like for some reason fw dump is corrupted. We are not able to parse it.

I believe you have provided firmware dump for 3160.
We need fw dump for 9260 or 9560 for further analysis.

If you cant find /sys/kernel/debug/iwlwifi. It can be kernel configuration issue.
you need to have at least these three options enabled when they compile the kernel 
CPTCFG_IWLWIFI_DEBUG=y
CPTCFG_IWLWIFI_DEBUGFS=y
CPTCFG_IWLWIFI_DEVICE_TRACING=y
Comment 247 triffid.hunter 2021-05-31 09:32:49 UTC
(In reply to Abhishek Naik from comment #246)
> I believe you have provided firmware dump for 3160.

Yeah, I have a 3160 chipset

> If you cant find /sys/kernel/debug/iwlwifi. It can be kernel configuration
> issue.
> you need to have at least these three options enabled when they compile the
> kernel 
> CPTCFG_IWLWIFI_DEBUG=y
> CPTCFG_IWLWIFI_DEBUGFS=y
> CPTCFG_IWLWIFI_DEVICE_TRACING=y

$ zgrep IWLWIFI_DE /proc/config.gz
CONFIG_IWLWIFI_DEBUG=y
CONFIG_IWLWIFI_DEVICE_TRACING=y

Seems I'm missing CONFIG_IWLWIFI_DEBUGFS, but that config option isn't mentioned in your https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging instructions - perhaps they need an update?
Comment 248 Abhishek Naik 2021-05-31 10:29:32 UTC
Hello sebalinux, Alexander Tsoy and testillox,
     From the logs you pasted here, I can see that you are using 9560 chipset. Can you please try to reproduce this issue and share following logs.
1. trace-cmd log
2. Fw-dump

Please refer to below link for collecting logs
https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging

Thanks
Abhishek Naik.
Comment 249 Lucas Correia Villa Real 2021-06-02 16:18:47 UTC
(In reply to mikezackles from comment #172)
> Created attachment 294585 [details]
> Module parameter for beacon timeout
> 
> Building on Ozan and Nhonda's observations, I made this little patch that
> makes the beacon timeout a runtime-configurable kernel parameter, keeping
> the default of 16 missed beacons. It seems to resolve the issue for me (with
> a timeout of 256 missed beacons).
> 
> Even if this is an issue entirely external to iwlwifi (buggy AP etc.), it
> seems that other drivers both for Linux and other operating systems tolerate
> these conditions, so there should still be some sort of fix in iwlwifi.
> 
> Without this fix, my wifi becomes completely unusable for long stretches due
> to repeated disconnects every few days (likely correlated with network
> conditions).

Your patch, with a timeout of 256 missed beacons, has fixed the problem for me. Thank you so much, this problem was driving me crazy.
Comment 250 Mathias 2021-06-02 16:20:12 UTC
Created attachment 297119 [details]
attachment-1050-0.html

Dear sender,

I am out of office until 7th of June 2021 and will answer your email after my return.
In urgent cases, please contact Mr. Gerd Brost (gerd.brost@aisec.fraunhofer.de), he will redirect you to the right person.

Best regards,
Mathias Morbitzer
Comment 251 mikezackles 2021-06-02 17:09:48 UTC
(In reply to Lucas Correia Villa Real from comment #249)
> Your patch, with a timeout of 256 missed beacons, has fixed the problem for
> me. Thank you so much, this problem was driving me crazy.

Glad it was helpful! This is indeed an extremely frustrating issue.

As this thread is a mile long, I thought I'd reiterate:
Abhishek from Intel is looking for firmware dumps from 9260 and 9560. (Unfortunately I have 8265/8275 if I'm looking at the right number.)
https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging

Also, given the longevity (2 years!) and severity of the issue, is it possible that Intel might please devote some resources toward reproducing this internally? As users we don't have much recourse since the firmware is closed.
Comment 252 mikezackles 2021-06-06 19:07:47 UTC
In case it is helpful for Arch users, I made a kernel package that includes the beacon timeout patch, and I also uploaded a binary package to github:
https://aur.archlinux.org/pkgbase/linux-beacon/
https://github.com/mikezackles/linux-beacon-pkgbuild/releases/tag/v5.12.9.arch1

(This is just to hopefully help people work around the issue until there is a firmware fix.)
Comment 253 triffid.hunter 2021-06-07 02:58:12 UTC
(In reply to mikezackles from comment #172)
> Created attachment 294585 [details]
> Module parameter for beacon timeout

I've applied this patch locally (running Gentoo so just dropped it in /etc/portage/patches/ and tweaked /etc/modprobe.d/), and haven't had WiFi reconnects since.

I'm not sure if it's a workaround or an actual solution, but it definitely seems to help.
Comment 254 Chemary 2021-06-23 09:54:53 UTC
(In reply to mikezackles from comment #252)
> In case it is helpful for Arch users, I made a kernel package that includes
> the beacon timeout patch, and I also uploaded a binary package to github:
> https://aur.archlinux.org/pkgbase/linux-beacon/
> https://github.com/mikezackles/linux-beacon-pkgbuild/releases/tag/v5.12.9.
> arch1
> 
> (This is just to hopefully help people work around the issue until there is
> a firmware fix.)

Hello mikezackles, this issue is driving me crazy since almost a year ago I  install every new kernel released and any update hoping one would be the solution but the problem never goes away, I lost the count of working hours I have spent due to this issue.
I use Ubuntu 20.04.2 LTS with kernel 5.12.9-051209-generic can you guide me to a tutorial or give me some help about how to apply this patch to my system? I don't know if I have to compile the kernel, the driver or what.
Thanks for your effort, I don't undestand why Intel does not revert the changes that are causing so many headhaches to users.
Comment 255 mikezackles 2021-06-23 14:50:53 UTC
> I lost the count of working hours I have spent due to this issue.

I completely understand :(

> I use Ubuntu 20.04.2 LTS with kernel 5.12.9-051209-generic can you guide me
> to a tutorial or give me some help about how to apply this patch to my
> system?

It's been a while since I've had to do this on Ubuntu. I think you can follow this guide to recompile your kernel:
https://help.ubuntu.com/community/Kernel/Compile

I'm pretty sure that once you get the sources on your system there will be a subdirectory named debian or something similar, and somewhere inside that you'll find a folder containing Ubuntu's kernel patches. You should be able to put the patch in that folder before compiling (which will take a while), and the build will apply it automatically.

Other people here may be able to point out if I'm missing or forgetting anything.
Comment 256 Chemary 2021-06-28 10:05:56 UTC
(In reply to mikezackles from comment #255)
> > I lost the count of working hours I have spent due to this issue.
> 
> I completely understand :(
> 
> > I use Ubuntu 20.04.2 LTS with kernel 5.12.9-051209-generic can you guide me
> > to a tutorial or give me some help about how to apply this patch to my
> > system?
> 
> It's been a while since I've had to do this on Ubuntu. I think you can
> follow this guide to recompile your kernel:
> https://help.ubuntu.com/community/Kernel/Compile
> 
> I'm pretty sure that once you get the sources on your system there will be a
> subdirectory named debian or something similar, and somewhere inside that
> you'll find a folder containing Ubuntu's kernel patches. You should be able
> to put the patch in that folder before compiling (which will take a while),
> and the build will apply it automatically.
> 
> Other people here may be able to point out if I'm missing or forgetting
> anything.

Thanks mikezackles, I compiled the kernel 5.13.0-rc6-custom with the patch and set beacon_timeout=256 few days ago, so far I got no problems, but problem normaly appeared after some weeks with laptop on, so lets see if this solution is definitive.
Comment 257 Jintao Zhang 2021-06-30 03:26:05 UTC
When I upgraded the kernel version to kernel-5.12.12-300.fc34.x86_64, I encountered the same problem.

But when I downgraded to 5.12.10-300.fc34.x86_64 kernel, the problem disappeared.



The logs pate here: https://paste.ubuntu.com/p/tVVZJG9zpM/

6月 30 10:44:11 moelove NetworkManager[871]: <info>  [1625021051.0711] device (p2p-dev-wlp0s20f3): supplicant management interface state: authenticating -> disconnected
6月 30 10:44:11 moelove NetworkManager[871]: <info>  [1625021051.0711] device (wlp0s20f3): supplicant interface state: authenticating -> disconnected
6月 30 10:44:11 moelove wpa_supplicant[1377]: wlp0s20f3: CTRL-EVENT-SSID-TEMP-DISABLED id=0 ssid="Krspace-Guest" auth_failures=2 duration=20 reason=CONN_FAILED
6月 30 10:44:11 moelove kernel: wlp0s20f3: authentication with 20:a6:cd:9c:15:e2 timed out
6月 30 10:44:10 moelove kernel: wlp0s20f3: Connection to AP 20:a6:cd:9c:15:e2 lost
6月 30 10:44:10 moelove kernel: iwlwifi 0000:00:14.3: No beacon heard and the session protection is over already...
6月 30 10:44:10 moelove kernel: wlp0s20f3: send auth to 20:a6:cd:9c:15:e2 (try 3/3)
6月 30 10:44:09 moelove kernel: wlp0s20f3: Connection to AP 20:a6:cd:9c:15:e2 lost
6月 30 10:44:09 moelove kernel: iwlwifi 0000:00:14.3: No beacon heard and the session protection is over already...
6月 30 10:44:09 moelove kernel: wlp0s20f3: send auth to 20:a6:cd:9c:15:e2 (try 2/3)
6月 30 10:44:08 moelove kernel: wlp0s20f3: Connection to AP 20:a6:cd:9c:15:e2 lost
6月 30 10:44:08 moelove kernel: iwlwifi 0000:00:14.3: No beacon heard and the session protection is over already...
Comment 258 Abhishek Naik 2021-06-30 07:14:47 UTC
(In reply to Jintao Zhang from comment #257)
> When I upgraded the kernel version to kernel-5.12.12-300.fc34.x86_64, I
> encountered the same problem.
> 
> But when I downgraded to 5.12.10-300.fc34.x86_64 kernel, the problem
> disappeared.
> 
> 
> 
> The logs pate here: https://paste.ubuntu.com/p/tVVZJG9zpM/
> 
> 6月 30 10:44:11 moelove NetworkManager[871]: <info>  [1625021051.0711] device
> (p2p-dev-wlp0s20f3): supplicant management interface state: authenticating
> -> disconnected
> 6月 30 10:44:11 moelove NetworkManager[871]: <info>  [1625021051.0711] device
> (wlp0s20f3): supplicant interface state: authenticating -> disconnected
> 6月 30 10:44:11 moelove wpa_supplicant[1377]: wlp0s20f3:
> CTRL-EVENT-SSID-TEMP-DISABLED id=0 ssid="Krspace-Guest" auth_failures=2
> duration=20 reason=CONN_FAILED
> 6月 30 10:44:11 moelove kernel: wlp0s20f3: authentication with
> 20:a6:cd:9c:15:e2 timed out
> 6月 30 10:44:10 moelove kernel: wlp0s20f3: Connection to AP 20:a6:cd:9c:15:e2
> lost
> 6月 30 10:44:10 moelove kernel: iwlwifi 0000:00:14.3: No beacon heard and the
> session protection is over already...
> 6月 30 10:44:10 moelove kernel: wlp0s20f3: send auth to 20:a6:cd:9c:15:e2
> (try 3/3)
> 6月 30 10:44:09 moelove kernel: wlp0s20f3: Connection to AP 20:a6:cd:9c:15:e2
> lost
> 6月 30 10:44:09 moelove kernel: iwlwifi 0000:00:14.3: No beacon heard and the
> session protection is over already...
> 6月 30 10:44:09 moelove kernel: wlp0s20f3: send auth to 20:a6:cd:9c:15:e2
> (try 2/3)
> 6月 30 10:44:08 moelove kernel: wlp0s20f3: Connection to AP 20:a6:cd:9c:15:e2
> lost
> 6月 30 10:44:08 moelove kernel: iwlwifi 0000:00:14.3: No beacon heard and the
> session protection is over already...

Hello Zhang,
     What is your nic type? Is it 9260 and 9560? We need a firmware dump from 9260 or 9560 for further analysis.
Can you please share the firmware dump? Please follow the below link.

https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging

Thanks
Comment 259 triffid.hunter 2021-06-30 08:02:19 UTC
As an update to my comment #253, I've had a few random WiFi drops since then - but the incidence of random drops is drastically reduced with mikezackles's comment #172 patch and associated sysfs setting.
Comment 260 Jiri Konecny 2021-07-02 08:14:14 UTC
Hello,

experiencing this issue on my P1Gen2 ThinkPad NB with Fedora SilverBlue system.

$ uname -r
5.12.13-300.fc34.x86_64

My WiFi:
52:00.0 Network controller: Intel Corporation Wi-Fi 6 AX200 (rev 1a)

journal log output:

čec 02 09:52:01 loki.packetseekers.eu kernel: wlp82s0: associated
čec 02 09:52:01 loki.packetseekers.eu NetworkManager[1176]: <info>  [1625212321.2311] device (wlp82s0): supplicant interface state: associating -> 4way_handshake
čec 02 09:52:01 loki.packetseekers.eu NetworkManager[1176]: <info>  [1625212321.2311] device (p2p-dev-wlp82s0): supplicant management interface state: associating -> 4way_handshake
čec 02 09:52:01 loki.packetseekers.eu NetworkManager[1176]: <info>  [1625212321.2312] device (wlp82s0): DHCPv4 lease renewal requested
čec 02 09:52:01 loki.packetseekers.eu NetworkManager[1176]: <info>  [1625212321.2396] dhcp4 (wlp82s0): canceled DHCP transaction
čec 02 09:52:01 loki.packetseekers.eu NetworkManager[1176]: <info>  [1625212321.2396] dhcp4 (wlp82s0): state changed bound -> terminated
čec 02 09:52:01 loki.packetseekers.eu NetworkManager[1176]: <info>  [1625212321.2399] dhcp4 (wlp82s0): activation: beginning transaction (timeout in 45 seconds)
čec 02 09:52:01 loki.packetseekers.eu wpa_supplicant[1277]: wlp82s0: WPA: Key negotiation completed with 48:8f:5a:34:06:47 [PTK=CCMP GTK=CCMP]
čec 02 09:52:01 loki.packetseekers.eu wpa_supplicant[1277]: wlp82s0: CTRL-EVENT-CONNECTED - Connection to 48:8f:5a:34:06:47 completed [id=0 id_str=]
čec 02 09:52:01 loki.packetseekers.eu NetworkManager[1176]: <info>  [1625212321.2610] device (wlp82s0): supplicant interface state: 4way_handshake -> completed
čec 02 09:52:01 loki.packetseekers.eu NetworkManager[1176]: <info>  [1625212321.2740] device (p2p-dev-wlp82s0): supplicant management interface state: 4way_handshake -> completed
čec 02 09:52:02 loki.packetseekers.eu kernel: iwlwifi 0000:52:00.0: No beacon heard and the session protection is over already...
čec 02 09:52:02 loki.packetseekers.eu kernel: wlp82s0: Connection to AP 48:8f:5a:34:06:47 lost
čec 02 09:52:02 loki.packetseekers.eu wpa_supplicant[1277]: wlp82s0: CTRL-EVENT-SIGNAL-CHANGE above=0 signal=0 noise=9999 txrate=0
čec 02 09:52:02 loki.packetseekers.eu wpa_supplicant[1277]: wlp82s0: CTRL-EVENT-DISCONNECTED bssid=48:8f:5a:34:06:47 reason=4 locally_generated=1
čec 02 09:52:02 loki.packetseekers.eu wpa_supplicant[1277]: dbus: wpa_dbus_property_changed: no property SessionLength in object /fi/w1/wpa_supplicant1/Interfaces/0
čec 02 09:52:02 loki.packetseekers.eu NetworkManager[1176]: <info>  [1625212322.1372] device (wlp82s0): supplicant interface state: completed -> disconnected
čec 02 09:52:02 loki.packetseekers.eu NetworkManager[1176]: <info>  [1625212322.1373] device (p2p-dev-wlp82s0): supplicant management interface state: completed -> disconnected
čec 02 09:52:02 loki.packetseekers.eu NetworkManager[1176]: <info>  [1625212322.2310] device (wlp82s0): supplicant interface state: disconnected -> scanning
čec 02 09:52:02 loki.packetseekers.eu NetworkManager[1176]: <info>  [1625212322.2310] device (p2p-dev-wlp82s0): supplicant management interface state: disconnected -> scanning
čec 02 09:52:03 loki.packetseekers.eu wpa_supplicant[1277]: wlp82s0: SME: Trying to authenticate with 48:8f:5a:34:06:47 (SSID='TAJ-2' freq=2447 MHz)

Output from my Mikrotik router log:
A8:7E:EA:56:A4:E6@wlan1: disconnected, received deauth: no activity (4)


I read something that changing the AP settings could help. Do someone here knows how to do that on my Mikrotik router?
Comment 261 marc 2021-07-02 11:07:06 UTC
after a full 2 days of my holidays pulling my hair out & alienating myself from my family, I found something that is working well for me (for now).

I run fedora 34 on a fairly broken Toshiba Satellite LG50-A.  Suites me fine, & I get a kick out of wringing every last semblance of life out of this thing!

Long story short, to try:

echo 5000 > /sys/module/mac80211/parameters/probe_wait_ms 
echo 50 > /sys/module/mac80211/parameters/beacon_loss_count

If that works for you, you can make it permanent by:

cat << EOF > /etc/modprobe.d/mac80211.conf
option mac80211 beacon_loss_count=50
option mac80211 probe_wait_ms=5000
EOF

Probably overkill, & I can supply more info if asked.

My laptop's wifi has always been a bit crappy, tbh, & it's much better now, especially because I am forcing 40MHz wide wireless N - I know I shouldn't, but 2.4g is a cluster-truck where I am anyway.

Usual stuff; your mileage may vary etc. ad nausuem; all care no responsibility.  Good luck.
Comment 262 marc 2021-07-02 11:14:45 UTC
(In reply to marc from comment #261)
> after a full 2 days of my holidays pulling my hair out & alienating myself
> from my family, I found something that is working well for me (for now).
> 
> I run fedora 34 on a fairly broken Toshiba Satellite LG50-A.  Suites me
> fine, & I get a kick out of wringing every last semblance of life out of
> this thing!
> 
> Long story short, to try:
> 
> echo 5000 > /sys/module/mac80211/parameters/probe_wait_ms 
> echo 50 > /sys/module/mac80211/parameters/beacon_loss_count
> 
> If that works for you, you can make it permanent by:
> 
> cat << EOF > /etc/modprobe.d/mac80211.conf
> option mac80211 beacon_loss_count=50
> option mac80211 probe_wait_ms=5000
> EOF
> 
> Probably overkill, & I can supply more info if asked.
> 
> My laptop's wifi has always been a bit crappy, tbh, & it's much better now,
> especially because I am forcing 40MHz wide wireless N - I know I shouldn't,
> but 2.4g is a cluster-truck where I am anyway.
> 
> Usual stuff; your mileage may vary etc. ad nausuem; all care no
> responsibility.  Good luck.

btw, I replied here, because it's this post that helped me the most, so big thanks to all of those that helped me (indirectly) + a special thanks to whomever it was that talked about beacon intervals.  Also beware that this mod will likely have implications for your WPA2 setup (you'll be able to use it again :D )
Comment 263 mikezackles 2021-07-02 14:21:53 UTC
(In reply to marc from comment #261)

> option mac80211 beacon_loss_count=50
> option mac80211 probe_wait_ms=5000

Seems promising to me. I can't test this for a little while, but I'm hopeful that it will provide some relief without patching. If this turns out to be a definitive workaround, I can't believe it took this long for someone to point it out! Thanks for sharing!
Comment 264 marc 2021-07-03 09:09:27 UTC
(In reply to mikezackles from comment #263)
> (In reply to marc from comment #261)
> 
> > option mac80211 beacon_loss_count=50
> > option mac80211 probe_wait_ms=5000
> 
> Seems promising to me. I can't test this for a little while, but I'm hopeful
> that it will provide some relief without patching. If this turns out to be a
> definitive workaround, I can't believe it took this long for someone to
> point it out! Thanks for sharing!

So I got more curious about this & continued tinkering.  Here's what I found:

* my modprobe config was wrong.  Here it is now, with some further tweaks:
# cat /etc/modprobe.d/mac80211.conf 
options mac80211 beacon_loss_count=1000 probe_wait_ms=75
options ath9k debug=0xffffffff btcoex_enable=0 ps_enable=0 use_msi=0

I encourage any noobs (like me) to use modinfo & google to work out what I've done.

This config prevented the connection dropping.  I still get HUGE ping spikes - up to 3 seconds! but my connection doesn't actually drop anymore.

I can push data at about 5 to 8 MB/s to a local server using scp.  It would be faster, but I get about 10 second blocks followed by about 2 to 3 second periods of negligable data rates when I look using the system monitor; you could nearly time a German train on it; it's very frustrating.

Turning off IPv6 removed all errors from debug in my router & my laptop, so everything is hunky-dory according to that, & I'm just not going to know what's causing those flat spots in transmission rates.

I tried all sorts of different g & n, wide & 20MHz channels & the flat spots in my data rates do not go away.

I'm now so curious about this I could burst tbh.
Comment 265 marc 2021-07-04 04:17:24 UTC
(In reply to marc from comment #264)
> So I got more curious about this & continued tinkering.  Here's what I found:
> 
> * my modprobe config was wrong.  Here it is now, with some further tweaks:
> # cat /etc/modprobe.d/mac80211.conf 
> options mac80211 beacon_loss_count=1000 probe_wait_ms=75
> options ath9k debug=0xffffffff btcoex_enable=0 ps_enable=0 use_msi=0
> 
ok.  grr.  problem appears completely fixed for me now.  But I learned a lot.  But I'm embarrassed.  Here it is:
1) systemctl disable --now bluetooth # I used "rfkill event" to see that my radio was getting turned off every ~10 seconds!  I don't use bluetooth on my laptop, tough if I did
2) turn off ipv6 (fedora 34 is NetworkManager or a lot of work - I'm  allergic to work - that was the disconnect problem - I don't need IPv6 for now, so I'm happy, & there's definitely a problem there I think)
3) I had some acpi tuning in my kernel options that I turned off.  I don't think this made any difference at all, but I'm glad to be rid of having anything custom & for that exact same reason I was able to remove my modprobe customizations.

My wifi works for me now.  I average about 5MB/s (because people get confused, I'm saying ~ 5 * 1024 * 1204 * 8 bits per second) with anywhere from 2MB/s to about 12MB/s common variance probably due to interference on my HT40 2.4g 802.11n connection.

Very interestingly, I did find a 99999 us (10 seconds!) parameter in the aqm code which is just too interesting to pass up but I think that's OT to this bug report.

I really hope this helps someone - directly or otherwise.  Personally I resent the time I have to spend rooting these problems out, & I know it'll be a PITA if I ever decide I want to use bluetooh again.

Ping me if I can help you, I may be willing to - but OT for this bug.
Comment 266 kotrfa 2021-07-04 18:39:32 UTC
I can confirm that with the patch https://aur.archlinux.org/pkgbase/linux-beacon applied my wifi is working flawlessly (on Archlinux).
Comment 267 how 2021-07-05 08:22:47 UTC
I tried mac80211 module config beacon_loss_count=50 probe_wait_ms=5000 does not work, still has beacon lost and disconnect
Comment 268 marc 2021-07-05 09:10:47 UTC
(In reply to how from comment #267)
> I tried mac80211 module config beacon_loss_count=50 probe_wait_ms=5000 does
> not work, still has beacon lost and disconnect

see my previous post about MY hardware (tobshiba satellite l50-a, atheros bluetooth/2.4g pcie card) - in summary: try these things * turn ipv6 off * turn bluetooth off * increase the beacon loss count (probe wait seems less important).  see how you go.  Also I tried to bear in mind that this is bugzilla, not tech support.
Comment 269 triffid.hunter 2021-07-22 12:05:19 UTC
I just updated to 5.10.52 (LTS) (from 5.10.51) and despite having the workaround from comment #172 in my local patches, my WiFi is constantly dropping with a vengeance once again - every 20 seconds to a few minutes!

It occasionally states "No beacon heard and the time event is over already" but frequently just disconnects without giving a reason.

I'll try downgrading to 5.10.51 tomorrow and see if the errant behaviour also reverts.
Comment 270 Mathias 2021-07-22 12:06:44 UTC
Created attachment 297995 [details]
attachment-10273-0.html

Dear sender,

I am out of office until 26th of July 2021 and will answer your email after my return.
In urgent cases, please contact Mr. Gerd Brost (gerd.brost@aisec.fraunhofer.de), he will redirect you to the right person.

Best regards,
Mathias Morbitzer
Comment 271 nomos48143 2021-09-14 22:15:16 UTC
Disabling IPv6 via sysctl.conf really helped with the beacons dropping.
Comment 272 Prabesh 2021-09-19 04:29:30 UTC
Created attachment 298877 [details]
dmseg log in Fedora 34 which has 5.13.16-200.fc34.x86_64 kernel at the moment

```
pranav@fedora ~> ethtool -i wlp3s0
driver: iwlwifi
version: 5.13.16-200.fc34.x86_64
firmware-version: 17.3216344376.0 3160-17.ucode
expansion-rom-version:
bus-info: 0000:03:00.0
supports-statistics: yes
supports-test: no
supports-eeprom-access: no
supports-register-dump: no
supports-priv-flags: no
pranav@fedora ~> lspci -kvnn | sed -n '/Network/,/^$/ p'
03:00.0 Network controller [0280]: Intel Corporation Wireless 3160 [8086:08b3] (rev 83)
	Subsystem: Intel Corporation Dual Band Wireless AC 3160 [8086:8470]
	Flags: bus master, fast devsel, latency 0, IRQ 49
	Memory at c1000000 (64-bit, non-prefetchable) [size=8K]
	Capabilities: <access denied>
	Kernel driver in use: iwlwifi
	Kernel modules: iwlwifi
pranav@fedora ~>
```

Experienced similar bug today.
Comment 273 Niclaas Grehling 2021-11-11 12:19:25 UTC
Network controller [0280]: Intel Corporation Wi-Fi 6 AX201
driver: iwlwifi
version: 5.15.1-2.gdcfd611-default
firmware-version: 63.c04f3485.0 QuZ-a0-hr-b0-63.u

None of the proposed actions did really work, but switching from leap to tumbleweed and kernel 5.15.1-2.gdcfd611-default with the latest firmware didn't make the problem disappear, but 5ghz wifi now becomes usable (less than 1% packet loss).
Comment 274 elfosardo 2021-12-10 07:27:19 UTC
Same issue here with Fedora 35

Dec10 08:14] wlp4s0: Connection to AP 44:a6:1e:ea:9a:d5 lost
[  +0.288456] wlp4s0: authenticate with 44:a6:1e:ea:9a:d0
[  +0.002411] wlp4s0: send auth to 44:a6:1e:ea:9a:d0 (try 1/3)
[  +0.030269] wlp4s0: authenticated
[  +0.000575] wlp4s0: associate with 44:a6:1e:ea:9a:d0 (try 1/3)
[  +0.010794] wlp4s0: RX AssocResp from 44:a6:1e:ea:9a:d0 (capab=0x1411 status=0 aid=3)
[  +0.006745] wlp4s0: associated
[  +0.160819] wlp4s0: Limiting TX power to 20 (20 - 0) dBm as advertised by 44:a6:1e:ea:9a:d0
[  +4.147438] wlp4s0: deauthenticated from 44:a6:1e:ea:9a:d0 (Reason: 12=<unknown>)
[  +0.644001] wlp4s0: authenticate with 44:a6:1e:ea:9a:d5
[  +0.003845] wlp4s0: send auth to 44:a6:1e:ea:9a:d5 (try 1/3)
[  +0.004685] wlp4s0: authenticated
[  +0.000759] wlp4s0: associate with 44:a6:1e:ea:9a:d5 (try 1/3)
[  +0.003442] wlp4s0: RX AssocResp from 44:a6:1e:ea:9a:d5 (capab=0x1511 status=0 aid=8)
[  +0.002473] wlp4s0: associated
[  +0.603012] iwlwifi 0000:04:00.0: No beacon heard and the time event is over already...


$ ethtool -i wlp4s0
driver: iwlwifi
version: 5.15.6-200.fc35.x86_64
firmware-version: 36.ca7b901d.0 8265-36.ucode
expansion-rom-version: 
bus-info: 0000:04:00.0
supports-statistics: yes
supports-test: no
supports-eeprom-access: no
supports-register-dump: no
supports-priv-flags: no

$  lspci -kvnn | sed -n '/Network/,/^$/ p'
04:00.0 Network controller [0280]: Intel Corporation Wireless 8265 / 8275 [8086:24fd] (rev 78)
	Subsystem: Intel Corporation Dual Band Wireless-AC 8265 [8086:0010]
	Flags: bus master, fast devsel, latency 0, IRQ 163
	Memory at ec100000 (64-bit, non-prefetchable) [size=8K]
	Capabilities: <access denied>
	Kernel driver in use: iwlwifi
	Kernel modules: iwlwifi

$ uname -a
5.15.6-200.fc35.x86_64 #1 SMP Wed Dec 1 13:41:10 UTC 2021 x86_64 x86_64 x86_64 GNU/Linux
Comment 275 Esokrarkose 2021-12-12 10:21:51 UTC
Same thing happens to me on a Dell XPS 13 9310 2-in-1 (Intel Corporation Wi-Fi 6 AX201 (rev 20)):

[ 4136.870490] wlp0s20f3: authenticate with 3c:a6:2f:23:d9:7d
[ 4136.881492] wlp0s20f3: send auth to 3c:a6:2f:23:d9:7d (try 1/3)
[ 4137.001509] wlp0s20f3: authenticated
[ 4137.003196] wlp0s20f3: associate with 3c:a6:2f:23:d9:7d (try 1/3)
[ 4137.006526] wlp0s20f3: RX AssocResp from 3c:a6:2f:23:d9:7d (capab=0x1511 status=0 aid=1)
[ 4137.018060] wlp0s20f3: associated
[ 4137.781572] iwlwifi 0000:00:14.3: No beacon heard and the session protection is over already...
[ 4137.781672] wlp0s20f3: Connection to AP 3c:a6:2f:23:d9:7d lost

Router: Fritzbox 7530 and Fritz Repeater 2400: It tries to connect via 5Ghz to the 2400 AP but usually drops within minutes. Very annoying issue, especially when working via ssh on a remote server. Happens to me with latest firmware and kernel 5.15.7.

Why is this still flagged as NEEDINFO? What info do you need?
Comment 276 Frederico K. 2021-12-16 15:17:20 UTC
Not sure if it will help anyone, but I solved this with the core64 release of the iwlwifi drivers ( https://git.kernel.org/pub/scm/linux/kernel/git/iwlwifi/backport-iwlwifi.git/log/?h=release/core64 )


compiled with the instructions from: https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/core_release


my reasoning is that it already had the bug fix from https://git.kernel.org/pub/scm/linux/kernel/git/iwlwifi/backport-iwlwifi.git/commit/?id=53faa7fb4f69b2e28495b05e0a8bae7bcf47d60e


The next thing i did was get the microcode from here: https://git.kernel.org/pub/scm/linux/kernel/git/firmware/linux-firmware.git 



I used: iwlwifi-QuZ-a0-hr-b0-63.ucode



which i just copied to /lib/firmware



I am way out of my league here, but I thought it might help someone. This worked on a debian 11 



$ uname -a
Linux bml01 5.10.0-10-amd64 #1 SMP Debian 5.10.84-1 (2021-12-08) x86_64 GNU/Linux



with an intel comet lake wifi interface.  



$lspci -kvnn | sed -n '/Network/,/^$/ p'
00:14.3 Network controller [0280]: Intel Corporation Comet Lake PCH CNVi WiFi [8086:06f0]
	Subsystem: Intel Corporation Comet Lake PCH CNVi WiFi [8086:0070]
	Flags: bus master, fast devsel, latency 0, IRQ 16, IOMMU group 7
	Memory at 604111c000 (64-bit, non-prefetchable) [size=16K]
	Capabilities: [c8] Power Management version 3
	Capabilities: [d0] MSI: Enable- Count=1/1 Maskable- 64bit+
	Capabilities: [40] Express Root Complex Integrated Endpoint, MSI 00
	Capabilities: [80] MSI-X: Enable+ Count=16 Masked-
	Capabilities: [100] Latency Tolerance Reporting
	Capabilities: [164] Vendor Specific Information: ID=0010 Rev=0 Len=014 <?>
	Kernel driver in use: iwlwifi
	Kernel modules: iwlwifi
Comment 277 Jose Silva 2022-01-04 16:15:37 UTC
I've recently upgraded the kernel to 5.12.12 and it has become a constant annoyance with my 9260. Guess I'll need to start compiling the kernel with mikezackles patch...


Also this has been asked multiple times but who "NEEDSINFO"?
Comment 278 Jose Silva 2022-01-04 16:58:26 UTC
Installed the microcode from Luca's 196 comment and apparently it has a higher version than the intalled with Arch pacman????

old -> Jan 04 16:35:04 krystal kernel: iwlwifi 0000:04:00.0: loaded firmware version 46.6b541b68.0 9260-th-b0-jf-b0-46.ucode op_mode iwlmvm
new -> Jan 04 16:46:20 krystal kernel: iwlwifi 0000:04:00.0: loaded firmware version 46.8902351f.0 9260-th-b0-jf-b0-46.ucode op_mode iwlmvm


It fixed my issue. For anyone looking for a fix, just downgrade the uCode for the iwlwifi don't mess with anything else.
Comment 279 how 2022-01-18 11:09:08 UTC
what do you mean by downgrade ucode? There are several Intel wifi cards mentioned. The initial report is for 8260. Each card has different firmware, downgrade which?
Comment 280 Fabio Coatti 2022-01-26 12:16:39 UTC
(In reply to Yury Pakin from comment #235)

> After 2 hours and 49 minutes from the start of the test, the rx bitrate
> dropped to 1 MBit/s:
> 

Interesting, I'm experiencing the same behavior after roughly 3 hours after the machine is turned on and connected.
I have a Lenovo P50, (Intel Corporation Wireless 8260 (rev 3a)) kernel 5.16.2 and I'm experiencing this issue quite regularly since a lot of time. Oddly enough, after several disconnections the issue solves itself.

I'm going to try some of the suggestions in this thread to see if something changes. I have no problem into patching the kernel itself, I'm only not sure which patch to try, this thread is really something huge :)
Comment 281 NHonda 2022-01-26 12:25:32 UTC
(In reply to Fabio Coatti from comment #280)
> (In reply to Yury Pakin from comment #235)
> 
> > After 2 hours and 49 minutes from the start of the test, the rx bitrate
> > dropped to 1 MBit/s:
> > 
> 
> Interesting, I'm experiencing the same behavior after roughly 3 hours after
> the machine is turned on and connected.
> I have a Lenovo P50, (Intel Corporation Wireless 8260 (rev 3a)) kernel
> 5.16.2 and I'm experiencing this issue quite regularly since a lot of time.
> Oddly enough, after several disconnections the issue solves itself.
> 
> I'm going to try some of the suggestions in this thread to see if something
> changes. I have no problem into patching the kernel itself, I'm only not
> sure which patch to try, this thread is really something huge :)

Open the file drivers/net/wireless/intel/iwlwifi/mvm/mvm.h,
and change the value of 
IWL_MVM_MISSED_BEACONS_THRESHOLD_LONG 
much larger, for examle, from 16 to 256 or more.
Comment 282 Jose Silva 2022-01-27 12:23:52 UTC
(In reply to how from comment #279)
> what do you mean by downgrade ucode? There are several Intel wifi cards
> mentioned. The initial report is for 8260. Each card has different firmware,
> downgrade which?

9260, it's in the logs I included -> "9260-th-b0-jf-b0-46.ucode"
Comment 283 Fabio Coatti 2022-01-31 12:54:01 UTC
(In reply to NHonda from comment #281)
> (In reply to Fabio Coatti from comment #280)
> > (In reply to Yury Pakin from comment #235)
> > 
> > > After 2 hours and 49 minutes from the start of the test, the rx bitrate
> > > dropped to 1 MBit/s:
> > > 
> > 
> > Interesting, I'm experiencing the same behavior after roughly 3 hours after
> > the machine is turned on and connected.
> > I have a Lenovo P50, (Intel Corporation Wireless 8260 (rev 3a)) kernel
> > 5.16.2 and I'm experiencing this issue quite regularly since a lot of time.
> > Oddly enough, after several disconnections the issue solves itself.
> > 
> > I'm going to try some of the suggestions in this thread to see if something
> > changes. I have no problem into patching the kernel itself, I'm only not
> > sure which patch to try, this thread is really something huge :)
> 
> Open the file drivers/net/wireless/intel/iwlwifi/mvm/mvm.h,
> and change the value of 
> IWL_MVM_MISSED_BEACONS_THRESHOLD_LONG 
> much larger, for examle, from 16 to 256 or more.

Since I applied the patch (a couple of days) the behavior did not repeated and no issues with wifi connections. Thanks!
Comment 284 teisho 2022-02-19 12:21:24 UTC
Since I removed my AX200 from my Thinkpad, threw it in the bin and installed an RTL8852AE for a few bucks, I have had no problems at all.
I can finally use WiFi again.
Comment 285 Troy Volin 2022-02-25 16:17:49 UTC
Hi. I've had this problem for a very long time, but just now found that THIS bug report is really the place to be.
To sum up key points from this 3 year saga of a bug:

1) The patch enabling IWL_MVM_MISSED_BEACONS_THRESHOLD_LONG to be an exposed parameter (configurable in modprobe options; possibly even writable under /sys/modules/iwlmvm/parameters) seems to be *harmless*. There is debate about whether it fixes it for everyone, and whether a high setting may cause other delays. Linux is all about YMMV. I ask that this patch, leaving the default as-is, but making the setting configurable, be considered for merge.

2) It's about the access point driver. My google wifi hockey puck access point never gives me a problem, but my ISP-provided access point does.
I totally agree it's an AP driver bug, but that's not helpful. I'd like to be able to function when I'm in an airport, bookstore, etc. where I don't own the AP. 
As others have said, LOTS of other wifi adapters handle this particular access point driver "bug" without a problem. So inflexibility in THIS driver is a bug too. Note, even, the Windows driver for this same hardware handles this situation just fine.

3) Please be decent to each other ;)
Comment 286 mikezackles 2022-02-25 17:34:03 UTC
(In reply to Troy Volin from comment #285)
> I ask that this patch, leaving the default as-is, but making the setting
> configurable, be considered for merge.

At this point I'm a bit out of the loop when it comes to this bug, but if you're serious about getting the patch merged I'm guessing it might get more attention from people that matter on the linux wireless mailing list than here.
Comment 287 Troy Volin 2022-02-25 18:29:25 UTC
(In reply to mikezackles from comment #286)
> (In reply to Troy Volin from comment #285)
> > I ask that this patch, leaving the default as-is, but making the setting
> > configurable, be considered for merge.
> 
> At this point I'm a bit out of the loop when it comes to this bug, but if
> you're serious about getting the patch merged I'm guessing it might get more
> attention from people that matter on the linux wireless mailing list than
> here.

Thanks Mike.
How do you feel about submitting the patch you wrote, as per https://wireless.wiki.kernel.org/en/developers/documentation/submittingpatches ?
attachment 294585 [details]

Even if you feel out of the loop on the symptoms etc, this is still a "make a parameter out of what used to be a static value" patch.
If you'd rather, I can do it.
Comment 288 mikezackles 2022-02-25 18:36:18 UTC
(In reply to Troy Volin from comment #287)
> (In reply to mikezackles from comment #286)
> How do you feel about submitting the patch you wrote, as per
> https://wireless.wiki.kernel.org/en/developers/documentation/
> submittingpatches ?

Ha I guess I was being lazy :) Sure, I will do my best to submit it over the weekend and CC you.
Comment 289 Thomas Hinterberger 2022-03-16 12:44:51 UTC
I don't know, if my comment here is on the right place, because I have the "no beacon" message in dmesg very seldom, but all the symptoms are the same as described.

My router is a Fritz.Box 7530 XU - and 2 laptops with  MX-Linux 5.10.0-9-amd64


1) Acer R5-131T with AC 3165

intel wifi iwlwifi 0000:02:00.0: Detected Intel(R) Dual Band Wireless AC 3165, REV=0x210

2) Toshiba Portege X30-D with AC 8265

02:00.0 Network controller [0280]: Intel Corporation Wireless 8265 / 
8275 [8086:24fd] (rev 78)

The Acer works without any problems, when the router is set to use 2.4 GHz and 5 GHz on the same ssid. When I start beside the router it uses 5 GHz, when I walk away it switches to 2.4 GHz and I even don't notice, when watching a film and it stays on 2.4 GHz

The Portege works, when I set the router to use only 2.4 GHz without any problems - the same on Win Pro 10, when the router is set to 2.4 + 5 GHz.

in Linux I have freezes - films are stopping sometime breakdowns etc...

when I look in wavemon, it starts with 2.4 GHz and after about 45 sec it changes to 5 GHz and there the drops are starting.

So I thought, lets tell Network Manager to use only 2.4 GHz -  so I edited /etc/NetworkManager/system-connections under the [wifi] section, edit the band field to the following [wifi]): band=n

but this does not work - it is overruled the same behavior as before - so looking to something other possibility

modinfo iwlwifi | grep disable
                parm:           11n_disable:disable 11n functionality, bitmap: 1: full, 2: disable agg TX, 4: disable agg RX, 8 enable agg TX (uint)
                parm:           uapsd_disable:disable U-APSD functionality bitmap 1: BSS 2: P2P Client (default: 3) (uint)
                parm:           power_save:enable WiFi power management (default: disable) (bool)
                parm:           disable_11ac:Disable VHT capabilities (default: false) (bool)
                parm:           disable_11ax:Disable HE capabilities (default: false) (bool)


gives the possibility to add iwlwifi.disable_11ac=1 iwlwifi.disable_11ax=1 to grub

it still does not use the 2.4 GHz, it switches to 5 GHz after 45 sec, but it starts switching back to 2.4 GHz when the connection is lost - so its a permanent switching between 2.4 GHz and 5 GHz  - and the good thing is, that the film is not stopping - sometimes there are artefacts, but it continues

I think - to really use the card, it should be possible that iwlwifi uses the boot parameters to disable ac and ax.

And in the long term the AC 8265 should behave like the AC 3165
Comment 290 Mathias 2022-03-16 12:46:43 UTC
Created attachment 300577 [details]
attachment-10582-0.html

Dear sender,


I am out of office and have limited access to my e-mail. I will be back on March 21.


In urgent cases, please contact Christian Banse (christian.banse@aisec.fraunhofer.de).



Best regards


Mathias Morbitzer
Comment 291 Petr Kracik 2022-04-18 22:51:18 UTC
Kernel  5.17.3-arch1-1

problem still persist

[130573.479815] iwlwifi 0000:02:00.0: No beacon heard and the session protection is over already...


it is really annoying already.... 2 years and still it is NOT FIXED!
Comment 292 how 2022-04-19 02:45:48 UTC
try higher beacon_loss_count in iwlwifi modprobe. Should help a bit. I have no more beacon lost message to be found in system logs
Comment 293 wifiintelbugsolution 2022-05-05 19:25:06 UTC
Hello,

I had the same issues with my WiFi in 2021

      lspci -kvnn | sed -n '/Network/,/^$/ p'
04:00.0 Network controller [0280]: Intel Corporation Wireless 7260 [8086:08b2] (rev 83)
        Subsystem: Intel Corporation Wireless-N 7260 [8086:4270]
        Flags: bus master, fast devsel, latency 0, IRQ 49
        Memory at f0500000 (64-bit, non-prefetchable) [size=8K]
        Capabilities: [c8] Power Management version 3
        Capabilities: [d0] MSI: Enable+ Count=1/1 Maskable- 64bit+
        Capabilities: [40] Express Endpoint, MSI 00
        Capabilities: [100] Advanced Error Reporting
        
since I patch

a/drivers/net/wireless/intel/iwlwifi/mvm/utils.c b/drivers/net/wireless/intel/iwlwifi/mvm/utils.c
index d2cada0ab4264..3303fc85d76f5 100644
--- a/drivers/net/wireless/intel/iwlwifi/mvm/utils.c
+++ b/drivers/net/wireless/intel/iwlwifi/mvm/utils.c
@@ -1029,9 +1029,6 @@ bool iwl_mvm_rx_diversity_allowed(struct iwl_mvm *mvm)

        lockdep_assert_held(&mvm->mutex);

-       if (iwlmvm_mod_params.power_scheme != IWL_POWER_SCHEME_CAM)
-               return false;
-
        if (num_of_ant(iwl_mvm_get_valid_rx_ant(mvm)) == 1)
                return false;

--
2.33.0

my self build kernel WiFi works like a charm. 

I think it has somethink to do with antennas in energy saving mode, and WiFi module switch to other antenna to early without energizing it before using other antenna. So for me it seems that there are no checks if antenna has energy and is ready for use. After switching to other antenna without energy connection will not work well ...

Maybe I'll be wrong, but for me it worked. I'm not a developer, so I'm not able to implement such checks correctly

regards
Comment 294 triffid.hunter 2022-05-06 07:39:50 UTC
You're suggesting that reverting https://lore.kernel.org/all/iwlwifi.20211017113927.fc896bc5cdaa.I1d11da71b8a5cbe921a37058d5f578f1b14a2023@changeid/ fixes the issue?

If this code is related to power saving, might be fine for desktops but perhaps not laptops or other portables where power saving is more important
Comment 295 wifiintelbugsolution 2022-05-06 08:48:11 UTC
This could be the first step. I dont suggest reverting this, but doing this prevents the described problems for me. My suggestion is to implement checks to not use the antenna(s) if there is no power on it. It shold be used only if its powered up. For me stability is more importend then powersaving. I know other like longer runs of battery driven device, but it's no fun if WiFi don't work and maybe you need to reboot, which consumes eventually more energy.

Many comments here switch off powersavings to get stable WiFi, other put WiFi in debug-mode which also switch of powersavings and get more stable WiFi-device. Attemps to make time longer to wake up for the antennas in energy-saving mode helped a bit, but problems still present. The logical decission is to first power up antenna and then use it. Not to hope because signal is comming it switches on. It has to be check if its energized before using it.

I use this since November 2021 and at the moment at kernel 5.17.5. I had never seen any error messages after using this. 

regards
Comment 296 Matt 2022-05-13 19:42:19 UTC
(In reply to mikezackles from comment #288)
> (In reply to Troy Volin from comment #287)
> > (In reply to mikezackles from comment #286)
> > How do you feel about submitting the patch you wrote, as per
> > https://wireless.wiki.kernel.org/en/developers/documentation/
> > submittingpatches ?
> 
> Ha I guess I was being lazy :) Sure, I will do my best to submit it over the
> weekend and CC you.

https://patchwork.kernel.org/project/linux-wireless/patch/20220226045047.643695-1-mikezackles@gmail.com/ is this the patch that you created?
Comment 297 mikezackles 2022-05-13 20:39:20 UTC
(In reply to Matt from comment #296)
> (In reply to mikezackles from comment #288)
> > (In reply to Troy Volin from comment #287)
> > > (In reply to mikezackles from comment #286)
> > > How do you feel about submitting the patch you wrote, as per
> > > https://wireless.wiki.kernel.org/en/developers/documentation/
> > > submittingpatches ?
> > 
> > Ha I guess I was being lazy :) Sure, I will do my best to submit it over
> the
> > weekend and CC you.
> 
> https://patchwork.kernel.org/project/linux-wireless/patch/20220226045047.
> 643695-1-mikezackles@gmail.com/ is this the patch that you created?

Yes, that's the one! :)
Comment 298 triffid.hunter 2022-05-27 09:08:08 UTC
Hmm, after firing up 5.15.43, this issue has come back with a vengeance - it's disconnecting with "No beacon heard and the session protection is over already..." literally every 10 seconds like clockwork even though I have the patch from comment #172 and have tried values from 256 to 1024 in /sys/module/iwlwifi/parameters/beacon_timeout

Seems like turning off power management helps though…
Comment 299 NHonda 2022-05-27 11:01:56 UTC
(In reply to triffid.hunter from comment #298)
> Hmm, after firing up 5.15.43, this issue has come back with a vengeance -
> it's disconnecting with "No beacon heard and the session protection is over
> already..." literally every 10 seconds like clockwork even though I have the
> patch from comment #172 and have tried values from 256 to 1024 in
> /sys/module/iwlwifi/parameters/beacon_timeout
> 
> Seems like turning off power management helps though…

Don't confuse both the kernel messages
"No beacon heard and the session protection is over already"
and
"No beacon heard and the time event is over already",
because they are different kinds of timeout caused 
by different reasons or bugs.
The patch is irrelevant and has no effect
with the former "session protection timeout", I think.
Comment 300 mikecookie101 2022-08-26 15:54:16 UTC
I had a similar issue with Intel AX200 on Ubuntu 20.04 and 5.4.0-125-generic. My motherboard, ASUS B550-F gaming, had an antenna attachment for the WiFi port. I tried disabling IPv6 and power savings, but strangely enough, removing the antenna was the best fix for me, which makes it believable that it's related to the physical properties of the antenna.
Comment 301 Mathias 2022-08-26 15:55:49 UTC
Created attachment 301675 [details]
attachment-29102-0.html

Dear sender,


I am out of office and have limited access to my e-mail. I will be back on September 8.

In urgent cases, please contact my group leader, Gerd Brost (gerd.brost@aisec.fraunhofer.de).


Best regards

Mathias Morbitzer
Comment 302 Gene Stark 2022-09-06 22:34:26 UTC
Created attachment 301757 [details]
Log showing apparent race if beacon is lost during association

I'd really like to see this issue fixed.  I am not so concerned about the reason for beacon loss.  The real issue for me is that the system will drop offline permanently in some situations.  The attached log shows evidence of what I believe to be a race that occurs when the beacon is lost during re-association after a previous loss.  When this happens, the authorization fails with 'no-secrets' and will not restart.  I have confirmed that restarting the NetworkManager service will cause reconnection (I did this by setting a cron script to restart the NetworkManager at a specified time each day).  Instead of having to do this, the NetworkManager should be able to recover from this race and not drop permanently offline.
Comment 303 Nicolò 2022-09-30 13:01:46 UTC
I have the same issue with Intel 7265D. I have enabled debug flags for iwlwifi and installed the debug firmware. Which debug flags (https://git.kernel.org/pub/scm/linux/kernel/git/stable/linux.git/tree/drivers/net/wireless/intel/iwlwifi/iwl-debug.h#n129) would be interesting to enable to help debugging?
Comment 304 Jack Bamford 2022-09-30 20:25:10 UTC
Hi folks,

I managed to fix this problem. Had this issue with the 7260 and 8260 and AX200, the fix I found that worked was to change the channel to 52 which is a DFS channel, I tried different channels that were not DFS and those channels had the problem, No beacon heard at this time and constant disconnecting, reconnecting. 

Can some of you guys try a DFS channel if either the Access Point or Router supports it please, as I believe that it could be a problem with normal channels as such interference problems or nearby APs and Routers knocking you off 5Ghz, I live in a heavily populated location and there are not any APs and Routers that uses DFS channels. 

Something to note, the Laptop with the Intel 7260 had the antenna around the wrong way as HP labelled the cables wrong so check this as this was a issue on my old HP 4540s as some WIFI Cards have the antenna connections reversed.

Tested this with Ubuntu Desktop 18.04, 20.04 and 22.04. No tweaks have been made with Power Management or other configuration were changed.

Keep me updated.

Regards

Jack
Comment 305 James Addison (inactive) 2022-12-17 16:49:34 UTC
This issue appears fairly reproducible for me between a TP-Link router running OpenWRT and an Intel Corporation Wireless 8260 chipset.

When using Linux 6.0.0-5-amd64 the issue causes a wifi disconnection; NetworkManager attempts reconnection/re-authentication and this fails.

When using Linux 5.10.0-19-amd64 the issue either does not occur -- or it is silently handled: no disconnection occurs.

I haven't yet recompiled a kernel with iwlwifi firmware debugging support enabled but may do soon.
Comment 306 TJ 2023-04-01 08:18:53 UTC
I've recently begun experiencing this issue on an Asus T300CHI with 7265D that has, for the last several years, been entirely well behaved. Using mainline kernel builds, most recently v6.2.9, with Kubuntu 22.04, NetworkManager, wpa_supplicant, and:

tj@t300chi:~$ uname -r
6.2.9-tj+
tj@t300chi:~$ lspci -nnk -d ::280
02:00.0 Network controller [0280]: Intel Corporation Wireless 7265 [8086:095a] (rev 59)
        Subsystem: Intel Corporation Dual Band Wireless-AC 7265 [8086:5100]
        Kernel driver in use: iwlwifi
        Kernel modules: iwlwifi
tj@t300chi:~$ journalctl -k --grep 'iwlwifi.*Firmware loaded'
Apr 01 08:36:05 t300chi kernel: iwlwifi 0000:02:00.0: Firmware loaded: iwlwifi-7265D-29.ucode
$ iwconfig wlp2s0
wlp2s0    IEEE 802.11  ESSID:"AP"  
          Mode:Managed  Frequency:2.412 GHz  Access Point: B8:27:EB:4D:55:39   
          Bit Rate=57.8 Mb/s   Tx-Power=22 dBm   
          Retry short limit:7   RTS thr:off   Fragment thr:off
          Power Management:on
          Link Quality=42/70  Signal level=-68 dBm  
          Rx invalid nwid:0  Rx invalid crypt:0  Rx invalid frag:0
          Tx excessive retries:0  Invalid misc:200   Missed beacon:0

I have a couple of access points, both using Debian or Debian-based distro. Currently mostly see this with a RasPi operating on 2.4GHz with no other WiFi devices within range (I'm on a farm in midst of fields):

tj@noc:~$ uname -r
5.15.32-v7+
tj@noc$ cat /sys/firmware/devicetree/base/model
Raspberry Pi 3 Model B Rev 1.2

The RasPi doesn't log any messages when the client is failing.

When the repeated disconnects occur client logs show:

Apr 01 07:39:58 t300chi wpa_supplicant[730]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 01 07:39:58 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
...
Apr 01 07:39:59 t300chi wpa_supplicant[730]: wlp2s0: CTRL-EVENT-DISCONNECTED bssid=b8:27:eb:4d:55:39 reason=4 locally_generated=1
...
Apr 01 07:40:00 t300chi wpa_supplicant[730]: wlp2s0: SME: Trying to authenticate with b8:27:eb:4d:55:39 (SSID='soggy' freq=2412 MHz)
Apr 01 07:40:00 t300chi kernel: wlp2s0: authenticate with b8:27:eb:4d:55:39
...
Apr 01 07:40:00 t300chi kernel: wlp2s0: associate with b8:27:eb:4d:55:39 (try 1/3)
Apr 01 07:40:00 t300chi kernel: wlp2s0: RX AssocResp from b8:27:eb:4d:55:39 (capab=0x411 status=0 aid=1)
Apr 01 07:40:00 t300chi kernel: wlp2s0: associated
...
Apr 01 07:40:00 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 01 07:40:00 t300chi wpa_supplicant[730]: wlp2s0: WPA: Key negotiation completed with b8:27:eb:4d:55:39 [PTK=CCMP GTK=CCMP]
Apr 01 07:40:00 t300chi wpa_supplicant[730]: wlp2s0: CTRL-EVENT-CONNECTED - Connection to b8:27:eb:4d:55:39 completed [id=0 id_str=]
...
Apr 01 07:40:00 t300chi wpa_supplicant[730]: wlp2s0: CTRL-EVENT-DISCONNECTED bssid=b8:27:eb:4d:55:39 reason=4 locally_generated=1

At this point the entire connect - authenticate - disconnect loop constantly repeats every few seconds with IP addresses being gained and lost immediately.

I initially had occasional success by removing the PCI device and then rescanning, but mostly even that does not help:

$ iwl=$(readlink -e /sys/class/net/wlp2s0/device)
$ echo 1 | sudo dd of=${iwl}/remove
$ echo 1 | sudo dd of=${iwl%/*}/rescan

Log then shows:

Apr 01 07:53:50 t300chi kernel: pci 0000:02:00.0: [8086:095a] type 00 class 0xffffff
Apr 01 07:53:50 t300chi kernel: pci 0000:02:00.0: Max Payload Size set to 128 (was 16384, max 128)
Apr 01 07:53:50 t300chi kernel: pci 0000:02:00.0: 2.000 Gb/s available PCIe bandwidth, limited by 2.5 GT/s PCIe x1 link at 0000:00:1c.3 (capable of 3969.945 Gb/s...
...
Apr 01 07:53:50 t300chi kernel: pci 0000:02:00.0: buffer not found in pci_save_pcie_state
...
Apr 01 07:53:50 t300chi libvirtd[844]: internal error: Unknown PCI header type '127' for device '0000:02:00.0'
...
Apr 01 07:55:21 t300chi kernel: pci 0000:02:00.0: buffer not found in pci_save_pcie_state
Apr 01 07:55:22 t300chi kernel: pci 0000:02:00.0: timed out waiting for pending transaction; performing function level reset anyway
Apr 01 07:55:23 t300chi kernel: pci 0000:02:00.0: not ready 1023ms after FLR; waiting
Apr 01 07:55:24 t300chi kernel: pci 0000:02:00.0: not ready 2047ms after FLR; waiting
Apr 01 07:55:26 t300chi kernel: pci 0000:02:00.0: not ready 4095ms after FLR; waiting
Apr 01 07:55:31 t300chi kernel: pci 0000:02:00.0: not ready 8191ms after FLR; waiting
Apr 01 07:55:39 t300chi kernel: pci 0000:02:00.0: not ready 16383ms after FLR; waiting
Apr 01 07:55:56 t300chi kernel: pci 0000:02:00.0: not ready 32767ms after FLR; waiting
Apr 01 07:56:32 t300chi kernel: pci 0000:02:00.0: not ready 65535ms after FLR; giving up
Apr 01 07:56:32 t300chi kernel: pci 0000:02:00.0: buffer not found in pci_save_pcie_state

At this point a complete power-off to cold (not a warm reboot) resets the device to working condition.

As others have recently pointed out this does seem to be some interaction between firmware and driver.
Comment 307 TJ 2023-04-01 10:35:06 UTC
Having followed the instructions for Firmware Debugging from:

https://wireless.wiki.kernel.org/en/users/drivers/iwlwifi/debugging#firmware_debugging

And triggering a coredump the expected sysfs devcoredump node doesn't appear. Is the core dump only created if there is an error detected by the driver?

$ uname -r;
6.2.9-tj+

$ grep -E 'COREDUMP|IWLWIFI_DEBUG' /boot/config-$(uname -r);
CONFIG_COREDUMP=y
CONFIG_WANT_DEV_COREDUMP=y
CONFIG_ALLOW_DEV_COREDUMP=y
CONFIG_DEV_COREDUMP=y
CONFIG_IWLWIFI_DEBUG=y
CONFIG_IWLWIFI_DEBUGFS=y

$ journalctl -k --grep 'iwlwifi.*Firmware loaded'
Apr 01 11:09:45 t300chi kernel: iwlwifi 0000:02:00.0: Firmware loaded: iwlwifi-7265D-29.ucode

$ ls -l /usr/lib/firmware/iwlwifi-7265D-29*
-rw-r--r-- 1 root root 1036644 Apr  1 10:00 /usr/lib/firmware/iwlwifi-7265D-29.debug.ucode
-rw-r--r-- 1 root root 1036772 Mar 24 08:09 /usr/lib/firmware/iwlwifi-7265D-29.release.ucode
lrwxrwxrwx 1 root root      28 Apr  1 10:02 /usr/lib/firmware/iwlwifi-7265D-29.ucode -> iwlwifi-7265D-29.debug.ucode

$ cat /sys/class/devcoredump/disabled;
0

$ echo 1 | sudo dd of=/sys/kernel/debug/iwlwifi/0000:02:00.0/iwlmvm/fw_dbg_collect

$ journalctl -k --grep 'iwlwifi.*Collecting data';
Apr 01 11:17:13 t300chi kernel: iwlwifi 0000:02:00.0: Collecting data: trigger 1 fired.

$ ls /sys/devices/virtual/devcoredump
ls: cannot access '/sys/devices/virtual/devcoredump': No such file or directory
Comment 308 TJ 2023-04-01 16:03:45 UTC
It seems I followed the instructions a little too assiduously before testing it - and the udev rule + shell script were always triggered and writing the dump to /var/log/ and then - of course - clearing the /sys/devices/virtual/devcoredump/devcd*/data node. I didn't realise that would cause the /sys/devices/virtual/devcoredump/ node to be removed as well!

So, debug is ready to go. I have a hunch this may be related to power-saving since I'm almost convinced I've only seen it when the system has been running on battery (this morning's bug report was when the battery was at 47%), but much of the time the system is on AC charger power. So, I'm currently keeping it on AC for a few days without any suspend/resume cycles, then if no failures, will do some suspend/resume cycles and then let it run for another few days. After that I'll do a cold boot to battery power and give it a few days, etc.
Comment 309 TJ 2023-04-02 10:11:09 UTC
Interesting occurrence overnight. Did a cold boot at 19:35 and left it with GUI logged into my regular user account with screen locked. Came back to it at 10:13 and found there had been a prolonged period of "connection lost" - 544 in fact - between 08:21 and 08:44 at which point it connected successfully again and remained connected (and was usable when I returned). Theree were  

tj@t300chi $ journalctl -k --since="08:21" --until="08:44" --grep 'Connection to AP.*lost'  | wc -l
544

Apr 02 08:21:07 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 02 08:21:09 t300chi kernel: wlp2s0: authenticate with b8:27:eb:4d:55:39
Apr 02 08:21:09 t300chi kernel: wlp2s0: 80 MHz not supported, disabling VHT
Apr 02 08:21:09 t300chi kernel: wlp2s0: send auth to b8:27:eb:4d:55:39 (try 1/3)
Apr 02 08:21:09 t300chi kernel: wlp2s0: authenticated
Apr 02 08:21:09 t300chi kernel: wlp2s0: associate with b8:27:eb:4d:55:39 (try 1/3)
Apr 02 08:21:09 t300chi kernel: wlp2s0: RX AssocResp from b8:27:eb:4d:55:39 (capab=0x411 status=0 aid=1)
Apr 02 08:21:09 t300chi kernel: wlp2s0: associated
Apr 02 08:21:09 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 02 08:21:13 t300chi kernel: wlp2s0: authenticate with b8:27:eb:4d:55:39
Apr 02 08:21:13 t300chi kernel: wlp2s0: 80 MHz not supported, disabling VHT
Apr 02 08:21:13 t300chi kernel: wlp2s0: send auth to b8:27:eb:4d:55:39 (try 1/3)
Apr 02 08:21:13 t300chi kernel: wlp2s0: authenticated
Apr 02 08:21:13 t300chi kernel: wlp2s0: associate with b8:27:eb:4d:55:39 (try 1/3)
Apr 02 08:21:13 t300chi kernel: wlp2s0: RX AssocResp from b8:27:eb:4d:55:39 (capab=0x411 status=0 aid=1)
Apr 02 08:21:13 t300chi kernel: wlp2s0: associated
Apr 02 08:21:13 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost

There were no firmware crashes and no udevd derived dumps.

At those times, on the AP, hostapd reports repeated "disassociated" messages. What is interesting there is that for each client "connection lost" hostapd shows two "disassociated" reports. I wonder if this is 'normal' or some kind of clue.

tj@noc:~$ journalctl --since="08:21" --until="08:44" --grep 'disassociated'  | wc -l
1097

Apr 02 08:21:07 noc hostapd[545]: wlan0: STA 34:02:86:fc:ad:55 IEEE 802.11: disassociated
Apr 02 08:21:07 noc hostapd[545]: wlan0: STA 34:02:86:fc:ad:55 IEEE 802.11: disassociated
Apr 02 08:21:09 noc hostapd[545]: wlan0: STA 34:02:86:fc:ad:55 IEEE 802.11: associated
Apr 02 08:21:09 noc hostapd[545]: wlan0: STA 34:02:86:fc:ad:55 RADIUS: starting accounting session 8FEA9E8CA240D5D6
Apr 02 08:21:09 noc hostapd[545]: wlan0: STA 34:02:86:fc:ad:55 WPA: pairwise key handshake completed (RSN)
Apr 02 08:21:09 noc hostapd[545]: wlan0: STA 34:02:86:fc:ad:55 IEEE 802.11: disassociated
Apr 02 08:21:09 noc hostapd[545]: wlan0: STA 34:02:86:fc:ad:55 IEEE 802.11: disassociated
Apr 02 08:21:13 noc hostapd[545]: wlan0: STA 34:02:86:fc:ad:55 IEEE 802.11: associated
Apr 02 08:21:13 noc hostapd[545]: wlan0: STA 34:02:86:fc:ad:55 RADIUS: starting accounting session D373C9B8EA96D736
Apr 02 08:21:13 noc hostapd[545]: wlan0: STA 34:02:86:fc:ad:55 WPA: pairwise key handshake completed (RSN)
Apr 02 08:21:13 noc hostapd[545]: wlan0: STA 34:02:86:fc:ad:55 IEEE 802.11: disassociated
Apr 02 08:21:13 noc hostapd[545]: wlan0: STA 34:02:86:fc:ad:55 IEEE 802.11: disassociated
Comment 310 TJ 2023-04-02 10:42:11 UTC
In the immediate previous report I neglected to include info on the beacon loss. Not many were reported:

tj@t300chi:~$ journalctl --since="08:21" --until="08:44" --grep "CTRL-EVENT-BEACON-LOSS" 
Apr 02 08:21:00 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 02 08:21:06 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 02 08:21:31 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 02 08:22:11 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 02 08:22:13 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 02 08:22:14 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 02 08:22:28 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 02 08:22:39 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 02 08:22:59 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 02 08:36:14 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 02 08:36:16 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 02 08:41:57 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 02 08:42:00 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 02 08:42:02 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 02 08:42:46 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Comment 311 Frederico K. 2023-04-03 08:06:20 UTC
> I've recently begun experiencing this issue on an Asus T300CHI with 7265D
> that has, for the last several years, been entirely well behaved

This sounds like a hardware issue.
Comment 312 TJ 2023-04-04 14:07:41 UTC
I don't think it is hardware. Had another episode of loss that recovered fine with very few BEACON_LOSS events:

$ journalctl -k  --since="01:30" --until=02:00 | grep 'Connection to AP.*lost'  | wc -l
535

Apr 04 01:36:08 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 04 01:36:10 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 04 01:36:14 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
...
Apr 04 01:59:50 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 04 01:59:54 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 04 01:59:58 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost

$ journalctl  --since="00:00" -u wpa_supplicant --grep CTRL-EVENT-BEACON-LOSS
Apr 04 01:37:12 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 04 01:37:19 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 04 01:42:44 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 04 01:47:18 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 04 01:47:20 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 04 01:47:35 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 04 01:58:20 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Comment 313 TJ 2023-04-07 08:53:50 UTC
One more incident, system not powered down or suspended yet.

tj@t300chi:~$ uname -r
6.2.9-tj+

tj@t300chi:~$ uptime
 09:49:49 up 5 days, 14:14,  9 users,  load average: 0.05, 0.08, 0.10

$ journalctl -b --grep 'Connection.* lost|CTRL-EVENT-BEACON-LOSS' --since="2023-04-05" --until="2023-04-06" | awk '{lines[NR] = $0; if(NR>=5){delete lines[NR-5]}} NR<5 || /CTRL-EVENT-BEACON-LOSS/{print;next}END{for(l in lines){print lines[l]}}'
Apr 05 04:42:20 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 05 04:42:21 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 05 04:42:28 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 05 04:42:30 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 05 04:44:43 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 05 04:44:44 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 05 04:44:46 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 05 04:48:16 t300chi wpa_supplicant[698]: wlp2s0: CTRL-EVENT-BEACON-LOSS
Apr 05 04:51:14 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 05 04:51:16 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 05 04:51:18 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 05 04:51:19 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost
Apr 05 04:51:23 t300chi kernel: wlp2s0: Connection to AP b8:27:eb:4d:55:39 lost

I'm going to complete a full week (7 days) like this, then remove external power and do a cold start -  only recharging to 100% when battery level drops to around 20%.
Comment 314 untainsYD 2023-09-12 10:59:47 UTC
I have the same error, it's a really annoying one with `lost beacon`.
```sh
> uname -r
6.5.2-arch1-1
```

```sh
> lspci -kvnn | sed -n '/Network/,/^$/ p'
09:00.0 Network controller [0280]: Intel Corporation Wireless 7260 [8086:08b2] (rev 73)
	Subsystem: Intel Corporation Wireless-N 7260 [Wilkins Peak 2] [8086:4262]
	Flags: bus master, fast devsel, latency 0, IRQ 33
	Memory at d5000000 (64-bit, non-prefetchable) [size=8K]
	Capabilities: <access denied>
	Kernel driver in use: iwlwifi
	Kernel modules: iwlwifi
```