Bug 218888

Summary: ath11k: regression in 6.9: connecting to 6Ghz fails
Product: Drivers Reporter: Andres Freund (andres)
Component: network-wirelessAssignee: drivers_network-wireless (drivers_network-wireless)
Status: RESOLVED PATCH_ALREADY_AVAILABLE    
Severity: normal CC: juansmurphy, kvalo, skysaildiamond
Priority: P3    
Hardware: All   
OS: Linux   
Kernel Version: 6.9 Subsystem:
Regression: Yes Bisected commit-id: bc8a0fac8677f729ab17176023be1a2653e8b9c6
Attachments: lspci-output

Description Andres Freund 2024-05-25 02:49:42 UTC
Created attachment 306342 [details]
lspci-output

Hi,

On my Lenovo P14s Gen 4 AMD, I noticed that with 6.9 I can't connect to a 6ghz AP anymore:

2024-05-24T13:46:06.027948-07:00 alap6 wpa_supplicant[1173]: wl0: SME: Trying to authenticate with 9e:05:d6:ae:ed:32 (SSID='void-6g' freq=6135 MHz)
2024-05-24T13:46:06.028110-07:00 alap6 wpa_supplicant[1173]: wl0: SME: Authentication request to the driver failed

The, I think, relevant excerpt from wpa_supplication debug-level logs shows:
2024-05-24T13:46:06.028040-07:00 alap6 wpa_supplicant[1173]: nl80211: Authenticate (ifindex=5)
2024-05-24T13:46:06.028042-07:00 alap6 wpa_supplicant[1173]:   * bssid=9e:05:d6:ae:ed:32
2024-05-24T13:46:06.028043-07:00 alap6 wpa_supplicant[1173]:   * freq=6135
2024-05-24T13:46:06.028044-07:00 alap6 wpa_supplicant[1173]:   * SSID=void-6g
2024-05-24T13:46:06.028046-07:00 alap6 wpa_supplicant[1173]:   * IEs - hexdump(len=0): [NULL]
2024-05-24T13:46:06.028050-07:00 alap6 wpa_supplicant[1173]:   * auth_data - hexdump(len=102): 01 00 7e 00 13 00 62 28 8c 9b 95 8c d1 0e 3d ea 04 78 38 dc 87 79 7c cf 16 de 0d df 54 31 1d 50 7a da 1a 2a 56 e9 a6 a2 de b9 ae 7b e6 1b 82 e7 cb b8 45 3e 77 76 f3 fb 2e 52 74 8a 5d c3 66 36 46 4c 12 4b ec 80 d2 f6 91 ec 5>
2024-05-24T13:46:06.028053-07:00 alap6 wpa_supplicant[1173]:   * Auth Type 4
2024-05-24T13:46:06.028108-07:00 alap6 wpa_supplicant[1173]: wl0: nl80211: MLME command failed (auth): count=0 ret=-22 (Invalid argument)


The exact same AP configuration does work with earlier kernels. I bisected the regression, which unfortunately ended up blaming a merge.

Works:
2024-01-23  9d5f28c1366f4  wifi: ath11k: fix connection failure due to unexpected peer delete                        
2024-01-26  28b3df1fe6ba2  kunit: add wireless unit tests                                                            

Does not work:
2024-01-31  6c76dd3a91e33  Merge tag 'ath-next-20240130' of git://git.kernel.org/pub/scm/linux/kernel/git/kvalo/ath

I.e. both parents of the merge individually work, just after the merge it doesn't.  I was suspicious of the bisection result, but it reproduces across several boots.

I tried upgrading the firmware from Debian's firmware-atheros=20230625-2. But there was no improvement with the firmware from linux-firmware (ec8627ef), nor ath11k-firmware (ec8627ef).


Laptop model: Lenovo P14s Gen 4 AMD - 21K5CTO1WW
BIOS version: R2FET55W (1.35 )
Reproducibility: 100%, many tries
Distribution: Debian unstable
lspci: attached
dmesg excerpt:

Working:
2024-05-24T18:43:40.219336-07:00 alap6 kernel: Linux version 6.8.0-rc1-andres-00350-g28b3df1fe6ba (andres@alap6) (x86_64-linux-gnu-gcc (Debian 13.2.0-25) 13.2.0, GNU ld (GNU Binutils for Debian) 2.42) #138 SMP PREEMPT_DYNAMIC Fri May 24 18:02:51 PDT 2024
2024-05-24T18:48:45.444248-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: BAR 0 [mem 0x78600000-0x787fffff 64bit]: assigned
2024-05-24T18:48:45.444275-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: enabling device (0000 -> 0002)
2024-05-24T18:48:45.445931-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: MSI vectors: 32
2024-05-24T18:48:45.448760-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: wcn6855 hw2.1
2024-05-24T18:48:46.600269-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: chip_id 0x2 chip_family 0xb board_id 0xff soc_id 0x400c0210
2024-05-24T18:48:46.601670-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: fw_version 0x11088c35 fw_build_timestamp 2024-04-17 08:34 fw_build_id WLAN.HSP.1.1-03125-QCAHSPSWPL_V1_V2_SILICONZ_LITE-3.6510.41
2024-05-24T18:48:47.086641-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: WBRF is not supported
2024-05-24T18:48:47.088656-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: Failed to set the requested Country regulatory setting
2024-05-24T18:48:47.089710-07:00 alap6 kernel: ath11k_pci 0000:02:00.0 wl0: renamed from wlan0
2024-05-24T18:48:47.090361-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: Failed to set the requested Country regulatory setting
2024-05-24T18:49:18.996390-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: failed to enqueue rx buf: -28

Broken:
2024-05-24T18:36:19.187177-07:00 alap6 kernel: Linux version 6.8.0-rc1-andres-00422-g6c76dd3a91e3 (andres@alap6) (x86_64-linux-gnu-gcc (Debian 13.2.0-25) 13.2.0, GNU ld (GNU Binutils for Debian) 2.42) #132 SMP PREEMPT_DYNAMIC Fri May 24 16:29:04 PDT 2024
2024-05-24T18:36:49.621362-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: BAR 0 [mem 0x78600000-0x787fffff 64bit]: assigned
2024-05-24T18:36:49.621395-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: enabling device (0000 -> 0002)
2024-05-24T18:36:49.623245-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: MSI vectors: 32
2024-05-24T18:36:49.624226-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: wcn6855 hw2.1
2024-05-24T18:36:50.772797-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: chip_id 0x2 chip_family 0xb board_id 0xff soc_id 0x400c0210
2024-05-24T18:36:50.773826-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: fw_version 0x11088c35 fw_build_timestamp 2024-04-17 08:34 fw_build_id WLAN.HSP.1.1-03125-QCAHSPSWPL_V1_V2_SILICONZ_LITE-3.6510.41
2024-05-24T18:36:51.287518-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: WBRF is not supported
2024-05-24T18:36:51.289293-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: Failed to set the requested Country regulatory setting
2024-05-24T18:36:51.291131-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: failed to process regulatory info -22
2024-05-24T18:36:51.291431-07:00 alap6 kernel: ath11k_pci 0000:02:00.0 wl0: renamed from wlan0
2024-05-24T18:36:51.292083-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: Failed to set the requested Country regulatory setting
2024-05-24T18:36:51.293331-07:00 alap6 kernel: ath11k_pci 0000:02:00.0: failed to process regulatory info -22


I tried enabling ath11k debug (0xffffffff), but it very quickly gets voluminous. Happy to send a full log or logs with a more appropriate mask.

Greetings,

Andres
Comment 1 Kalle Valo 2024-05-25 06:39:17 UTC
This commit should fix it:

https://git.kernel.org/pub/scm/linux/kernel/git/kvalo/ath.git/commit/?h=ath-current&id=6e16782d6b4a724f9c9dcd49471219643593b60c

Please let me know if it helped or not. That commit is queued for v6.10.
Comment 2 Andres Freund 2024-05-25 20:50:51 UTC
Yes, it indeed does fix the problem. I applied it on top of 6c76dd3a91e33 (the first commit that didn't work), and that sufficed to be able to connect again.
Comment 4 Myrtle Hendricks 2024-11-18 09:16:12 UTC Comment hidden (spam)