Bug 202971

Summary: Sometimes when i boot/reboot on linux atheros QCA9377 doesnt work
Product: Drivers Reporter: siyia (eutychios23)
Component: network-wirelessAssignee: drivers_network-wireless (drivers_network-wireless)
Status: NEW ---    
Severity: high    
Priority: P1    
Hardware: All   
OS: Linux   
Kernel Version: 4.20/5.0 Subsystem:
Regression: No Bisected commit-id:

Description siyia 2019-03-19 16:07:14 UTC
when i boot on linux sometimes the wireless doesnt work, and i cannot start/restart them module.This is what i get when it doesnt work:

dmesg | grep ath
[   16.447342] ath10k_pci 0000:03:00.0: pci irq msi oper_irq_mode 2 irq_mode 0 reset_mode 0
[   16.793430] ath10k_pci 0000:03:00.0: qca9377 hw1.1 target 0x05020001 chip_id 0x003821ff sub 11ad:08a6
[   16.793433] ath10k_pci 0000:03:00.0: kconfig debug 1 debugfs 1 tracing 1 dfs 0 testmode 0
[   16.793950] ath10k_pci 0000:03:00.0: firmware ver WLAN.TF.2.1-00021-QCARMSWP-1 api 6 features wowlan,ignore-otp crc32 42e41877
[   17.205385] ath10k_pci 0000:03:00.0: board_file api 2 bmi_id N/A crc32 8aedfa4a
[   17.286869] ath10k_pci 0000:03:00.0: unsupported HTC service id: 1536
[   17.305564] ath10k_pci 0000:03:00.0: htt-ver 3.56 wmi-op 4 htt-op 3 cal otp max-sta 32 raw 0 hwcrypto 1
[   17.316371] ath10k_pci 0000:03:00.0: failed to read calibration data: -16
[   20.401914] ath10k_pci 0000:03:00.0: wmi command 16389 timeout, restarting hardware
[   23.601933] ath10k_pci 0000:03:00.0: wmi command 69633 timeout, restarting hardware
[   23.601937] ath10k_pci 0000:03:00.0: could not suspend target (-11)
[   23.602879] ath: EEPROM regdomain: 0x69
[   23.602880] ath: EEPROM indicates we should expect a direct regpair map
[   23.602881] ath: Country alpha2 being used: 00
[   23.602882] ath: Regpair used: 0x69
[   23.729911] ath10k_pci 0000:03:00.0 wlp3s0: renamed from wlan0
[   23.750750] ath10k_pci 0000:03:00.0: cannot restart a device that hasn't been started
[   25.232954] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[   25.232958] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[   25.233053] ath10k_pci 0000:03:00.0: Could not init hif: -16
[   25.386229] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[   25.386232] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[   25.386324] ath10k_pci 0000:03:00.0: Could not init hif: -16
[   25.546287] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[   25.546289] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[   25.546377] ath10k_pci 0000:03:00.0: Could not init hif: -16
[   25.699659] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[   25.699662] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[   25.699760] ath10k_pci 0000:03:00.0: Could not init hif: -16
[   36.476717] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[   36.476720] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[   36.476812] ath10k_pci 0000:03:00.0: Could not init hif: -16
[   36.629608] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[   36.629610] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[   36.629698] ath10k_pci 0000:03:00.0: Could not init hif: -16
[   47.476241] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[   47.476246] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[   47.476346] ath10k_pci 0000:03:00.0: Could not init hif: -16
[   47.632925] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[   47.632928] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[   47.633023] ath10k_pci 0000:03:00.0: Could not init hif: -16
[   58.476640] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[   58.476644] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[   58.476736] ath10k_pci 0000:03:00.0: Could not init hif: -16
[   58.629983] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[   58.629986] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[   58.630079] ath10k_pci 0000:03:00.0: Could not init hif: -16
[   69.472894] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[   69.472896] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[   69.472984] ath10k_pci 0000:03:00.0: Could not init hif: -16
[   69.626575] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[   69.626577] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[   69.626665] ath10k_pci 0000:03:00.0: Could not init hif: -16
[   80.473314] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[   80.473318] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[   80.473410] ath10k_pci 0000:03:00.0: Could not init hif: -16
[   80.626644] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[   80.626647] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[   80.626740] ath10k_pci 0000:03:00.0: Could not init hif: -16
[  600.526174] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[  600.526177] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[  600.526270] ath10k_pci 0000:03:00.0: Could not init hif: -16

when it works i get this:

dmesg | grep ath
[    9.939189] ath10k_pci 0000:03:00.0: pci irq msi oper_irq_mode 2 irq_mode 0 reset_mode 0
[   10.229739] ath10k_pci 0000:03:00.0: qca9377 hw1.1 target 0x05020001 chip_id 0x003821ff sub 11ad:08a6
[   10.229742] ath10k_pci 0000:03:00.0: kconfig debug 1 debugfs 1 tracing 1 dfs 0 testmode 0
[   10.230252] ath10k_pci 0000:03:00.0: firmware ver WLAN.TF.2.1-00021-QCARMSWP-1 api 6 features wowlan,ignore-otp crc32 42e41877
[   10.308172] ath10k_pci 0000:03:00.0: board_file api 2 bmi_id N/A crc32 8aedfa4a
[   10.384193] ath10k_pci 0000:03:00.0: unsupported HTC service id: 1536
[   10.402877] ath10k_pci 0000:03:00.0: htt-ver 3.56 wmi-op 4 htt-op 3 cal otp max-sta 32 raw 0 hwcrypto 1
[   10.404443] ath: EEPROM regdomain: 0x69
[   10.404444] ath: EEPROM indicates we should expect a direct regpair map
[   10.404445] ath: Country alpha2 being used: 00
[   10.404445] ath: Regpair used: 0x69
[   10.612880] ath10k_pci 0000:03:00.0 wlp3s0: renamed from wlan0
[   14.813185] ath10k_pci 0000:03:00.0: unsupported HTC service id: 1536
[   15.096431] ath10k_pci 0000:03:00.0: unsupported HTC service id: 1536

specs:
OS:Manjaro
Kernels: 4.20/5.0
Firmware:Latest available
acer aspire a315-41g
Atheros QCA9377
Comment 1 siyia 2019-03-19 18:19:11 UTC
i alos get this when the wireless works and then suddenly stops:

[  128.139114] ath10k_pci 0000:03:00.0: failed to receive scan abortion completion: timed out
[  128.139123] ath10k_pci 0000:03:00.0: failed to stop scan: -110
[  128.139127] ath10k_pci 0000:03:00.0: failed to start hw scan: -110
[  131.339080] ath10k_pci 0000:03:00.0: wmi command 24579 timeout, restarting hardware
[  131.349897] ath10k_pci 0000:03:00.0: failed to read hi_board_data address: -16
[  131.351103] ath10k_pci 0000:03:00.0: failed to flush transmit queue (skip 1 ar-state 2): 1500
[  131.352168] ath10k_pci 0000:03:00.0: failed to delete peer 5e:07:26:35:80:f1 for vdev 0: -108
[  131.352223] Modules linked in: cmac rfcomm ccm fuse i2c_hid bnep nls_iso8859_1 nls_cp437 vfat fat uvcvideo videobuf2_vmalloc videobuf2_memops amdgpu videobuf2_v4l2 videobuf2_common chash edac_mce_amd videodev media kvm_amd amd_iommu_v2 ccp gpu_sched rng_core kvm i2c_algo_bit ttm irqbypass btusb btrtl btbcm btintel crct10dif_pclmul crc32_pclmul ghash_clmulni_intel bluetooth drm_kms_helper arc4 aesni_intel drm snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio aes_x86_64 ecdh_generic snd_hda_codec_hdmi crypto_simd agpgart ath10k_pci ath10k_core ath mac80211 snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_timer snd syscopyarea sysfillrect sp5100_tco sysimgblt fb_sys_fops acer_wmi joydev cfg80211 r8168(OE) r8169 sparse_keymap mousedev crc16 realtek soundcore cryptd hid_multitouch rfkill libphy rtsx_pci_ms wmi_bmof glue_helper k10temp memstick i2c_piix4 ac pcc_cpufreq acpi_cpufreq wmi input_leds pcspkr battery acer_wireless pinctrl_amd evdev mac_hid uinput crypto_user
[  131.352612] ath10k_pci 0000:03:00.0: failed to recalculate rts/cts prot for vdev 0: -108
[  131.352614] ath10k_pci 0000:03:00.0: failed to set cts protection for vdev 0: -108
[  131.352616] ath10k_pci 0000:03:00.0: failed to set erp slot for vdev 0: -108
[  131.506490] ath10k_pci 0000:03:00.0: Failed to get pcie state addr: -16
[  131.506494] ath10k_pci 0000:03:00.0: failed to setup init config: -16
[  131.506586] ath10k_pci 0000:03:00.0: Could not init hif: -16
[  131.506657] Modules linked in: cmac rfcomm ccm fuse i2c_hid bnep nls_iso8859_1 nls_cp437 vfat fat uvcvideo videobuf2_vmalloc videobuf2_memops amdgpu videobuf2_v4l2 videobuf2_common chash edac_mce_amd videodev media kvm_amd amd_iommu_v2 ccp gpu_sched rng_core kvm i2c_algo_bit ttm irqbypass btusb btrtl btbcm btintel crct10dif_pclmul crc32_pclmul ghash_clmulni_intel bluetooth drm_kms_helper arc4 aesni_intel drm snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio aes_x86_64 ecdh_generic snd_hda_codec_hdmi crypto_simd agpgart ath10k_pci ath10k_core ath mac80211 snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_timer snd syscopyarea sysfillrect sp5100_tco sysimgblt fb_sys_fops acer_wmi joydev cfg80211 r8168(OE) r8169 sparse_keymap mousedev crc16 realtek soundcore cryptd hid_multitouch rfkill libphy rtsx_pci_ms wmi_bmof glue_helper k10temp memstick i2c_piix4 ac pcc_cpufreq acpi_cpufreq wmi input_leds pcspkr battery acer_wireless pinctrl_amd evdev mac_hid uinput crypto_user
[  131.506919] Modules linked in: cmac rfcomm ccm fuse i2c_hid bnep nls_iso8859_1 nls_cp437 vfat fat uvcvideo videobuf2_vmalloc videobuf2_memops amdgpu videobuf2_v4l2 videobuf2_common chash edac_mce_amd videodev media kvm_amd amd_iommu_v2 ccp gpu_sched rng_core kvm i2c_algo_bit ttm irqbypass btusb btrtl btbcm btintel crct10dif_pclmul crc32_pclmul ghash_clmulni_intel bluetooth drm_kms_helper arc4 aesni_intel drm snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio aes_x86_64 ecdh_generic snd_hda_codec_hdmi crypto_simd agpgart ath10k_pci ath10k_core ath mac80211 snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_timer snd syscopyarea sysfillrect sp5100_tco sysimgblt fb_sys_fops acer_wmi joydev cfg80211 r8168(OE) r8169 sparse_keymap mousedev crc16 realtek soundcore cryptd hid_multitouch rfkill libphy rtsx_pci_ms wmi_bmof glue_helper k10temp memstick i2c_piix4 ac pcc_cpufreq acpi_cpufreq wmi input_leds pcspkr battery acer_wireless pinctrl_amd evdev mac_hid uinput crypto_user
[  131.507198] Modules linked in: cmac rfcomm ccm fuse i2c_hid bnep nls_iso8859_1 nls_cp437 vfat fat uvcvideo videobuf2_vmalloc videobuf2_memops amdgpu videobuf2_v4l2 videobuf2_common chash edac_mce_amd videodev media kvm_amd amd_iommu_v2 ccp gpu_sched rng_core kvm i2c_algo_bit ttm irqbypass btusb btrtl btbcm btintel crct10dif_pclmul crc32_pclmul ghash_clmulni_intel bluetooth drm_kms_helper arc4 aesni_intel drm snd_hda_codec_realtek snd_hda_codec_generic ledtrig_audio aes_x86_64 ecdh_generic snd_hda_codec_hdmi crypto_simd agpgart ath10k_pci ath10k_core ath mac80211 snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep snd_pcm snd_timer snd syscopyarea sysfillrect sp5100_tco sysimgblt fb_sys_fops acer_wmi joydev cfg80211 r8168(OE) r8169 sparse_keymap mousedev crc16 realtek soundcore cryptd hid_multitouch rfkill libphy rtsx_pci_ms wmi_bmof glue_helper k10temp memstick i2c_piix4 ac pcc_cpufreq acpi_cpufreq wmi input_leds pcspkr battery acer_wireless pinctrl_amd evdev mac_hid uinput crypto_user
Comment 2 siyia 2019-03-20 09:57:56 UTC
I think it is relate to pcie power management, i was getting some bad tlp errors as aer with the parameter pcie_aspm=off they do not appear anymore, i will further test the wireless.
Comment 3 siyia 2019-03-20 13:56:55 UTC
this is what i get without pcie_aspm=off:

[ 21.803636] r8169 0000:02:00.1 enp2s0f1: Link is Down
[ 21.809096] pcieport 0000:00:01.7: AER: Corrected error received: 0000:00:01.0
[ 21.809129] pcieport 0000:00:01.7: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
[ 21.809131] pcieport 0000:00:01.7: device [1022:15d3] error status/mask=00000080/00006000
[ 21.809134] pcieport 0000:00:01.7: [ 7] BadDLLP
[ 21.847679] pcieport 0000:00:01.7: AER: Multiple Corrected error received: 0000:00:01.0
[ 21.847710] pcieport 0000:00:01.7: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
[ 21.847713] pcieport 0000:00:01.7: device [1022:15d3] error status/mask=00000040/00006000
[ 21.847715] pcieport 0000:00:01.7: [ 6] BadTLP
[ 22.033999] ath10k_pci 0000:03:00.0: unsupported HTC service id: 1536
[ 22.067062] pcieport 0000:00:01.7: AER: Corrected error received: 0000:00:01.0
[ 22.067093] pcieport 0000:00:01.7: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
[ 22.067095] pcieport 0000:00:01.7: device [1022:15d3] error status/mask=00000040/00006000
[ 22.067098] pcieport 0000:00:01.7: [ 6] BadTLP
[ 22.100974] pcieport 0000:00:01.7: AER: Corrected error received: 0000:00:01.0
[ 22.101001] pcieport 0000:00:01.7: PCIe Bus Error: severity=Corrected, type=Data Link Layer, (Receiver ID)
[ 22.101003] pcieport 0000:00:01.7: device [1022:15d3] error status/mask=00000040/00006000
[ 22.101013] pcieport 0000:00:01.7: [ 6] BadTLP
Comment 4 siyia 2019-03-20 13:58:06 UTC
pcie_aspm=off completely removes those errors and fixes any wireless connectivy problems with atheros QCA9377, but at the cost of some power savings.