Bug 153961 - mac80211 crash under monitoring
Summary: mac80211 crash under monitoring
Status: CLOSED CODE_FIX
Alias: None
Product: Drivers
Classification: Unclassified
Component: network-wireless (show other bugs)
Hardware: Intel Linux
: P1 normal
Assignee: Johannes Berg
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2016-08-25 20:25 UTC by Kamil Toman
Modified: 2016-08-30 06:00 UTC (History)
2 users (show)

See Also:
Kernel Version: 4.6.4
Subsystem:
Regression: No
Bisected commit-id:


Attachments

Description Kamil Toman 2016-08-25 20:25:34 UTC
System went unresponsive under a monitored networking operation:
Aug 25 21:25:18 oryx kernel: [  400.464843] NMI watchdog: BUG: soft lockup - CPU#0 stuck for 22s! [wpa_supplicant:4260]

Aug 25 21:24:50 oryx kernel: [  372.764887] ------------[ cut here ]------------
Aug 25 21:24:50 oryx kernel: [  372.764940] WARNING: CPU: 0 PID: 4260 at net/mac80211/util.c:2847 ieee80211_chandef_downgrade+0x40/0x160 [mac80211]
Aug 25 21:24:50 oryx kernel: [  372.764944] Modules linked in: bnep vmnet(OE) vmw_vsock_vmci_transport vsock vmw_vmci vmmon(OE) drbg ansi_cprng ctr ccm ipt_MASQUERADE nf_nat_masquerade_ipv4 iptable_nat nf_conntrack_ipv4 nf_defrag_ipv4 nf_nat_ipv4 xt_addrtype iptable_filter ip_tables xt_conntrack x_tables nf_nat nf_conntrack br_netfilter bridge stp llc overlay ec_sys binfmt_misc nls_iso8859_1 snd_hda_codec_hdmi mxm_wmi arc4 intel_rapl x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel kvm snd_hda_codec_realtek irqbypass snd_hda_codec_generic crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel snd_hda_intel aes_x86_64 lrw gf128mul glue_helper snd_hda_codec ablk_helper cryptd snd_hda_core snd_hwdep snd_pcm snd_seq_midi snd_seq_midi_event snd_rawmidi uvcvideo videobuf2_vmalloc videobuf2_memops videobuf2_v4l2 videobuf2_core rtsx_pci_ms input_leds iwlmvm joydev videodev memstick snd_seq mac80211 media snd_seq_device snd_timer serio_raw iwlwifi cfg80211 mei_me snd soundcore mei shpchp nvidia_uvm(POE) hci_uart btbcm wmi btqca btintel bluetooth intel_lpss_acpi intel_lpss acpi_pad mac_hid parport_pc ppdev lp parport autofs4 btrfs xor raid6_pq dm_mirror dm_region_hash dm_log hid_generic usbhid rtsx_pci_sdmmc nvidia_drm(POE) nvidia_modeset(POE) drm_kms_helper syscopyarea sysfillrect sysimgblt fb_sys_fops drm psmouse nvidia(POE) nvme r8169 nvme_core rtsx_pci ahci mii libahci pinctrl_sunrisepoint i2c_hid video pinctrl_intel hid fjes
Aug 25 21:24:50 oryx kernel: [  372.765089] CPU: 0 PID: 4260 Comm: wpa_supplicant Tainted: P           OE   4.6.4-katox #2
Aug 25 21:24:50 oryx kernel: [  372.765092] Hardware name: System76, Inc. Oryx Pro/Oryx Pro, BIOS 1.05.09RSA1 11/16/2015
Aug 25 21:24:50 oryx kernel: [  372.765096]  0000000000000286 0000000048d9dce9 ffff88084ca93840 ffffffff813ede93
Aug 25 21:24:50 oryx kernel: [  372.765102]  0000000000000000 0000000000000000 ffff88084ca93880 ffffffff8108184b
Aug 25 21:24:50 oryx kernel: [  372.765108]  00000b1fc1034392 ffff88084ca938c0 ffff88084eae4000 ffff880851e90840
Aug 25 21:24:50 oryx kernel: [  372.765113] Call Trace:
Aug 25 21:24:50 oryx kernel: [  372.765124]  [<ffffffff813ede93>] dump_stack+0x63/0x90
Aug 25 21:24:50 oryx kernel: [  372.765131]  [<ffffffff8108184b>] __warn+0xcb/0xf0
Aug 25 21:24:50 oryx kernel: [  372.765137]  [<ffffffff8108197d>] warn_slowpath_null+0x1d/0x20
Aug 25 21:24:50 oryx kernel: [  372.765174]  [<ffffffffc1123cd0>] ieee80211_chandef_downgrade+0x40/0x160 [mac80211]
Aug 25 21:24:50 oryx kernel: [  372.765216]  [<ffffffffc1143595>] ieee80211_tdls_chandef_vht_upgrade+0x105/0x1f0 [mac80211]
Aug 25 21:24:50 oryx kernel: [  372.765254]  [<ffffffffc11442e4>] ieee80211_tdls_build_mgmt_packet_data+0xa44/0x1090 [mac80211]
Aug 25 21:24:50 oryx kernel: [  372.765290]  [<ffffffffc114537f>] ieee80211_tdls_prep_mgmt_packet.constprop.7+0xaf/0x2c0 [mac80211]
Aug 25 21:24:50 oryx kernel: [  372.765322]  [<ffffffffc1145904>] ieee80211_tdls_mgmt+0x2e4/0x4c0 [mac80211]
Aug 25 21:24:50 oryx kernel: [  372.765355]  [<ffffffffc1019942>] nl80211_tdls_mgmt+0x132/0x240 [cfg80211]
Aug 25 21:24:50 oryx kernel: [  372.765364]  [<ffffffff817609bb>] genl_family_rcv_msg+0x1db/0x3b0
Aug 25 21:24:50 oryx kernel: [  372.765371]  [<ffffffff81713cb3>] ? skb_queue_tail+0x43/0x50
Aug 25 21:24:50 oryx kernel: [  372.765377]  [<ffffffff8175d24e>] ? __netlink_sendskb+0x3e/0x60
Aug 25 21:24:50 oryx kernel: [  372.765384]  [<ffffffff81760b90>] ? genl_family_rcv_msg+0x3b0/0x3b0
Aug 25 21:24:50 oryx kernel: [  372.765390]  [<ffffffff81760c06>] genl_rcv_msg+0x76/0xb0
Aug 25 21:24:50 oryx kernel: [  372.765396]  [<ffffffff81760114>] netlink_rcv_skb+0xa4/0xc0
Aug 25 21:24:50 oryx kernel: [  372.765401]  [<ffffffff817607c8>] genl_rcv+0x28/0x40
Aug 25 21:24:50 oryx kernel: [  372.765407]  [<ffffffff8175faf3>] netlink_unicast+0x183/0x230
Aug 25 21:24:50 oryx kernel: [  372.765413]  [<ffffffff8175fe9b>] netlink_sendmsg+0x2fb/0x3a0
Aug 25 21:24:50 oryx kernel: [  372.765421]  [<ffffffff8170ce68>] sock_sendmsg+0x38/0x50
Aug 25 21:24:50 oryx kernel: [  372.765428]  [<ffffffff8170d91e>] ___sys_sendmsg+0x28e/0x2a0
Aug 25 21:24:50 oryx kernel: [  372.765433]  [<ffffffff813f6e92>] ? memzero_explicit+0x12/0x20
Aug 25 21:24:50 oryx kernel: [  372.765440]  [<ffffffff8151c7ec>] ? extract_entropy_user+0x11c/0x1b0
Aug 25 21:24:50 oryx kernel: [  372.765448]  [<ffffffff81236153>] ? dput+0xb3/0x220
Aug 25 21:24:50 oryx kernel: [  372.765454]  [<ffffffff8123f814>] ? mntput+0x24/0x40
Aug 25 21:24:50 oryx kernel: [  372.765460]  [<ffffffff812204e3>] ? __fput+0x193/0x230
Aug 25 21:24:50 oryx kernel: [  372.765464]  [<ffffffff8170e274>] __sys_sendmsg+0x54/0x90
Aug 25 21:24:50 oryx kernel: [  372.765469]  [<ffffffff8170e2c2>] SyS_sendmsg+0x12/0x20
Aug 25 21:24:50 oryx kernel: [  372.765477]  [<ffffffff818385b6>] entry_SYSCALL_64_fastpath+0x1e/0xa8
Aug 25 21:24:50 oryx kernel: [  372.765481] ---[ end trace e487d8b0f5c0eff6 ]---
Comment 1 Luca Coelho 2016-08-29 11:24:41 UTC
Arik has found the problem and has implemented a fix for it, which we will soon send out.

Assigning to Johannes so he can close it when the fix is applied.

Note You need to log in before you can comment on or make changes to this bug.