Bug 195433 - CPU softlocks frequently in recent kernel versions
Summary: CPU softlocks frequently in recent kernel versions
Status: NEW
Alias: None
Product: Other
Classification: Unclassified
Component: Other (show other bugs)
Hardware: x86-64 Linux
: P1 normal
Assignee: other_other
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2017-04-13 22:31 UTC by sworddragon2
Modified: 2017-05-11 18:34 UTC (History)
1 user (show)

See Also:
Kernel Version: 4.10.8
Subsystem:
Regression: No
Bisected commit-id:


Attachments

Description sworddragon2 2017-04-13 22:31:49 UTC
Around 1-2 weeks ago I noticed that my system became unstable resulting that applications couldn't be started anymore but I was still able to move the mouse cursor so it was not a full system freeze.

Today this issue happened again and only partly processes froze (for example I could still open a terminal with top but not htop and closing processes worked not very well) so I was able to make a look at dmesg. Mainly this message got spammed several times:


[32973.433769] ------------[ cut here ]------------
[32973.433775] kernel BUG at /build/linux-Fk60NP/linux-4.10.0/include/linux/swapops.h:129!
[32973.433777] invalid opcode: 0000 [#1] SMP
[32973.433778] Modules linked in: snd_seq_dummy snd_seq snd_seq_device pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack nf_conntrack ntfs nls_utf8 isofs dm_crypt iptable_filter snd_hda_codec_hdmi snd_hda_codec_realtek snd_hda_codec_generic input_leds joydev ppdev snd_hda_intel snd_hda_codec snd_hda_core snd_hwdep edac_mce_amd edac_core snd_pcm kvm_amd shpchp kvm snd_timer irqbypass snd soundcore mac_hid serio_raw nuvoton_cir k10temp rc_core wmi parport_pc i2c_piix4 nvidia_uvm(POE) binfmt_misc lp parport ip_tables x_tables autofs4 pata_acpi hid_generic nvidia_drm(POE) nvidia_modeset(POE) hid_cherry nvidia(POE) usbhid hid drm_kms_helper syscopyarea sysfillrect uas sysimgblt fb_sys_fops usb_storage psmouse ahci drm pata_atiixp r8169 libahci mii
[32973.433804]  fjes
[32973.433807] CPU: 1 PID: 1932 Comm: JS Helper Tainted: P           OE   4.10.0-19-generic #21-Ubuntu
[32973.433809] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./960GM/U3S3 FX, BIOS P1.50 07/23/2015
[32973.433811] task: ffff9976b0db0000 task.stack: ffffba190526c000
[32973.433816] RIP: 0010:__migration_entry_wait+0x16a/0x180
[32973.433818] RSP: 0000:ffffba190526fd68 EFLAGS: 00010246
[32973.433819] RAX: 000fffffc0048078 RBX: fffff99e087fc1f0 RCX: fffff99e087fc1f0
[32973.433820] RDX: 0000000000000001 RSI: ffff99761ff07800 RDI: fffff99e01354000
[32973.433822] RBP: ffffba190526fd80 R08: ffff99781c83e4c0 R09: ffff99781c83e4c0
[32973.433823] R10: 00007f203a0000c8 R11: 0000000000000206 R12: fffff99e01354000
[32973.433824] R13: 3e0000000004d500 R14: ffffba190526fe30 R15: ffff99769bb40ed8
[32973.433825] FS:  00007f20353fc700(0000) GS:ffff997837c40000(0000) knlGS:0000000000000000
[32973.433827] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[32973.433828] CR2: 00007f2019900018 CR3: 00000002b0f59000 CR4: 00000000000006e0
[32973.433829] Call Trace:
[32973.433833]  migration_entry_wait+0x74/0x80
[32973.433836]  do_swap_page+0x5b3/0x770
[32973.433838]  handle_mm_fault+0x873/0x1360
[32973.433841]  __do_page_fault+0x23e/0x4e0
[32973.433843]  do_page_fault+0x22/0x30
[32973.433846]  page_fault+0x28/0x30
[32973.433847] RIP: 0033:0x5594fe478d78
[32973.433848] RSP: 002b:00007f20353fb880 EFLAGS: 00010203
[32973.433849] RAX: 0000000000000000 RBX: 00007f203a000040 RCX: 00007f2046783137
[32973.433850] RDX: 0000000000000008 RSI: 0000000000006000 RDI: 00007f2019902000
[32973.433851] RBP: 00007f2019900000 R08: 00007f203a0000c8 R09: 0000000000000000
[32973.433852] R10: 00007f203a0000c8 R11: 0000000000000206 R12: 0000000000000080
[32973.433853] R13: 00007f203a0000c8 R14: 0000000000000001 R15: 0000000000000006
[32973.433855] Code: ff ff ff 4c 89 e7 e8 96 a2 f8 ff e9 3c ff ff ff 85 d2 0f 84 2a ff ff ff 8d 4a 01 89 d0 f0 41 0f b1 4d 00 39 d0 74 81 89 c2 eb e5 <0f> 0b 4c 89 e7 e8 0c fb f9 ff eb b8 4c 8d 60 ff 4c 8d 68 1b eb 
[32973.433888] RIP: __migration_entry_wait+0x16a/0x180 RSP: ffffba190526fd68
[32973.433890] ---[ end trace 3a7500656316e758 ]---


Also sometimes I have seen additionally these 2 messages:

[33027.919262] NMI watchdog: BUG: soft lockup - CPU#4 stuck for 22s! [JS Helper:1937]

and

[33033.447885] INFO: rcu_sched self-detected stall on CPU
[33033.447891] 	4-...: (15000 ticks this GP) idle=a53/140000000000001/0 softirq=224363/224363 fqs=7500 
[33033.447892] 	 (t=15000 jiffies g=100739 c=100738 q=632)
[33033.447895] Task dump for CPU 4:
[33033.447896] JS Helper       R  running task        0  1937   1827 0x00000008

Note You need to log in before you can comment on or make changes to this bug.