Bug 108641 - [REGRESSION] crash on boot: ksoftirqd Fatal exception in interrupt, pids_uncharge.constprop, pids_free, cgroup_free
Summary: [REGRESSION] crash on boot: ksoftirqd Fatal exception in interrupt, pids_unch...
Status: RESOLVED UNREPRODUCIBLE
Alias: None
Product: Process Management
Classification: Unclassified
Component: Scheduler (show other bugs)
Hardware: Intel Linux
: P1 high
Assignee: Ingo Molnar
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2015-11-30 17:14 UTC by Martin Steigerwald
Modified: 2015-12-14 12:27 UTC (History)
0 users

See Also:
Kernel Version: 4.4-rc2
Subsystem:
Regression: No
Bisected commit-id:


Attachments
config of affected 4.4-rc2 kernel (29.29 KB, application/x-xz)
2015-11-30 17:14 UTC, Martin Steigerwald
Details
config of non affected 4.3 kernel (29.09 KB, text/plain)
2015-11-30 17:16 UTC, Martin Steigerwald
Details
screenshot of backtrace during crash on boot #1 (1.44 MB, image/jpeg)
2015-11-30 17:17 UTC, Martin Steigerwald
Details
screenshot of backtrace during crash on boot #2 (1.53 MB, image/jpeg)
2015-11-30 17:18 UTC, Martin Steigerwald
Details

Description Martin Steigerwald 2015-11-30 17:14:23 UTC
Created attachment 196111 [details]
config of affected 4.4-rc2 kernel

I think this may more be related to control groups, but I didn´t find them product list.

Crash happens on ThinkPad T520 on boot since 4.4-rc2. With 4.4-rc2 I have been able to boot once and use this boot session via hibernation for some days. 4.4-rc3 didn´t boot at all in more than 5 attempts. This crash does not happen with 4.3.

martin@merkaba:~> phoronix-test-suite system-info

Phoronix Test Suite v5.2.1
System Information

Hardware:
Processor: Intel Core i5-2520M @ 3.20GHz (4 Cores), Motherboard: LENOVO 42433WG, Chipset: Intel 2nd Generation Core Family DRAM, Memory: 16384MB, Disk: 300GB INTEL SSDSA2CW30 + 480GB Crucial_CT480M50, Graphics: Intel HD 3000 (1300MHz), Audio: Conexant CX20590, Network: Intel 82579LM Gigabit Connection + Intel Centrino Advanced-N 6205

Software:
OS: Debian unstable, Kernel: 4.3.0-tp520-btrfstrim+ (x86_64), Desktop: KDE Frameworks 5, Display Server: X Server 1.17.3, Display Driver: intel 2.99.917, OpenGL: 3.3 Mesa 11.0.6, Compiler: GCC 5.2.1 20151125, File-System: btrfs, Screen Resolution: 1920x1080

I will attach some I bet incomplete screenshots of the traces and the kernel configuration in use. I will also attach configuration of 4.3 kernel which doesn´t show this crash.

This is mainline kernel with a little fix for BTRFS SSD trimming.

martin@merkaba:~> cat /proc/version
Linux version 4.3.0-tp520-btrfstrim+ (martin@merkaba) (gcc version 5.2.1 20151028 (Debian 5.2.1-23) ) #43 SMP PREEMPT Mon Nov 2 10:43:19 CET 2015

I do not want to spent the time and risk of bisecting this issue on this production machine. I am open for other suggestions to pin-point the cause.

I mentioned this already on LKML:

Subject	[4.4-rc2] regresstion: ksoftirqd/2 Fatal exception in interrupt pids.freepids.uncharge.constprop
Date	Sat, 28 Nov 2015 12:38:46 +0100

https://lkml.org/lkml/2015/11/28/36

Thanks, Martin
Comment 1 Martin Steigerwald 2015-11-30 17:16:58 UTC
Created attachment 196121 [details]
config of non affected 4.3 kernel
Comment 2 Martin Steigerwald 2015-11-30 17:17:53 UTC
Created attachment 196131 [details]
screenshot of backtrace during crash on boot #1
Comment 3 Martin Steigerwald 2015-11-30 17:18:11 UTC
Created attachment 196141 [details]
screenshot of backtrace during crash on boot #2
Comment 4 Martin Steigerwald 2015-12-14 12:27:10 UTC
This seems to be gone with 4.4-rc5. Thanks, Martin

Note You need to log in before you can comment on or make changes to this bug.