Bug 46831 - Kernel Panics During High Workload
Summary: Kernel Panics During High Workload
Status: CLOSED CODE_FIX
Alias: None
Product: ACPI
Classification: Unclassified
Component: Power-Thermal (show other bugs)
Hardware: All Linux
: P1 normal
Assignee: Zhang Rui
URL:
Keywords:
: 50681 (view as bug list)
Depends on:
Blocks:
 
Reported: 2012-09-02 00:36 UTC by Mike Lothian
Modified: 2013-03-11 03:16 UTC (History)
3 users (show)

See Also:
Kernel Version: 3.6.0-rc3
Subsystem:
Regression: Yes
Bisected commit-id:


Attachments
Kernel Config (72.26 KB, text/plain)
2012-09-02 00:36 UTC, Mike Lothian
Details
Extract from /var/log/messages (45.13 KB, text/plain)
2012-09-02 00:45 UTC, Mike Lothian
Details
Picture of Panic (635.55 KB, image/jpeg)
2012-09-02 01:01 UTC, Mike Lothian
Details
Dmesg from SandyBridge Laptop (83.44 KB, text/plain)
2012-09-02 01:09 UTC, Mike Lothian
Details
Newest panic (845.23 KB, image/jpeg)
2012-11-22 19:53 UTC, Mike Lothian
Details

Description Mike Lothian 2012-09-02 00:36:53 UTC
Created attachment 78991 [details]
Kernel Config

My machine panics when it has a high workload - usually compiling

I managed to photograph the panic and I'll attach it, the .config and info from my logs

I'm not sure if I've assigned the bug to the right group

This bug is reproducable - each time I've tried to recompile icedtea
Comment 1 Mike Lothian 2012-09-02 00:45:43 UTC
Created attachment 79001 [details]
Extract from /var/log/messages
Comment 2 Mike Lothian 2012-09-02 01:01:44 UTC
Created attachment 79011 [details]
Picture of Panic

Sorry I don't have a text version of this
Comment 3 Mike Lothian 2012-09-02 01:03:57 UTC
I've been noticing this since the start of the 3.6 serious but was unable to reliably bisect the issue.
Comment 4 Mike Lothian 2012-09-02 01:09:20 UTC
Created attachment 79021 [details]
Dmesg from SandyBridge Laptop
Comment 5 Zhang Rui 2012-11-13 06:27:26 UTC
does the problem still exist in the latest upstream kernel?
Comment 6 Mike Lothian 2012-11-13 17:37:43 UTC
I've just tried running 9924a1992a86ebdb7ca36ef790d2ba0da506296c and so far it hasn't failed (I tired compiling libreoffice and icedtea whilst watching a video)

I've been mostly running 3.6.0-rc3 from git://people.freedesktop.org/~airlied/linux so my discreet card gets switched off when not in use

I'll keep using the latest from master to see if it's truly resolved

Would you like me to see figure out which rc's this bug was present in?
Comment 7 Zhang Rui 2012-11-14 02:03:33 UTC
(In reply to comment #6)
> I've just tried running 9924a1992a86ebdb7ca36ef790d2ba0da506296c and so far
> it
> hasn't failed (I tired compiling libreoffice and icedtea whilst watching a
> video)
> 
> I've been mostly running 3.6.0-rc3 from
> git://people.freedesktop.org/~airlied/linux so my discreet card gets switched
> off when not in use
> 
> I'll keep using the latest from master to see if it's truly resolved
> 
> Would you like me to see figure out which rc's this bug was present in?

No, if the bug is fixed in the latest upstream kernel.
Yes, if the problem still exists. :)
Comment 8 Zhang Rui 2012-11-21 01:14:07 UTC
ping...
Comment 9 Mike Lothian 2012-11-22 19:40:42 UTC
Damn I thought it was fixed but it happened again just now

3.7.0-rc6-tip+ 3587b1b097d70c2eb9fee95ea7995d13c05f66e5

Was the running kernel and the SHA of the first commit in the log

I have pictures of the panic which I'll attach now - I'd like to point out this is the first panic I've had since switching to the newer kernels so something has changed to make it less frequent
Comment 10 Mike Lothian 2012-11-22 19:53:29 UTC
Created attachment 87061 [details]
Newest panic

This is the latest panic - I have a less clearer one the goes right to the right edge of the screen if you need any of the longer lines
Comment 11 Zhang Rui 2012-11-23 07:14:20 UTC
(In reply to comment #3)
> I've been noticing this since the start of the 3.6 serious but was unable to
> reliably bisect the issue.

can you reproduce it with 3.6 kernel?
Comment 12 Mike Lothian 2012-11-23 08:27:18 UTC
3.6 final or the latest point release?
Comment 13 Zhang Rui 2012-11-23 08:36:25 UTC
(In reply to comment #3)
> I've been noticing this since the start of the 3.6 serious but was unable to
> reliably bisect the issue.

I asked just to verify that which release that you did not notice the problem.
Comment 14 Mike Lothian 2012-11-23 08:37:32 UTC
Then yes it was first noticed under a 3.6 rc
Comment 15 Zhang Rui 2012-11-28 16:09:40 UTC
does the problem exists in 3.5?
Comment 16 Mike Lothian 2012-11-28 18:06:28 UTC
No I'm quite sure the issue only appeared in the 3.6 merge window - unfortunately due to its infrequent nature I wasn't able to bisect it down to the problem commit. The idea of compiling icedtea between each bisect is a bit off putting even with an i7 - but I could give it a go this weekend if you think it'll help
Comment 17 Zhang Rui 2012-11-28 18:16:48 UTC
*** Bug 50681 has been marked as a duplicate of this bug. ***
Comment 18 Zhang Rui 2013-03-08 06:04:41 UTC
hi, Mike,

can you help me check if the problem still exists in 3.9-rc1 please?
Comment 19 Zhang Rui 2013-03-08 06:08:04 UTC
you can try to reproduce this by setting the cur_state of the processor cooling device to the max_state.
Comment 20 Mike Lothian 2013-03-11 02:37:34 UTC
Hi Zhang

I did encounter the pstates issue in rc1 (now fixed of course) but I've not had the original issue in a while

I'm quite happy for this bug to be closed

Note You need to log in before you can comment on or make changes to this bug.