Bug 13190 - 2.6.28 thermal shutdown - Compaq N600c
Summary: 2.6.28 thermal shutdown - Compaq N600c
Status: CLOSED UNREPRODUCIBLE
Alias: None
Product: ACPI
Classification: Unclassified
Component: Power-Fan (show other bugs)
Hardware: All Linux
: P1 normal
Assignee: Zhang Rui
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2009-04-26 10:55 UTC by Tarek Loubani
Modified: 2009-05-27 02:09 UTC (History)
2 users (show)

See Also:
Kernel Version: 2.6.28
Subsystem:
Regression: Yes
Bisected commit-id:


Attachments
ACPI Dump from n600c (129.74 KB, application/octet-stream)
2009-04-27 01:36 UTC, Tarek Loubani
Details
"grep . /proc/acpi/thermal_zone/TZ1/*" in 2.6.28 (693 bytes, application/octet-stream)
2009-04-27 12:22 UTC, Tarek Loubani
Details
dmesg from 2.6.28 after fresh boot (38.45 KB, application/octet-stream)
2009-04-27 12:31 UTC, Tarek Loubani
Details

Description Tarek Loubani 2009-04-26 10:55:45 UTC
I have basically never had heating issues with my Compaq Evo N600c until a recent upgrade to Ubuntu's Jaunty 2.6.28. Now, the computer appears to overheat and shut down regularly.

In doing some checking, these are the assigned trip points:

cat /proc/acpi/thermal_zone/TZ1/trip_points 
critical (S5):           108 C
passive:                 98 C: tc1=1 tc2=2 tsp=100 devices=C000 
active[0]:               80 C: devices=C1E1 
active[1]:               70 C: devices=C1E2 
active[2]:               55 C: devices=C1E3


Which seem quite high. I cannot change the trip points manually, and have played with the DSDT, though it is far above my ability.

I know this is an incomplete bug report, so maybe the next question is: What else do I need to make it more complete?

tarek : )
Comment 1 Zhang Rui 2009-04-27 01:13:22 UTC
please attach the acpidump output using the latest pmtools here:
http://www.lesswatts.org/projects/acpi/utilities.php

please attach the content of "grep . /proc/acpi/thermal_zone/TZ1/*" both in 2.6.28 and in the non-overheating kernel.
Comment 2 Tarek Loubani 2009-04-27 01:36:47 UTC
Created attachment 21129 [details]
ACPI Dump from n600c

Hello,

Thank you for your prompt reply. Here is the ACPI dump from the N600c
Comment 3 Zhang Rui 2009-04-27 07:44:31 UTC
>> Name (_CRT, 0x0EE4)

From the acpidump output, we can see that the critical trip point is hardcoded to 108C, which means that it's the same in the kernel which used to work.
Plus, a critical trip point of 108C is also normal in other laptops.

this is rather a thermal management regression than a trip point bug to me.
can you make sure that
1. the overheating problem goes away if you switch back to the old kernel?
2. the laptop is hotter when you are running 2.6.28?

Plus, please attach the full dmesg output and the result of "grep . /proc/acpi/thermal_zone/TZ1/*" both in 2.6.28 and in the non-overheating kernel.
Comment 4 Tarek Loubani 2009-04-27 12:22:54 UTC
Created attachment 21134 [details]
"grep . /proc/acpi/thermal_zone/TZ1/*" in 2.6.28

Here is the output from 2.6.28. I shall go back to the old kernel shortly to get all the information from there as well. As well, I'll be tracking the logs to see if there are any contributory messages and post them here.

tarek : )
Comment 5 Tarek Loubani 2009-04-27 12:31:07 UTC
Created attachment 21135 [details]
dmesg from 2.6.28 after fresh boot

dmesg output immediately after boot.
Comment 6 Len Brown 2009-04-28 01:30:58 UTC
What is the latest version of the kernel that works properly?

Can you reproduce this failure using a kernel.org kernel?

upon the faiure, what is the reported temperature.
Is the reading jumping around?

you can monitor it with 
watch grep . /proc/acpi/thermal_zone/*/* /proc/acpi/fan/*/*

also, when the temperature exceeds the active trip points
in the output above, do the fans run (you should be able to
see "on" in the output above, and hear them as well)

finally, for testing, you can disable the actual shutdown
by booting with thermal.nocrt=1
which will still print out, but not shutdown on CRT trip.
Comment 7 Zhang Rui 2009-05-05 02:09:53 UTC
ping tarek.
Comment 8 Shaohua 2009-05-06 01:54:20 UTC
did you feel the system really heat or just the ACPI report the temperature is high?
Comment 9 Zhang Rui 2009-05-11 06:34:57 UTC
ping Tarek...
Comment 10 Tarek Loubani 2009-05-11 10:17:38 UTC
Apologies. Things have been a bit hectic on this end. I'll put some updated information here later today.

tarek : )
Comment 11 Zhang Rui 2009-05-21 08:09:06 UTC
ping tarek again. :)
Comment 12 Len Brown 2009-05-27 02:09:32 UTC
please re-open if this issue is reproducible.

Note You need to log in before you can comment on or make changes to this bug.