Bug 15897 - Dell Studio 15 overheats
Summary: Dell Studio 15 overheats
Status: CLOSED DOCUMENTED
Alias: None
Product: ACPI
Classification: Unclassified
Component: Power-Fan (show other bugs)
Hardware: All Linux
: P1 normal
Assignee: Lan Tianyu
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2010-05-02 18:29 UTC by Matteo Croce
Modified: 2013-07-12 03:12 UTC (History)
5 users (show)

See Also:
Kernel Version: 3.2.0
Subsystem:
Regression: No
Bisected commit-id:


Attachments
syslog (37.36 KB, text/plain)
2010-05-02 18:29 UTC, Matteo Croce
Details
syslog (37.36 KB, text/plain)
2010-05-02 18:31 UTC, Matteo Croce
Details
acpidump (209.64 KB, text/plain)
2010-06-08 08:22 UTC, Matteo Croce
Details

Description Matteo Croce 2010-05-02 18:29:15 UTC
Hi,

I have a Dell Studio 15 Notebook with a Core i7 CPU.
It works well but when I do CPU intensive tasks the notebook reaches 100°C and shuts down.
I tried a windows tool (cpuburn) which keeps the CPUs busy to test the system stability and the maximum temèerature I get was 94°C.
I noticed that uner Windows the fan were more noisy so I guess that they wer spinning faster.
That's what acpi says:

$ acpi -V
Battery 0: Charging, 33%, 01:16:23 until charged
Battery 0: design capacity 662 mAh, last full capacity 621 mAh = 93%
Adapter 0: on-line
Thermal 0: active, 72.0 degrees C
Thermal 0: trip point 0 switches to mode critical at temperature 85.0 degrees C
Thermal 0: trip point 1 switches to mode passive at temperature 95.0 degrees C
Thermal 0: trip point 2 switches to mode active at temperature 71.0 degrees C
Thermal 0: trip point 3 switches to mode active at temperature 55.0 degrees C
Thermal 1: ok, 26.8 degrees C
Thermal 1: trip point 0 switches to mode critical at temperature 127.0 degrees C
Cooling 0: Fan 0 of 1
Cooling 1: Fan 0 of 1
Cooling 2: Processor 0 of 10
Cooling 3: Processor 0 of 10
Cooling 4: Processor 0 of 10
Cooling 5: Processor 0 of 10
Cooling 6: Processor 0 of 10
Cooling 7: Processor 0 of 10
Cooling 8: Processor 0 of 10
Cooling 9: Processor 0 of 10
Comment 1 Matteo Croce 2010-05-02 18:29:53 UTC
Created attachment 26197 [details]
syslog
Comment 2 Matteo Croce 2010-05-02 18:31:07 UTC
Created attachment 26198 [details]
syslog
Comment 3 Zhang Rui 2010-05-10 07:14:15 UTC
so the fan is still off when in active mode?
please attach the output of "grep /sys/class/thermal/*/*"
Comment 4 Zhang Rui 2010-05-19 06:45:55 UTC
ping ...
Comment 5 Matteo Croce 2010-05-19 13:48:44 UTC
# ll /sys/class/thermal/
totale 0
lrwxrwxrwx 1 root root 0 2010-05-19 14:53 cooling_device0 -> ../../devices/virtual/thermal/cooling_device0
lrwxrwxrwx 1 root root 0 2010-05-19 14:53 cooling_device1 -> ../../devices/virtual/thermal/cooling_device1
lrwxrwxrwx 1 root root 0 2010-05-19 14:53 cooling_device2 -> ../../devices/virtual/thermal/cooling_device2
lrwxrwxrwx 1 root root 0 2010-05-19 14:53 cooling_device3 -> ../../devices/virtual/thermal/cooling_device3
lrwxrwxrwx 1 root root 0 2010-05-19 14:53 cooling_device4 -> ../../devices/virtual/thermal/cooling_device4
lrwxrwxrwx 1 root root 0 2010-05-19 14:53 cooling_device5 -> ../../devices/virtual/thermal/cooling_device5
lrwxrwxrwx 1 root root 0 2010-05-19 14:53 cooling_device6 -> ../../devices/virtual/thermal/cooling_device6
lrwxrwxrwx 1 root root 0 2010-05-19 14:53 cooling_device7 -> ../../devices/virtual/thermal/cooling_device7
lrwxrwxrwx 1 root root 0 2010-05-19 14:53 cooling_device8 -> ../../devices/virtual/thermal/cooling_device8
lrwxrwxrwx 1 root root 0 2010-05-19 14:53 cooling_device9 -> ../../devices/virtual/thermal/cooling_device9
lrwxrwxrwx 1 root root 0 2010-05-19 14:53 thermal_zone0 -> ../../devices/virtual/thermal/thermal_zone0
lrwxrwxrwx 1 root root 0 2010-05-19 14:53 thermal_zone1 -> ../../devices/virtual/thermal/thermal_zone1
# grep active /sys/class/thermal/*/*
/sys/class/thermal/thermal_zone1/trip_point_2_type:active
/sys/class/thermal/thermal_zone1/trip_point_3_type:active
Comment 6 Matteo Croce 2010-05-24 15:59:22 UTC
please remove NEEDINFO now
Comment 7 Zhang Rui 2010-05-25 02:56:17 UTC
sorry, please attach the output of "grep . /sys/class/thermal/*/*".
Comment 8 Matteo Croce 2010-05-25 09:16:52 UTC
/sys/class/thermal/cooling_device0/cur_state:0
/sys/class/thermal/cooling_device0/max_state:10
/sys/class/thermal/cooling_device0/type:Processor
/sys/class/thermal/cooling_device1/cur_state:0
/sys/class/thermal/cooling_device1/max_state:10
/sys/class/thermal/cooling_device1/type:Processor
/sys/class/thermal/cooling_device2/cur_state:0
/sys/class/thermal/cooling_device2/max_state:10
/sys/class/thermal/cooling_device2/type:Processor
/sys/class/thermal/cooling_device3/cur_state:0
/sys/class/thermal/cooling_device3/max_state:10
/sys/class/thermal/cooling_device3/type:Processor
/sys/class/thermal/cooling_device4/cur_state:0
/sys/class/thermal/cooling_device4/max_state:10
/sys/class/thermal/cooling_device4/type:Processor
/sys/class/thermal/cooling_device5/cur_state:0
/sys/class/thermal/cooling_device5/max_state:10
/sys/class/thermal/cooling_device5/type:Processor
/sys/class/thermal/cooling_device6/cur_state:0
/sys/class/thermal/cooling_device6/max_state:10
/sys/class/thermal/cooling_device6/type:Processor
/sys/class/thermal/cooling_device7/cur_state:0
/sys/class/thermal/cooling_device7/max_state:10
/sys/class/thermal/cooling_device7/type:Processor
/sys/class/thermal/cooling_device8/cur_state:0
/sys/class/thermal/cooling_device8/max_state:1
/sys/class/thermal/cooling_device8/type:Fan
/sys/class/thermal/cooling_device9/cur_state:0
/sys/class/thermal/cooling_device9/max_state:1
/sys/class/thermal/cooling_device9/type:Fan
/sys/class/thermal/thermal_zone0/mode:enabled
/sys/class/thermal/thermal_zone0/passive:0
/sys/class/thermal/thermal_zone0/temp:26800
/sys/class/thermal/thermal_zone0/trip_point_0_temp:127000
/sys/class/thermal/thermal_zone0/trip_point_0_type:critical
/sys/class/thermal/thermal_zone0/type:acpitz
/sys/class/thermal/thermal_zone1/cdev0_trip_point:1
/sys/class/thermal/thermal_zone1/cdev1_trip_point:1
/sys/class/thermal/thermal_zone1/cdev2_trip_point:1
/sys/class/thermal/thermal_zone1/cdev3_trip_point:1
/sys/class/thermal/thermal_zone1/cdev4_trip_point:1
/sys/class/thermal/thermal_zone1/cdev5_trip_point:1
/sys/class/thermal/thermal_zone1/cdev6_trip_point:1
/sys/class/thermal/thermal_zone1/cdev7_trip_point:1
/sys/class/thermal/thermal_zone1/cdev8_trip_point:3
/sys/class/thermal/thermal_zone1/cdev9_trip_point:2
/sys/class/thermal/thermal_zone1/mode:enabled
/sys/class/thermal/thermal_zone1/temp:63000
/sys/class/thermal/thermal_zone1/trip_point_0_temp:85000
/sys/class/thermal/thermal_zone1/trip_point_0_type:critical
/sys/class/thermal/thermal_zone1/trip_point_1_temp:95000
/sys/class/thermal/thermal_zone1/trip_point_1_type:passive
/sys/class/thermal/thermal_zone1/trip_point_2_temp:71000
/sys/class/thermal/thermal_zone1/trip_point_2_type:active
/sys/class/thermal/thermal_zone1/trip_point_3_temp:55000
/sys/class/thermal/thermal_zone1/trip_point_3_type:active
/sys/class/thermal/thermal_zone1/type:acpitz
Comment 9 Matteo Croce 2010-06-05 02:16:53 UTC
linux 2.6.35-rc1 with intel-cpuidle:

/sys/class/thermal/cooling_device0/cur_state:0
/sys/class/thermal/cooling_device0/max_state:10
/sys/class/thermal/cooling_device0/type:Processor
/sys/class/thermal/cooling_device1/cur_state:0
/sys/class/thermal/cooling_device1/max_state:10
/sys/class/thermal/cooling_device1/type:Processor
/sys/class/thermal/cooling_device2/cur_state:0
/sys/class/thermal/cooling_device2/max_state:10
/sys/class/thermal/cooling_device2/type:Processor
/sys/class/thermal/cooling_device3/cur_state:0
/sys/class/thermal/cooling_device3/max_state:10
/sys/class/thermal/cooling_device3/type:Processor
/sys/class/thermal/cooling_device4/cur_state:0
/sys/class/thermal/cooling_device4/max_state:10
/sys/class/thermal/cooling_device4/type:Processor
/sys/class/thermal/cooling_device5/cur_state:0
/sys/class/thermal/cooling_device5/max_state:10
/sys/class/thermal/cooling_device5/type:Processor
/sys/class/thermal/cooling_device6/cur_state:0
/sys/class/thermal/cooling_device6/max_state:10
/sys/class/thermal/cooling_device6/type:Processor
/sys/class/thermal/cooling_device7/cur_state:0
/sys/class/thermal/cooling_device7/max_state:10
/sys/class/thermal/cooling_device7/type:Processor
/sys/class/thermal/cooling_device8/cur_state:0
/sys/class/thermal/cooling_device8/max_state:1
/sys/class/thermal/cooling_device8/type:Fan
/sys/class/thermal/cooling_device9/cur_state:0
/sys/class/thermal/cooling_device9/max_state:1
/sys/class/thermal/cooling_device9/type:Fan
/sys/class/thermal/thermal_zone0/mode:enabled
/sys/class/thermal/thermal_zone0/passive:0
/sys/class/thermal/thermal_zone0/temp:26800
/sys/class/thermal/thermal_zone0/trip_point_0_temp:127000
/sys/class/thermal/thermal_zone0/trip_point_0_type:critical
/sys/class/thermal/thermal_zone0/type:acpitz
/sys/class/thermal/thermal_zone1/cdev0_trip_point:1
/sys/class/thermal/thermal_zone1/cdev1_trip_point:1
/sys/class/thermal/thermal_zone1/cdev2_trip_point:1
/sys/class/thermal/thermal_zone1/cdev3_trip_point:1
/sys/class/thermal/thermal_zone1/cdev4_trip_point:1
/sys/class/thermal/thermal_zone1/cdev5_trip_point:1
/sys/class/thermal/thermal_zone1/cdev6_trip_point:1
/sys/class/thermal/thermal_zone1/cdev7_trip_point:1
/sys/class/thermal/thermal_zone1/cdev8_trip_point:3
/sys/class/thermal/thermal_zone1/cdev9_trip_point:2
/sys/class/thermal/thermal_zone1/mode:enabled
/sys/class/thermal/thermal_zone1/temp:66000
/sys/class/thermal/thermal_zone1/trip_point_0_temp:85000
/sys/class/thermal/thermal_zone1/trip_point_0_type:critical
/sys/class/thermal/thermal_zone1/trip_point_1_temp:95000
/sys/class/thermal/thermal_zone1/trip_point_1_type:passive
/sys/class/thermal/thermal_zone1/trip_point_2_temp:71000
/sys/class/thermal/thermal_zone1/trip_point_2_type:active
/sys/class/thermal/thermal_zone1/trip_point_3_temp:55000
/sys/class/thermal/thermal_zone1/trip_point_3_type:active
/sys/class/thermal/thermal_zone1/type:acpitz
Comment 10 Matteo Croce 2010-06-06 14:22:26 UTC
Machine check events logged
root@raver:~# mcelog 
mcelog: Unsupported new Family 6 Model 1e CPU: only decoding architectural errors
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
MCE 0
CPU 5 THERMAL EVENT TSC 54097c96cb8 
TIME 1275833991 Sun Jun  6 16:19:51 2010
Processor 5 heated above trip temperature. Throttling enabled.
Please check your system cooling. Performance will be impacted
STATUS 880003c3 MCGSTATUS 0
MCGCAP 1c09 APICID 3 SOCKETID 0 
CPUID Vendor Intel Family 6 Model 30
mcelog: Unsupported new Family 6 Model 1e CPU: only decoding architectural errors
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
MCE 1
CPU 1 THERMAL EVENT TSC 54097c970f4 
TIME 1275833991 Sun Jun  6 16:19:51 2010
Processor 1 heated above trip temperature. Throttling enabled.
Please check your system cooling. Performance will be impacted
STATUS 880003c3 MCGSTATUS 0
MCGCAP 1c09 APICID 2 SOCKETID 0 
CPUID Vendor Intel Family 6 Model 30
mcelog: Unsupported new Family 6 Model 1e CPU: only decoding architectural errors
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
MCE 2
CPU 1 THERMAL EVENT TSC 54097e3143c 
TIME 1275833991 Sun Jun  6 16:19:51 2010
Processor 1 below trip temperature. Throttling disabled
STATUS 88010282 MCGSTATUS 0
MCGCAP 1c09 APICID 2 SOCKETID 0 
CPUID Vendor Intel Family 6 Model 30
mcelog: Unsupported new Family 6 Model 1e CPU: only decoding architectural errors
HARDWARE ERROR. This is *NOT* a software problem!
Please contact your hardware vendor
MCE 3
CPU 5 THERMAL EVENT TSC 54097e31790 
TIME 1275833991 Sun Jun  6 16:19:51 2010
Processor 5 below trip temperature. Throttling disabled
STATUS 88010282 MCGSTATUS 0
MCGCAP 1c09 APICID 3 SOCKETID 0 
CPUID Vendor Intel Family 6 Model 30
Comment 11 Zhang Rui 2010-06-07 06:32:23 UTC
So the fan is still off even if the temperature is higher than the active trip point.
can you change the fan state manually by writing 3 or 0 to /proc/acpi/fan/*/state?

BTW: please attach the acpidump output of this machine.
Comment 12 Matteo Croce 2010-06-08 08:21:28 UTC
/proc/acpi/fan doesn't exists, maybe is in /sys now?
Is ths the same?
echo 0 >/sys/class/thermal/cooling_device9/cur_state
Comment 13 Matteo Croce 2010-06-08 08:22:14 UTC
Created attachment 26698 [details]
acpidump
Comment 14 Zhang Rui 2010-06-08 08:25:50 UTC
you can run "echo 1 >/sys/class/thermal/cooling_deviceX/cur_state" to turn on the fan.
Comment 15 Zhang Rui 2010-06-24 08:16:51 UTC
ping...
Comment 16 Matteo Croce 2010-06-24 09:59:54 UTC
it has no effect.

BTW the fan actually does spin, just it spin slow, even if the system is running at ~90 C
Comment 17 Matteo Croce 2010-08-30 00:27:48 UTC
There is a WinPhlash, a Windows BIOS flashing tool, which spins the fans at maximum speed while flashing maybe to avoid overheating.
Can we reverse engineer it?
Comment 18 Zhang Rui 2010-09-27 01:09:40 UTC
(In reply to comment #16)

> BTW the fan actually does spin, just it spin slow, even if the system is
> running at ~90 C

how many acpi fan devices do you have?
and does the fan still spin slow after set the cur_state of all these fan cooling devices to 1?
Comment 19 Matteo Croce 2010-09-27 01:47:35 UTC
yes

# grep . cooling_device*/type
cooling_device0/type:Processor
cooling_device1/type:Processor
cooling_device2/type:Processor
cooling_device3/type:Processor
cooling_device4/type:Processor
cooling_device5/type:Processor
cooling_device6/type:Processor
cooling_device7/type:Processor
cooling_device8/type:Fan
cooling_device9/type:Fan

# for i in /sys/class/thermal/cooling_device?; do echo 1 > $i/cur_state; done
Comment 20 Matteo Croce 2010-12-26 16:01:33 UTC
Any hint?
Comment 21 Jason Cook 2011-01-03 23:57:15 UTC
I'm seeing this problem as well, but the the values reported seem even more off.  The machine is overheating and restarting several times per day.


Battery 0: Full, 100%
Battery 0: design capacity 623 mAh, last full capacity 458 mAh = 73%
Adapter 0: on-line
Thermal 0: ok, 0.0 degrees C
Thermal 0: trip point 0 switches to mode critical at temperature 100.0 degrees C
Thermal 0: trip point 1 switches to mode passive at temperature 95.0 degrees C
Thermal 0: trip point 2 switches to mode active at temperature 71.0 degrees C
Thermal 0: trip point 3 switches to mode active at temperature 55.0 degrees C
Thermal 1: ok, 26.8 degrees C
Thermal 1: trip point 0 switches to mode critical at temperature 100.0 degrees C
Thermal 1: trip point 1 switches to mode active at temperature 71.0 degrees C
Thermal 1: trip point 2 switches to mode active at temperature 55.0 degrees C
Cooling 0: Processor 0 of 10
Cooling 1: Processor 0 of 10
Cooling 2: Processor 0 of 10
Cooling 3: Processor 0 of 10
Cooling 4: Processor 0 of 10
Cooling 5: Processor 0 of 10
Cooling 6: Processor 0 of 10
Cooling 7: Processor 0 of 10
Cooling 8: Fan 0 of 1
Cooling 9: Fan 0 of 1

At the listed 0 and 26.8 the machine is hot enough that the keyboard is warm to the touch.  Playing a flash video is enough to shut the machine down.
Comment 22 Matteo Croce 2011-01-04 00:12:30 UTC
flash video definitely kills my notebook
Comment 23 Zhang Rui 2012-01-18 02:03:54 UTC
It's great that kernel bugzilla is back.

can you please verify if the problem still exists in the latest upstream
kernel?
Comment 24 Matteo Croce 2012-01-18 08:25:45 UTC
Yes it exists
Comment 25 Lan Tianyu 2013-06-25 07:26:42 UTC
Hi:
      Just check FAN's power resource FN00 and FN01 in the the ACPI dsdt table. They are fake power resources. _ON and _OFF method just store the ONE or ZERO to VFN1(define in the system mem operation region) as status value and then check ECON. But after checking, do nothing. So can't control FAN via ACPI FAN's power resource. So this should be a Bios issue.


       PowerResource (FN01, 0x00, 0x0000)
        {
            Method (_STA, 0, Serialized)  // _STA: Status
            {
                Return (VFN1)
            }

            Method (_ON, 0, Serialized)  // _ON_: Power On
            {
                Store (One, VFN1)
                If (LAnd (ECON, ETMD)) {}
            }

            Method (_OFF, 0, Serialized)  // _OFF: Power Off
            {
                Store (Zero, VFN1)
                If (LAnd (LAnd (ECON, ETMD), LAnd (ECON, VFN0))) {}
            }
        }
Comment 26 Lan Tianyu 2013-07-04 05:26:45 UTC
Since Bios doesn't provide actual fan control and ACPI fan driver can't control its status. Closed this bug.
Comment 27 Matteo Croce 2013-07-04 08:25:46 UTC
If you update the BIOS from Windows with this tool http://downloads.dell.com/bios/R301433.exe the fans will go at maximum speed immediately, much higher than they go under Linux when the temperature is even 99° C
Comment 28 Lan Tianyu 2013-07-05 01:41:14 UTC
Hi Matteo:
       Some vendors will provide vendor driver(not via ACPI) for Windows to control some devices(e.g FAN) but not available for Linux. At last from the ACPI table, it doesn't provide FAN actual control and ACPI fan driver has done its job according ACPI SPEC.

Note You need to log in before you can comment on or make changes to this bug.