Bug 4867

Summary: Crash Reading /proc/acpi/thermal_zone/THRM/temperature
Product: ACPI Reporter: Jasper Spaans (jasper)
Component: Power-ThermalAssignee: Konstantin Karasyov (konstantin.karasyov)
Status: CLOSED UNREPRODUCIBLE    
Severity: blocking CC: acpi-bugzilla, akpm
Priority: P2    
Hardware: i386   
OS: Linux   
Kernel Version: 2.6.13-rc2-mm1 Subsystem:
Regression: --- Bisected commit-id:

Description Jasper Spaans 2005-07-09 09:58:56 UTC
Distribution: Debian/testing

Hardware Environment:
Shuttle AK-32 motherboard, Athlon XP 2000

Software Environment:
2.6.13-rc2-mm1, which contains ACPI: Subsystem revision 20050408

Problem Description:
I run a home-built daemon on this machine to monitor the temperature, and if
necessary to start throttling the processor. After upgrading to 2.6.13-rc2-mm1
this daemon causes a fatal oops. Note that this behaviour only occurs after some
hours have passed.
 

Jul  9 16:22:27 spaans kernel: [42966947.960000] Unable to handle kernel paging
request at virtual address 5755c35d
Jul  9 16:22:27 spaans kernel: [42966947.960000]  printing eip:
Jul  9 16:22:27 spaans kernel: [42966947.960000] c02247e0
Jul  9 16:22:27 spaans kernel: [42966947.960000] *pde = 00000000
Jul  9 16:22:27 spaans kernel: [42966947.960000] Oops: 0000 [#1]
Jul  9 16:22:27 spaans kernel: [42966947.960000] PREEMPT
Jul  9 16:22:27 spaans kernel: [42966947.960000] Modules linked in: sg sr_mod
binfmt_misc appletalk psnap llc wp512 tgr192 tea sha512 md4 khazad michael_mic
crc32c cast6cast5 arc4 anubis lp bridge iptable_filter ip_tables floppy
snd_via82xx snd_ac97_codec snd_pcm_oss snd_mixer_oss snd_pcm snd_timer
snd_page_alloc snd_mpu401_uart
snd_rawmidi snd_seq_device snd hostap_pci hostap usbhid pl2303 usbserial twofish
serpent aes blowfish sha256 crypto_null uhci_hcd eth1394 sbp2 ohci1394 ieee1394
Jul  9 16:22:27 spaans kernel: [42966947.960000] CPU:    0
Jul  9 16:22:27 spaans kernel: [42966947.960000] EIP:   
0060:[acpi_ns_get_attached_object+97/151]    Not tainted VLI
Jul  9 16:22:27 spaans kernel: [42966947.960000] EFLAGS: 00010202   (2.6.13-rc2-mm1)
Jul  9 16:22:27 spaans kernel: [42966947.960000] EIP is at
acpi_ns_get_attached_object+0x61/0x97
Jul  9 16:22:27 spaans kernel: [42966947.960000] eax: 0000010c   ebx: c0226232 
 ecx: 5755c35d   edx: 0000000c
Jul  9 16:22:27 spaans kernel: [42966947.960000] esi: e24a3d08   edi: d8411218 
 ebp: c0226232   esp: e24a3d08
Jul  9 16:22:27 spaans kernel: [42966947.960000] ds: 007b   es: 007b   ss: 0068
Jul  9 16:22:27 spaans kernel: [42966947.960000] Process tmon (pid: 4066,
threadinfo=e24a2000 task=e243c560)
Jul  9 16:22:27 spaans kernel: [42966947.960000] Stack: 00000010 c03983ae
c039828d 00000000 d8411028 c0211eb0 0000000a 00000040
Jul  9 16:22:27 spaans kernel: [42966947.960000]        c039330f c0392fdf
c5e72788 c18ee588 00000000 d8411028 c020f907 f880883c
Jul  9 16:22:27 spaans kernel: [42966947.960000]        0000000e 00000000
e5a36828 c7fdef88 c18f1708 c0226232 f88087d3 00000000
Jul  9 16:22:27 spaans kernel: [42966947.960000] Call Trace:
Jul  9 16:22:27 spaans kernel: [42966947.960000] Call Trace:
Jul  9 16:22:27 spaans kernel: [42966947.960000] 
[acpi_ds_init_aml_walk+206/391] acpi_ds_init_aml_walk+0xce/0x187
Jul  9 16:22:27 spaans kernel: [42966947.960000] 
[acpi_ds_call_control_method+329/653] acpi_ds_call_control_method+0x149/0x28d
Jul  9 16:22:27 spaans kernel: [42966947.960000]  [acpi_ps_parse_loop+2500/2510]
acpi_ps_parse_loop+0x9c4/0x9ce
Jul  9 16:22:27 spaans kernel: [42966947.960000]  [acpi_ps_parse_aml+241/680]
acpi_ps_parse_aml+0xf1/0x2a8
Jul  9 16:22:27 spaans kernel: [42966947.960000]  [acpi_psx_execute+452/612]
acpi_psx_execute+0x1c4/0x264
Jul  9 16:22:27 spaans kernel: [42966947.960000] 
[acpi_ns_execute_control_method+216/246] acpi_ns_execute_control_method+0xd8/0xf6
Jul  9 16:22:27 spaans kernel: [42966947.960000] 
[acpi_ns_evaluate_by_handle+208/254] acpi_ns_evaluate_by_handle+0xd0/0xfe
Jul  9 16:22:27 spaans kernel: [42966947.960000] 
[acpi_ns_evaluate_relative+337/427] acpi_ns_evaluate_relative+0x151/0x1ab
Jul  9 16:22:27 spaans kernel: [42966947.960000]  [acpi_evaluate_object+309/556]
acpi_evaluate_object+0x135/0x22c
Jul  9 16:22:27 spaans kernel: [42966947.960000] 
[acpi_evaluate_integer+201/486] acpi_evaluate_integer+0xc9/0x1e6
Jul  9 16:22:27 spaans kernel: [42966947.960000]  [link_path_walk+157/218]
link_path_walk+0x9d/0xda
Jul  9 16:22:27 spaans kernel: [42966947.960000]  [acpi_ut_trace+40/46]
acpi_ut_trace+0x28/0x2e
Jul  9 16:22:27 spaans kernel: [42966947.960000] 
[acpi_thermal_get_temperature+96/174] acpi_thermal_get_temperature+0x60/0xae
Jul  9 16:22:27 spaans kernel: [42966947.960000] 
[acpi_thermal_temp_seq_show+57/135] acpi_thermal_temp_seq_show+0x39/0x87
Jul  9 16:22:27 spaans kernel: [42966947.960000]  [seq_read+243/620]
seq_read+0xf3/0x26c
Jul  9 16:22:27 spaans kernel: [42966947.960000]  [seq_read+0/620]
seq_read+0x0/0x26c
Jul  9 16:22:27 spaans kernel: [42966947.960000]  [vfs_read+165/327]
vfs_read+0xa5/0x147
Jul  9 16:22:27 spaans kernel: [42966947.960000]  [sys_read+60/99]
sys_read+0x3c/0x63
Jul  9 16:22:27 spaans kernel: [42966947.960000]  [syscall_call+7/11]
syscall_call+0x7/0xb
Jul  9 16:22:27 spaans kernel: [42966947.960000] Code: 39 c0 56 68 10 01 00 00
6a 02 e8 68 83 00 00 b8 11 01 00 00 31 c9 89 f2 e8 83 85 00 00 31 c0 83 c4 10 eb
37 8b 4b 08 85 c9 74 0f <8a> 01 83 e8 0e 3c 01 77 06 80 79 01 1d 75 12 b8 18 01
00 00 89



Steps to reproduce:
Boot, start monitoring tool, wait.
Comment 1 Andrew Morton 2005-07-28 23:56:40 UTC
Jasper, do you know if this is still happening in 2.6.13-rc3-mm3?

Thanks.
Comment 2 Len Brown 2005-08-04 22:56:05 UTC
Jasper,
Can you clarify which kernels work and which fail?
You mentioned that upgrading to 2.6.13-rc2-mm1/20050408 fails.
What did you upgrade from that worked?
How about vanilla 2.6.12 or vanilla 2.6.13(-rc6, when that arrives)?
Ie. is this specific to -mm or does it happen in Linus' tree too?

Can you attach the source for the daemon so that Konstantin
can try to reproduce the failure?

thanks,
-Len

Comment 3 Len Brown 2005-08-11 22:18:22 UTC
any news on 2.6.13-rc6?
Comment 4 Jasper Spaans 2005-08-14 16:04:26 UTC
Well, as to testing newer kernelversions, the cpu-fan on the processor on the
motherboard with these symptoms failed -- unfortunately taking the cpu with it.

I've replaced it with a new one, and cannot test the old one right now. (If I
can get hold of an ancient athlon, I'll try to do some testing)