Bug 14190 - Laptop Vaio VGN-SZ2XP blocked when the cpu enters deep C-state for a long time
Summary: Laptop Vaio VGN-SZ2XP blocked when the cpu enters deep C-state for a long time
Status: REJECTED INSUFFICIENT_DATA
Alias: None
Product: ACPI
Classification: Unclassified
Component: Power-Other (show other bugs)
Hardware: All Linux
: P1 high
Assignee: Venkatesh Pallipadi
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2009-09-18 20:36 UTC by José Illescas
Modified: 2010-06-30 06:14 UTC (History)
3 users (show)

See Also:
Kernel Version: 2.6.28
Subsystem:
Regression: No
Bisected commit-id:


Attachments
This is acpidump output. (137.45 KB, text/plain)
2009-09-21 08:46 UTC, José Illescas
Details
acpidump with numcpus=1 (137.45 KB, text/plain)
2009-09-21 11:10 UTC, José Illescas
Details
lspci with numcpus=1 (28.80 KB, text/plain)
2009-09-21 11:11 UTC, José Illescas
Details
disable the SMI to see whether the issue still happens (817 bytes, patch)
2009-09-24 03:02 UTC, ykzhao
Details | Diff
cpu0cst (535 bytes, application/octet-stream)
2009-09-27 20:35 UTC, José Illescas
Details
cpu0ist (490 bytes, application/octet-stream)
2009-09-27 20:35 UTC, José Illescas
Details
cpu1cst (133 bytes, application/octet-stream)
2009-09-27 20:36 UTC, José Illescas
Details
cpu1ist (137 bytes, application/octet-stream)
2009-09-27 20:36 UTC, José Illescas
Details

Description José Illescas 2009-09-18 20:36:51 UTC
Hello,

I have a Sony Vaio VGN-SZ2XP laptop.

I have the following problem. When I plug the power cable, the system is totally blocked and I have to shut down and turn on the button.

I've probed with kernel 2.6.28, 2.6.29, 2.6.30 and 2.6.31. I've probed with Arch, Ubuntu, Mandriva, Debian, Gentoo and the system block too. There is a problem with Linux Kernel.

I can't see nothing in the /var/log.

I think that the problem is acpi. When I boot the system with option "acpi=off" I can plug the power cord and the laptop don't block, but don't work many things, with graphic card, mouse, etc, .

What can I do with acpi?, how can I configure acpi correctly?.

The problem doesn't occur in Windows, or FreeBSD, or Mac OSx86, only in Linux distros.

I've updated bios from R0082N0 to R0083N0, but the problem still occurs. I've updated bios at R0096N0 version, but the problem still occurs too.

This is my current acpi kernel configuration from Ubuntu Jaunty with kernel 2.6.28:

CONFIG_ACPI=y
CONFIG_ACPI_AC=y
# CONFIG_ACPI_ASUS is not set
CONFIG_ACPI_BATTERY=y
CONFIG_ACPI_BLACKLIST_YEAR=2000
CONFIG_ACPI_BUTTON=y
CONFIG_ACPI_CONTAINER=y
# CONFIG_ACPI_CUSTOM_DSDT is not set
CONFIG_ACPI_CUSTOM_DSDT_FILE=""
CONFIG_ACPI_CUSTOM_DSDT_INITRD=y
# CONFIG_ACPI_DEBUG is not set
CONFIG_ACPI_DOCK=y
CONFIG_ACPI_FAN=y
CONFIG_ACPI_HOTPLUG_CPU=y
CONFIG_ACPI_PCI_SLOT=y
CONFIG_ACPI_PROCESSOR=y
CONFIG_ACPI_PROCFS=y
CONFIG_ACPI_PROCFS_POWER=y
CONFIG_ACPI_PROC_EVENT=y
CONFIG_ACPI_SBS=y
CONFIG_ACPI_SLEEP=y
CONFIG_ACPI_SYSFS_POWER=y
CONFIG_ACPI_SYSTEM=y
CONFIG_ACPI_THERMAL=y
CONFIG_ACPI_TOSHIBA=m
CONFIG_ACPI_VIDEO=m
CONFIG_ACPI_WMI=y

Any ideas?.

Best regards.
Comment 1 Zhang Rui 2009-09-21 01:31:16 UTC
please try boot option "maxcpus=1" or "nosmp" and see if it helps.
Comment 2 ykzhao 2009-09-21 02:48:29 UTC
Will you please attach the output of acpidump, lspci -vxxx?

Will you please confirm whether the box can be booted with the power cable plugged?

Will you please do the following test under the console mode?
    a. boot the system with the battery
    b. kill the process using /proc/acpi/event (use the command of "lsof /proc/acpi/event " to get the process id)
    c. cat /proc/acpi/event 
    d. plug the power cable and see whether the box is blocked

Thanks.
Comment 3 José Illescas 2009-09-21 08:43:07 UTC
I'm 
(In reply to comment #1)
> please try boot option "maxcpus=1" or "nosmp" and see if it helps.

Hello, I'm booted with "maxcpus=1" option and the box don't block. But, I think that this isn't correct solution because I forget one cpu in my box.
Comment 4 José Illescas 2009-09-21 08:45:37 UTC
(In reply to comment #2)
> Will you please attach the output of acpidump, lspci -vxxx?
> 
> Will you please confirm whether the box can be booted with the power cable
> plugged?

The box boot correctly with power cable plugged.

> 
> Will you please do the following test under the console mode?
>     a. boot the system with the battery
>     b. kill the process using /proc/acpi/event (use the command of "lsof
> /proc/acpi/event " to get the process id)
>     c. cat /proc/acpi/event 
>     d. plug the power cable and see whether the box is blocked
> 
> Thanks.

I've probed this and the box is blocked.

This is lspci output:

-----
00:00.0 Host bridge: Intel Corporation Mobile 945GM/PM/GMS, 943/940GML and 945GT Express Memory Controller Hub (rev 03)
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, fast devsel, latency 0
	Capabilities: <access denied>
	Kernel driver in use: agpgart-intel
	Kernel modules: intel-agp
00: 86 80 a0 27 06 01 90 20 03 00 00 06 00 00 00 00
10: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
20: 00 00 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 e0 00 00 00 00 00 00 00 00 00 00 00

00:02.0 VGA compatible controller: Intel Corporation Mobile 945GM/GMS, 943/940GML Express Integrated Graphics Controller (rev 03)
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, fast devsel, latency 0, IRQ 16
	Memory at dc200000 (32-bit, non-prefetchable) [size=512K]
	I/O ports at 1800 [size=8]
	Memory at c0000000 (32-bit, prefetchable) [size=256M]
	Memory at dc300000 (32-bit, non-prefetchable) [size=256K]
	Capabilities: <access denied>
	Kernel modules: intelfb
00: 86 80 a2 27 07 00 90 00 03 00 00 03 00 00 80 00
10: 00 00 20 dc 01 18 00 00 08 00 00 c0 00 00 30 dc
20: 00 00 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 90 00 00 00 00 00 00 00 05 01 00 00

00:02.1 Display controller: Intel Corporation Mobile 945GM/GMS/GME, 943/940GML Express Integrated Graphics Controller (rev 03)
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, fast devsel, latency 0
	Memory at dc280000 (32-bit, non-prefetchable) [size=512K]
	Capabilities: <access denied>
00: 86 80 a6 27 07 00 90 00 03 00 80 03 00 00 80 00
10: 00 00 28 dc 00 00 00 00 00 00 00 00 00 00 00 00
20: 00 00 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 d0 00 00 00 00 00 00 00 00 00 00 00

00:1b.0 Audio device: Intel Corporation 82801G (ICH7 Family) High Definition Audio Controller (rev 02)
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, fast devsel, latency 0, IRQ 21
	Memory at dc340000 (64-bit, non-prefetchable) [size=16K]
	Capabilities: <access denied>
	Kernel driver in use: HDA Intel
	Kernel modules: snd-hda-intel
00: 86 80 d8 27 06 00 10 00 02 00 03 04 10 00 00 00
10: 04 00 34 dc 00 00 00 00 00 00 00 00 00 00 00 00
20: 00 00 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 50 00 00 00 00 00 00 00 0a 01 00 00

00:1c.0 PCI bridge: Intel Corporation 82801G (ICH7 Family) PCI Express Port 1 (rev 02)
	Flags: bus master, fast devsel, latency 0
	Bus: primary=00, secondary=02, subordinate=05, sec-latency=0
	I/O behind bridge: 00002000-00002fff
	Memory behind bridge: d6000000-d7ffffff
	Prefetchable memory behind bridge: 00000000d0000000-00000000d1ffffff
	Capabilities: <access denied>
	Kernel driver in use: pcieport-driver
	Kernel modules: shpchp
00: 86 80 d0 27 07 04 10 00 02 00 04 06 10 00 81 00
10: 00 00 00 00 00 00 00 00 00 02 05 00 20 20 00 20
20: 00 d6 f0 d7 01 d0 f1 d1 00 00 00 00 00 00 00 00
30: 00 00 00 00 40 00 00 00 00 00 00 00 05 01 04 00

00:1c.1 PCI bridge: Intel Corporation 82801G (ICH7 Family) PCI Express Port 2 (rev 02)
	Flags: bus master, fast devsel, latency 0
	Bus: primary=00, secondary=06, subordinate=06, sec-latency=0
	Memory behind bridge: dc100000-dc1fffff
	Capabilities: <access denied>
	Kernel driver in use: pcieport-driver
	Kernel modules: shpchp
00: 86 80 d2 27 07 04 10 00 02 00 04 06 10 00 81 00
10: 00 00 00 00 00 00 00 00 00 06 06 00 f0 00 00 00
20: 10 dc 10 dc f1 ff 01 00 00 00 00 00 00 00 00 00
30: 00 00 00 00 40 00 00 00 00 00 00 00 0a 02 04 00

00:1c.2 PCI bridge: Intel Corporation 82801G (ICH7 Family) PCI Express Port 3 (rev 02)
	Flags: bus master, fast devsel, latency 0
	Bus: primary=00, secondary=07, subordinate=07, sec-latency=0
	I/O behind bridge: 00003000-00003fff
	Memory behind bridge: d8000000-d9ffffff
	Prefetchable memory behind bridge: 00000000d2000000-00000000d3ffffff
	Capabilities: <access denied>
	Kernel driver in use: pcieport-driver
	Kernel modules: shpchp
00: 86 80 d4 27 07 04 10 00 02 00 04 06 10 00 81 00
10: 00 00 00 00 00 00 00 00 00 07 07 00 30 30 00 20
20: 00 d8 f0 d9 01 d2 f1 d3 00 00 00 00 00 00 00 00
30: 00 00 00 00 40 00 00 00 00 00 00 00 0a 03 04 00

00:1c.3 PCI bridge: Intel Corporation 82801G (ICH7 Family) PCI Express Port 4 (rev 02)
	Flags: bus master, fast devsel, latency 0
	Bus: primary=00, secondary=08, subordinate=08, sec-latency=0
	I/O behind bridge: 00004000-00004fff
	Memory behind bridge: da000000-dbffffff
	Prefetchable memory behind bridge: 00000000d4000000-00000000d5ffffff
	Capabilities: <access denied>
	Kernel driver in use: pcieport-driver
	Kernel modules: shpchp
00: 86 80 d6 27 07 04 10 00 02 00 04 06 10 00 81 00
10: 00 00 00 00 00 00 00 00 00 08 08 00 40 40 00 20
20: 00 da f0 db 01 d4 f1 d5 00 00 00 00 00 00 00 00
30: 00 00 00 00 40 00 00 00 00 00 00 00 0a 04 04 00

00:1d.0 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #1 (rev 02)
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, medium devsel, latency 0, IRQ 19
	I/O ports at 1820 [size=32]
	Kernel driver in use: uhci_hcd
00: 86 80 c8 27 05 00 80 02 02 00 03 0c 00 00 80 00
10: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
20: 21 18 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 00 00 00 00 00 00 00 00 0a 01 00 00

00:1d.1 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #2 (rev 02)
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, medium devsel, latency 0, IRQ 19
	I/O ports at 1840 [size=32]
	Kernel driver in use: uhci_hcd
00: 86 80 c9 27 05 00 80 02 02 00 03 0c 00 00 00 00
10: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
20: 41 18 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 00 00 00 00 00 00 00 00 0a 02 00 00

00:1d.2 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #3 (rev 02)
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, medium devsel, latency 0, IRQ 19
	I/O ports at 1860 [size=32]
	Kernel driver in use: uhci_hcd
00: 86 80 ca 27 05 00 80 02 02 00 03 0c 00 00 00 00
10: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
20: 61 18 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 00 00 00 00 00 00 00 00 0a 03 00 00

00:1d.3 USB Controller: Intel Corporation 82801G (ICH7 Family) USB UHCI Controller #4 (rev 02)
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, medium devsel, latency 0, IRQ 19
	I/O ports at 1880 [size=32]
	Kernel driver in use: uhci_hcd
00: 86 80 cb 27 05 00 80 02 02 00 03 0c 00 00 00 00
10: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
20: 81 18 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 00 00 00 00 00 00 00 00 0a 01 00 00

00:1d.7 USB Controller: Intel Corporation 82801G (ICH7 Family) USB2 EHCI Controller (rev 02) (prog-if 20)
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, medium devsel, latency 0, IRQ 23
	Memory at dc544000 (32-bit, non-prefetchable) [size=1K]
	Capabilities: <access denied>
	Kernel driver in use: ehci_hcd
00: 86 80 cc 27 06 00 90 02 02 20 03 0c 00 00 00 00
10: 00 40 54 dc 00 00 00 00 00 00 00 00 00 00 00 00
20: 00 00 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 50 00 00 00 00 00 00 00 0a 04 00 00

00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev e2) (prog-if 01)
	Flags: bus master, fast devsel, latency 0
	Bus: primary=00, secondary=09, subordinate=0d, sec-latency=56
	I/O behind bridge: 00005000-00005fff
	Memory behind bridge: dc000000-dc0fffff
	Prefetchable memory behind bridge: 0000000050000000-0000000053ffffff
	Capabilities: <access denied>
00: 86 80 48 24 07 00 10 00 e2 01 04 06 00 00 01 00
10: 00 00 00 00 00 00 00 00 00 09 0d 38 50 50 80 22
20: 00 dc 00 dc 01 50 f1 53 00 00 00 00 00 00 00 00
30: 00 00 00 00 50 00 00 00 00 00 00 00 ff 00 04 00

00:1f.0 ISA bridge: Intel Corporation 82801GBM (ICH7-M) LPC Interface Bridge (rev 02)
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, medium devsel, latency 0
	Capabilities: <access denied>
	Kernel modules: iTCO_wdt, intel-rng
00: 86 80 b9 27 07 00 10 02 02 00 01 06 00 00 80 00
10: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
20: 00 00 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 e0 00 00 00 00 00 00 00 00 00 00 00

00:1f.1 IDE interface: Intel Corporation 82801G (ICH7 Family) IDE Controller (rev 02) (prog-if 8a [Master SecP PriP])
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, medium devsel, latency 0, IRQ 22
	I/O ports at 01f0 [size=8]
	I/O ports at 03f4 [size=1]
	I/O ports at 0170 [size=8]
	I/O ports at 0374 [size=1]
	I/O ports at 1810 [size=16]
	Kernel driver in use: ata_piix
00: 86 80 df 27 05 00 80 02 02 8a 01 01 00 00 00 00
10: 01 00 00 00 01 00 00 00 01 00 00 00 01 00 00 00
20: 11 18 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 00 00 00 00 00 00 00 00 ff 02 00 00

00:1f.2 IDE interface: Intel Corporation 82801GBM/GHM (ICH7 Family) SATA IDE Controller (rev 02) (prog-if 8f [Master SecP SecO PriP PriO])
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, 66MHz, medium devsel, latency 0, IRQ 22
	I/O ports at 18d0 [size=8]
	I/O ports at 18c4 [size=4]
	I/O ports at 18c8 [size=8]
	I/O ports at 18c0 [size=4]
	I/O ports at 18b0 [size=16]
	Memory at dc544400 (32-bit, non-prefetchable) [size=1K]
	Capabilities: <access denied>
	Kernel driver in use: ata_piix
00: 86 80 c4 27 07 00 b0 02 02 8f 01 01 00 00 00 00
10: d1 18 00 00 c5 18 00 00 c9 18 00 00 c1 18 00 00
20: b1 18 00 00 00 44 54 dc 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 70 00 00 00 00 00 00 00 0a 02 00 00

00:1f.3 SMBus: Intel Corporation 82801G (ICH7 Family) SMBus Controller (rev 02)
	Subsystem: Sony Corporation Device 81e6
	Flags: medium devsel
	I/O ports at 18e0 [size=32]
	Kernel modules: i2c-i801
00: 86 80 da 27 01 00 80 02 02 00 05 0c 00 00 00 00
10: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
20: e1 18 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 00 00 00 00 00 00 00 00 ff 00 00 00

06:00.0 Network controller: Intel Corporation PRO/Wireless 3945ABG [Golan] Network Connection (rev 02)
	Subsystem: Intel Corporation Device 1051
	Flags: bus master, fast devsel, latency 0, IRQ 2298
	Memory at dc100000 (32-bit, non-prefetchable) [size=4K]
	Capabilities: <access denied>
	Kernel driver in use: iwl3945
	Kernel modules: iwl3945
00: 86 80 22 42 06 04 10 00 02 00 80 02 10 00 00 00
10: 00 00 10 dc 00 00 00 00 00 00 00 00 00 00 00 00
20: 00 00 00 00 00 00 00 00 00 00 00 00 86 80 51 10
30: 00 00 00 00 c8 00 00 00 00 00 00 00 0a 01 00 00

07:00.0 Ethernet controller: Marvell Technology Group Ltd. 88E8036 PCI-E Fast Ethernet Controller (rev 15)
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, fast devsel, latency 0, IRQ 2299
	Memory at d8000000 (64-bit, non-prefetchable) [size=16K]
	I/O ports at 3000 [size=256]
	Capabilities: <access denied>
	Kernel driver in use: sky2
	Kernel modules: sky2
00: ab 11 51 43 07 04 10 00 15 00 00 02 10 00 00 00
10: 04 00 00 d8 00 00 00 00 01 30 00 00 00 00 00 00
20: 00 00 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 48 00 00 00 00 00 00 00 0a 01 00 00

09:04.0 CardBus bridge: Texas Instruments PCIxx12 Cardbus Controller
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, medium devsel, latency 168, IRQ 20
	Memory at dc006000 (32-bit, non-prefetchable) [size=4K]
	Bus: primary=09, secondary=0a, subordinate=0d, sec-latency=176
	Memory window 0: 50000000-53fff000 (prefetchable)
	Memory window 1: 54000000-57fff000
	I/O window 0: 00005000-000050ff
	I/O window 1: 00005400-000054ff
	16-bit legacy interface ports at 0001
	Kernel driver in use: yenta_cardbus
	Kernel modules: yenta_socket
00: 4c 10 39 80 07 00 10 02 00 00 07 06 20 a8 82 00
10: 00 60 00 dc a0 00 00 02 09 0a 0d b0 00 00 00 50
20: 00 f0 ff 53 00 00 00 54 00 f0 ff 57 00 50 00 00
30: fc 50 00 00 00 54 00 00 fc 54 00 00 ff 01 c0 05
40: 4d 10 e6 81 01 00 00 00 00 00 00 00 00 00 00 00
50: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
60: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00
70: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00

09:04.1 FireWire (IEEE 1394): Texas Instruments PCIxx12 OHCI Compliant IEEE 1394 Host Controller (prog-if 10)
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, medium devsel, latency 32, IRQ 21
	Memory at dc005000 (32-bit, non-prefetchable) [size=2K]
	Memory at dc000000 (32-bit, non-prefetchable) [size=16K]
	Capabilities: <access denied>
	Kernel driver in use: ohci1394
	Kernel modules: firewire-ohci, ohci1394
00: 4c 10 3a 80 16 00 10 02 00 10 00 0c 10 20 80 00
10: 00 50 00 dc 00 00 00 dc 00 00 00 00 00 00 00 00
20: 00 00 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 44 00 00 00 00 00 00 00 0a 02 03 04

09:04.2 Mass storage controller: Texas Instruments 5-in-1 Multimedia Card Reader (SD/MMC/MS/MS PRO/xD)
	Subsystem: Sony Corporation Device 81e6
	Flags: bus master, medium devsel, latency 57, IRQ 22
	Memory at dc004000 (32-bit, non-prefetchable) [size=4K]
	Capabilities: <access denied>
	Kernel driver in use: tifm_7xx1
	Kernel modules: tifm_7xx1
00: 4c 10 3b 80 06 00 10 02 00 00 80 01 10 39 80 00
10: 00 40 00 dc 00 00 00 00 00 00 00 00 00 00 00 00
20: 00 00 00 00 00 00 00 00 00 00 00 00 4d 10 e6 81
30: 00 00 00 00 44 00 00 00 00 00 00 00 0a 03 07 04
-----

Best regards.
Comment 5 José Illescas 2009-09-21 08:46:28 UTC
Created attachment 23126 [details]
This is acpidump output.

This is acpidump output.
Comment 6 Zhang Rui 2009-09-21 09:11:24 UTC
I think this patch may help,
commit 74b5820808215f65b70b05a099d6d3c969b82689
Author: Bjorn Helgaas <bjorn.helgaas@hp.com>
Date:   Wed Jul 29 15:54:25 2009 -0600

    ACPI: bind workqueues to CPU 0 to avoid SMI corruption

    On some machines, a software-initiated SMI causes corruption unless the
    SMI runs on CPU 0.  An SMI can be initiated by any AML, but typically it's
    done in GPE-related methods that are run via workqueues, so we can avoid
    the known corruption cases by binding the workqueues to CPU 0.

    References:
        http://bugzilla.kernel.org/show_bug.cgi?id=13751
        https://bugs.launchpad.net/bugs/157171
        https://bugs.launchpad.net/bugs/157691

    Signed-off-by: Bjorn Helgaas <bjorn.helgaas@hp.com>
    Signed-off-by: Len Brown <len.brown@intel.com>

But IMO, it has been shipped in 2.6.31.
you said that you have tries this kernel but the laptop still blocks, right?
could you please make a double check?


>> 
>> Will you please do the following test under the console mode?
>>     a. boot the system with the battery
>>     b. kill the process using /proc/acpi/event (use the command of "lsof
>> /proc/acpi/event " to get the process id)
>>     c. cat /proc/acpi/event 
>>     d. plug the power cable and see whether the box is blocked
>> 
>> Thanks.
>
>I've probed this and the box is blocked.

then will you please get this info with boot option "maxcpus=1"?
Comment 7 José Illescas 2009-09-21 11:10:27 UTC
(In reply to comment #6)
> I think this patch may help,
> commit 74b5820808215f65b70b05a099d6d3c969b82689
> Author: Bjorn Helgaas <bjorn.helgaas@hp.com>
> Date:   Wed Jul 29 15:54:25 2009 -0600
> 
>     ACPI: bind workqueues to CPU 0 to avoid SMI corruption
> 
>     On some machines, a software-initiated SMI causes corruption unless the
>     SMI runs on CPU 0.  An SMI can be initiated by any AML, but typically
>     it's
>     done in GPE-related methods that are run via workqueues, so we can avoid
>     the known corruption cases by binding the workqueues to CPU 0.
> 
>     References:
>         http://bugzilla.kernel.org/show_bug.cgi?id=13751
>         https://bugs.launchpad.net/bugs/157171
>         https://bugs.launchpad.net/bugs/157691
> 
>     Signed-off-by: Bjorn Helgaas <bjorn.helgaas@hp.com>
>     Signed-off-by: Len Brown <len.brown@intel.com>
> 
> But IMO, it has been shipped in 2.6.31.
> you said that you have tries this kernel but the laptop still blocks, right?
> could you please make a double check?
> 
> 
> >> 
> >> Will you please do the following test under the console mode?
> >>     a. boot the system with the battery
> >>     b. kill the process using /proc/acpi/event (use the command of "lsof
> >> /proc/acpi/event " to get the process id)
> >>     c. cat /proc/acpi/event 
> >>     d. plug the power cable and see whether the box is blocked
> >> 
> >> Thanks.
> >
> >I've probed this and the box is blocked.
> 
> then will you please get this info with boot option "maxcpus=1"?

Hello,

I feel you, with option "maxcpus=1" the box is blocked too, I've probed very much and finally is blocked, :-(.

With 2.6.31 the box blocks too.

I send acpicpu and lspci output with numcpus=1.

Regards.
Comment 8 José Illescas 2009-09-21 11:10:57 UTC
Created attachment 23128 [details]
acpidump with numcpus=1
Comment 9 José Illescas 2009-09-21 11:11:24 UTC
Created attachment 23129 [details]
lspci with numcpus=1
Comment 10 José Illescas 2009-09-22 11:13:56 UTC
Hello, any ideas?.
Comment 11 Zhang Rui 2009-09-23 03:38:24 UTC
does it still hang in older kernels, say 2.6.21?
does the problem still exist after upgrading the BIOS?
Comment 12 José Illescas 2009-09-23 08:10:07 UTC
In older kernels the box is blocked too. My box is three years old and in all of versions of ubuntu has blocked.

After upgrading the bios the box is blocked too.

If you think that the problem is my box, I remember you that the problem don't occurs in FreeBSD, Mac OSx86 and Windows, only in linux distros.
Comment 13 ykzhao 2009-09-24 02:54:35 UTC
Please try the boot option of "acpi_apic_instance=2" and see whether the issue still exists.

Thanks.
Comment 14 ykzhao 2009-09-24 03:02:07 UTC
Created attachment 23163 [details]
disable the SMI to see whether the issue still happens

Will you please try the debug patch on the latest kernel and see whether the issue still exists?
In this debug patch it will disable the SMI.
thanks.
Comment 15 José Illescas 2009-09-24 11:33:31 UTC
Hello, with the boot option "acpi_apic_instance=2" the box is blocked too.

How can I try the debug patch?, Do I have to compile the kernel?.

Thanks.
Comment 16 José Illescas 2009-09-25 20:51:12 UTC
Hello,

I've tried the debug patch on the kernel 2.6.31.1 and I've compiled the kernel, but the box is blocked at first plug the power cable, :-((.

Regards.
Comment 17 ykzhao 2009-09-27 01:02:05 UTC
Thanks for the test.
    From the test it still hangs when the SMI is disabled.
    Will you please try the boot option of "idle=poll" and see whether the box is still blocked ?
    Will you please also attach the following output?
   ./acpidump --addr 0x3f676C73 --length 0x1EA -o cpu0ist
   ./acpidump --addr 0x3f676E5D --length 0x89 -o cpu1ist
   ./acpidump --addr 0x3f6769d7 --length 0x217 -o cpu0cst
   ./acpidump --addr 0x3f676bEE --length 0x85  -o cpu1cst

     Thanks.
Comment 18 José Illescas 2009-09-27 20:33:32 UTC
I can't to beleive!, with boot option "idle=poll", works!. The box don't blocks.

What's the problem?. What's the "idle=poll" option?.

Anyway, I attach the outputs of four commands.

Thanks you.
Comment 19 José Illescas 2009-09-27 20:35:09 UTC
Created attachment 23189 [details]
cpu0cst
Comment 20 José Illescas 2009-09-27 20:35:37 UTC
Created attachment 23190 [details]
cpu0ist
Comment 21 José Illescas 2009-09-27 20:36:05 UTC
Created attachment 23191 [details]
cpu1cst
Comment 22 José Illescas 2009-09-27 20:36:34 UTC
Created attachment 23192 [details]
cpu1ist
Comment 23 ykzhao 2009-09-29 06:10:09 UTC
How about the following boot option?
   a. processor.max_cstate=1
   b. nolapic_timer
   c. processor.max_cstate=2

Will you please attach the output of "cat /proc/cpuinfo"?

When the boot option of "idle=poll" is added, the C-state will be disabled.

thanks.
Comment 24 José Illescas 2009-09-29 18:48:18 UTC
Hello, tomorrow, I will probe three boot options that you say. Now I have not battery, it is in my work site.

Is it convenient to disable the C-states?.

This is my /proc/cpuinfo, with boot option "idle=poll":

$ cat /proc/cpuinfo 
processor	: 0
vendor_id	: GenuineIntel
cpu family	: 6
model		: 14
model name	: Genuine Intel(R) CPU           T2500  @ 2.00GHz
stepping	: 8
cpu MHz		: 1000.000
cache size	: 2048 KB
physical id	: 0
siblings	: 2
core id		: 0
cpu cores	: 2
apicid		: 0
initial apicid	: 0
fdiv_bug	: no
hlt_bug		: no
f00f_bug	: no
coma_bug	: no
fpu		: yes
fpu_exception	: yes
cpuid level	: 10
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx constant_tsc arch_perfmon bts pni monitor vmx est tm2 xtpr pdcm
bogomips	: 4000.14
clflush size	: 64
power management:

processor	: 1
vendor_id	: GenuineIntel
cpu family	: 6
model		: 14
model name	: Genuine Intel(R) CPU           T2500  @ 2.00GHz
stepping	: 8
cpu MHz		: 1000.000
cache size	: 2048 KB
physical id	: 0
siblings	: 2
core id		: 1
cpu cores	: 2
apicid		: 1
initial apicid	: 1
fdiv_bug	: no
hlt_bug		: no
f00f_bug	: no
coma_bug	: no
fpu		: yes
fpu_exception	: yes
cpuid level	: 10
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic sep mtrr pge mca cmov pat clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx constant_tsc arch_perfmon bts pni monitor vmx est tm2 xtpr pdcm
bogomips	: 4000.41
clflush size	: 64
power management:
Comment 25 José Illescas 2009-09-30 11:07:49 UTC
This is the result with following boot option:

a. processor.max_cstate=1: Works fine. Don't blocks when plug power cable.
b. nolapic_timer: Works fine. Don't blocks when plug power cable.
c. processor.max_cstate=2: Works fine. Don't blocks when plug power cable.

Finally, what's the recommended configuration for boot options?.

Greetings.
Comment 26 Zhang Rui 2010-02-21 06:30:00 UTC
the problem seems to happen if both lapic timer and deep C states are enabled.
but something weird is that we don't use lapic timer in deep C states. so I'm not sure what the problem is.
Yakui, please update this bug report.
Comment 27 José Illescas 2010-02-21 19:37:09 UTC
I have a new problem.

With kernel "2.6.31-14-generic #48-Ubuntu SMP" then cpu frequency is 100% forever and the fan is working at 100% forever too. My laptop is very hot!.

What's the problem now with 2.6.31???.

Greetings.
Comment 28 José Illescas 2010-02-21 19:38:08 UTC
The current boot option is "idle=poll".
Comment 29 ykzhao 2010-02-22 01:32:59 UTC
Can you use top to confirm which process/kernel thread uses a lot of cpu?

It will be great if you can also attach the output of "grep -R . /sys/firmware/acpi/interrupts/*".

How about using another option? for example: nolapic_timer, processor.max_cstate=2

thanks.
Comment 30 ykzhao 2010-02-22 01:46:41 UTC
The issue on this box is very weird.
From the test in comment #25 it seems that it can work well after using the following boot option:
   >processor.max_cstate =1 (only C1 is used)
   >nolapic_timer : C1/C2/C3 is used. The cpu will be waked up from C-state periodically and local APIC timer is disabled.
   >processor.max_cstate=2 : C1/C2 is used. 

   When the system is in C2/C3 state, the local APIC timer is also disabled. But the system can't work well without the boot option of "nolapic_timer" if the system can enter the deep C-state(C3). The only difference with/without the boot option of "nolapic_timer" is that the timer will bring the cpu out of deep C-state periodically. 

Hi, Venki
    Any idea about this issue?

thanks.
Comment 31 José Illescas 2010-02-22 12:09:44 UTC
Over the comment #30, the three options worked fine, but it was with the kernel 2.6.28. Now with kernel 2.6.31 this don't work. Thanks.
Comment 32 José Illescas 2010-02-22 12:32:46 UTC
> Can you use top to confirm which process/kernel thread uses a lot of cpu?

I'm sorry, but don't exist any process with 100% cpu. But, the fan is at 100% and heats up very quickly.

>It will be great if you can also attach the output of "grep -R .
>/sys/firmware/acpi/interrupts/*".

This is the output:

$ sudo grep -R . /sys/firmware/acpi/interrupts/*
[sudo] password for pepus: 
/sys/firmware/acpi/interrupts/error:       0
/sys/firmware/acpi/interrupts/ff_gbl_lock:       0	enabled
/sys/firmware/acpi/interrupts/ff_pmtimer:       0	invalid
/sys/firmware/acpi/interrupts/ff_pwr_btn:       0	invalid
/sys/firmware/acpi/interrupts/ff_rt_clk:       0	disabled
/sys/firmware/acpi/interrupts/ff_slp_btn:       0	invalid
/sys/firmware/acpi/interrupts/gpe00:       0	invalid
/sys/firmware/acpi/interrupts/gpe01:       0	enabled
/sys/firmware/acpi/interrupts/gpe02:       0	enabled
/sys/firmware/acpi/interrupts/gpe03:       0	disabled
/sys/firmware/acpi/interrupts/gpe04:       0	disabled
/sys/firmware/acpi/interrupts/gpe05:       0	enabled
/sys/firmware/acpi/interrupts/gpe06:       0	invalid
/sys/firmware/acpi/interrupts/gpe07:       0	enabled
/sys/firmware/acpi/interrupts/gpe08:       0	enabled
/sys/firmware/acpi/interrupts/gpe09:       0	disabled
/sys/firmware/acpi/interrupts/gpe0A:       0	invalid
/sys/firmware/acpi/interrupts/gpe0B:       0	disabled
/sys/firmware/acpi/interrupts/gpe0C:       0	invalid
/sys/firmware/acpi/interrupts/gpe0D:       0	disabled
/sys/firmware/acpi/interrupts/gpe0E:       0	invalid
/sys/firmware/acpi/interrupts/gpe0F:       0	invalid
/sys/firmware/acpi/interrupts/gpe10:       0	invalid
/sys/firmware/acpi/interrupts/gpe11:       0	invalid
/sys/firmware/acpi/interrupts/gpe12:       0	invalid
/sys/firmware/acpi/interrupts/gpe13:       0	invalid
/sys/firmware/acpi/interrupts/gpe14:       0	invalid
/sys/firmware/acpi/interrupts/gpe15:       0	invalid
/sys/firmware/acpi/interrupts/gpe16:       0	invalid
/sys/firmware/acpi/interrupts/gpe17:   19233	enabled
/sys/firmware/acpi/interrupts/gpe18:       0	invalid
/sys/firmware/acpi/interrupts/gpe19:       0	invalid 
/sys/firmware/acpi/interrupts/gpe1A:       0	invalid
/sys/firmware/acpi/interrupts/gpe1B:       0	invalid
/sys/firmware/acpi/interrupts/gpe1C:       0	invalid
/sys/firmware/acpi/interrupts/gpe1D:       0	enabled
/sys/firmware/acpi/interrupts/gpe1E:       0	invalid
/sys/firmware/acpi/interrupts/gpe1F:       0	invalid
/sys/firmware/acpi/interrupts/gpe_all:   19233
/sys/firmware/acpi/interrupts/sci:   19233
/sys/firmware/acpi/interrupts/sci_not:       0


>How about using another option? for example: nolapic_timer,
>processor.max_cstate=2

With nolapic_timer, is random, the fan not working 100% but sometimes if it is heated and occasionally.

I will probe with "processor.max_cstate=2".

Thanks.
Comment 33 José Illescas 2010-02-22 12:44:40 UTC
I've probed with "processor.max_cstate=2" option and the behavior is similar to the nolapic_timer option and also the cpu frequency is unchanged, is always full. Always is 2000 MHz.


$ cat /proc/cpuinfo 
processor	: 0
vendor_id	: GenuineIntel
cpu family	: 6
model		: 14
model name	: Genuine Intel(R) CPU           T2500  @ 2.00GHz
stepping	: 8
cpu MHz		: 2000.000
cache size	: 2048 KB
physical id	: 0
siblings	: 2
core id		: 0
cpu cores	: 2
apicid		: 0
initial apicid	: 0
fdiv_bug	: no
hlt_bug		: no
f00f_bug	: no
coma_bug	: no
fpu		: yes
fpu_exception	: yes
cpuid level	: 10
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic mtrr pge mca cmov clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx constant_tsc arch_perfmon bts pni monitor vmx est tm2 xtpr pdcm
bogomips	: 4000.43
clflush size	: 64
power management:

processor	: 1
vendor_id	: GenuineIntel
cpu family	: 6
model		: 14
model name	: Genuine Intel(R) CPU           T2500  @ 2.00GHz
stepping	: 8
cpu MHz		: 2000.000
cache size	: 2048 KB
physical id	: 0
siblings	: 2
core id		: 1
cpu cores	: 2
apicid		: 1
initial apicid	: 1
fdiv_bug	: no
hlt_bug		: no
f00f_bug	: no
coma_bug	: no
fpu		: yes
fpu_exception	: yes
cpuid level	: 10
wp		: yes
flags		: fpu vme de pse tsc msr pae mce cx8 apic mtrr pge mca cmov clflush dts acpi mmx fxsr sse sse2 ss ht tm pbe nx constant_tsc arch_perfmon bts pni monitor vmx est tm2 xtpr pdcm
bogomips	: 4000.33
clflush size	: 64
power management:
Comment 34 Zhang Rui 2010-06-22 07:07:57 UTC
does the problem still exist in the latest git kernel?
Comment 35 Zhang Rui 2010-06-30 06:14:24 UTC
bug closed as there is no response from the bug reporter.

please feel free to re-open it if the problem still exists in the latest upstream kernel.

Note You need to log in before you can comment on or make changes to this bug.