Bug 10568
Summary: | random crashes after resume - ASUS P5LD2-VM | ||
---|---|---|---|
Product: | Power Management | Reporter: | Oleksij Rempel (fishor) (bug-track) |
Component: | Hibernation/Suspend | Assignee: | acpi_power-sleep-wake |
Status: | REJECTED INSUFFICIENT_DATA | ||
Severity: | normal | CC: | acpi-bugzilla, bunk, gcosta, rui.zhang |
Priority: | P1 | ||
Hardware: | All | ||
OS: | Linux | ||
Kernel Version: | 2.6.20 - 2.6.25-git11 | Subsystem: | |
Regression: | --- | Bisected commit-id: | |
Bug Depends on: | |||
Bug Blocks: | 7216 | ||
Attachments: |
dmesg
dmesg-2.6.25-git11 config |
Description
Oleksij Rempel (fishor)
2008-04-27 23:31:50 UTC
Created attachment 15942 [details]
dmesg
(In reply to comment #0) > After resume system getting unstable, some applications will randomly crash. > For example firefox or complete xorg. Normally i'll get no dmesg That's weird, please try boot option "ignore_loglevel". Please try S3 in a later kernel, say 2.6.24. And attach the dmesg output after S3 resume. agrr... I still can't find 100% crasher. With latest git i can do: 1. debsums or md5deep -e -r Desktop/ 2. S3 3. debsums 4. firefox 5. X will segfault with some kernel before this all will nothing do but if i try to compile kernel, gcc will segfault. 2.6.24 GOOD 2.6.25 GOOD 2.6.25-01393-g2cca775 seems to be GOOD 2.6.25-05301-gc3bf9bc BAD 2.6.25-05561-g064922a BAD There is still no segfault before suspend. Created attachment 15973 [details]
dmesg-2.6.25-git11
does the problem still exist if X is not started? And it would be great if you can find the exact commit that cause this problem using git bisect. i can't find exact commit without 100% killer aplication. I can't reproduce one crash 2 times, but every time will crash some thing other. Today i tryed to kill firefox or gcc with "make -j2 all" and killed killed systemmonitor. Any suggestion what can i use to make better test? like memory test, cpu banch? It seesm like some memory allocation issue. This bug is hard to reproduce with suspend after freh start. So i get 100% killer. It is xorg dependet. i can't reproduce it without X Steps to reproduce: 1. boot 2. suspend S3 && resume 3. run in gnome-terminal for i in $( seq 100000 ); do echo "jjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjjj"; done && sleep 5s && cd tmp/kernel/linux-2.6.20/ && make -j2 all 4. compilation will segfault. ---------------------------------------------------- There is still 9 revisions to bisect but i get kernel oops at __wake_up_common+0x21/0x58 latest BAD commit d2bcbad5f3ad38a1c09861bca7e252dde7bb8259 x86: do not zap_low_mappings in __smp_prepare_cpus It was okay when cpus were cold booted before this point. But with the new state machine, they will not have arrived to the trampoline yet. zapping low mappings will have the bad effect of breaking it completely after paging enablement Signed-off-by: Glauber Costa <gcosta@redhat.com> Signed-off-by: Ingo Molnar <mingo@elte.hu> latest GOOD commit fbac7fcbadc54cc5d374873a2e60e924a056d198 x86: fix alloc_bootmem_pages_node macro missing a semicolon Signed-off-by: Glauber Costa <gcosta@redhat.com> Signed-off-by: Ingo Molnar <mingo@elte.hu> Patch set autors: Signed-off-by: Glauber Costa <gcosta@redhat.com> Signed-off-by: Ingo Molnar <mingo@elte.hu> after some commit betvene v2.6.25-06492-g7663c1e and v2.6.25-07245-ge4c576b ( last day ) i can't reproduce it any more. i do not why it was not reproducable with latest kernel, but it really good reproducable now with latest 2.6.25-testing-07351-g886c35f . SO i assume it was some site effect of some patch but not fixed bug. with nosmp it working 100% stable. Even suspend with HAL and Gnome will crush on resume but with nosmp it's working too. Heh... you don't need to suspend to reproduce this bug, just disable one cpu and enable it agene. echo 0 > /sys/devices/system/cpu/cpu1/online echo 1 > /sys/devices/system/cpu/cpu1/online please boot with "nopat" and see if the problem goes away No. the problem exist. Alexey, I'm trying to reproduce this issue, but without any success so far. Can you please post your .config ? Created attachment 16055 [details]
config
I think important factors are: 2G RAM ( if i set it to 1G it working ok ), SMP ( if set nosmp it working ok ), grafik on board ? broken acpi ?
Seems like this bug was partelly fixed some where betwene 2.6.26-rc2 and -rc3. I can't reproduce it with this "test script". And for first time of this PC live resume with gnome worked!!! At lest one time, second suspend/resume may not always work. I still assume Bug 10131 is just worse form of this bug. hi, alexey, can you reproduce the bug in the latest kernel release? alexey, please re-open this bug if you can reproduce this bug in the latest kernel release. |