Bug 16089
Summary: | 2.6.34 kernel fails while booting on a POWER6 | ||
---|---|---|---|
Product: | Platform Specific/Hardware | Reporter: | Victor (betabandido) |
Component: | PPC-64 | Assignee: | Anton Blanchard (anton) |
Status: | RESOLVED CODE_FIX | ||
Severity: | normal | CC: | alan, mikey |
Priority: | P1 | ||
Hardware: | All | ||
OS: | Linux | ||
Kernel Version: | 2.6.34 | Subsystem: | |
Regression: | No | Bisected commit-id: | |
Attachments: | Original configuration file |
Description
Victor
2010-06-01 14:18:18 UTC
Can you add your .config here? pseries_defconfig boots fine for me with the same kernel. I wonder if CONFIG_IRQSTACKS isn't enabled. I used arch/powerpc/configs/pseries_defconfig and, still, the system continues to fail while booting. In both this configuration file and the one I was using originally (which I'm attaching), CONFIG_IRQSTACKS is enabled. Just in case, LILO configuration entry for the kernel is: image = /boot/vmlinux-2.6.34-vjj ###Don't change this comment - YaST2 identifier: Original name: linux### label = 2.6.34-vjj append = " quiet sysrq=1 insmod=sym53c8xx insmod=ipr" initrd = /boot/initrd-2.6.34-vjj root = /dev/disk/by-id/scsi-35000c500077a329f-part3 The new output for the error is: === output begin === Checking ipr microcode levels Unable to handle kernel paging request for instruction fetch Faulting instruction address: 0x322d30312d31302c cpu 0x7: Vector: 400 (Instruction Access) at [c0000003d360f8f0] pc: 322d30312d31302c lr: 322d30312d31302d sp: c0000003d360fb70 msr: 8000000040009032 current = 0xc0000003d587d010 paca = 0xc00000000f622e00 pid = 5163, comm = iprupdate enter ? for help SP (43302d30302d3030) is in userspace 7:mon> Oops: Kernel access of bad area, sig: 11 [#1] SMP NR_CPUS=128 NUMA pSeries last sysfs file: /sys/devices/pci0000:00/0000:00:01.0/host0/target0:255:255/0:255:255:255/resource_path Modules linked in: NIP: 322d30312d31302c LR: 322d30312d31302d CTR: c000000000375bec REGS: c0000003d360f8f0 TRAP: 0400 Not tainted (2.6.34-vjj) MSR: 8000000040009432 <EE,ME,IR,DR> CR: 28002484 XER: 20000020 TASK = c0000003d587d010[5163] 'iprupdate' THREAD: c0000003d360c000 CPU: 7 GPR00: 322d30312d31302d c0000003d360fb70 c000000000ad07c0 00000000000185a0 GPR04: 0000000000000001 c0000003d360fb10 04000affffffffff c0000000006a6700 GPR08: c000000000823383 0000000000000000 0000000000000020 0000000000000000 GPR12: 000000000000f032 c00000000f622e00 00000000000000ed 0000000000000000 GPR16: 00000000100b8808 0000000010020000 0000000010020000 0000000010010000 GPR20: 0000000010010000 0000000000000001 0000000000001000 000000001045eef8 GPR24: c0000003d360fdf8 302d31342d31302d 43302d30302d3030 2d30332d44352d34 GPR28: 422d33382d30302d 30302d30302d3030 2d30302d30302d30 302d30302d30302d NIP [322d30312d31302c] 0x322d30312d31302c LR [322d30312d31302d] 0x322d30312d31302d Call Trace: [c0000003d360fb70] [322d30312d31302d] 0x322d30312d31302d (unreliable) Instruction dump: XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX ---[ end trace 443ae706377d94f7 ]--- /etc/init.d/iprupdate: line 28: 5163 Segmentation fault $IPRUPDATE --daemon Completed ipr microcode updates failed done done Starting nfsboot (sm-notify) Unable to handle kernel paging request for instruction fetch Faulting instruction address: 0x322d30312d31302c cpu 0x2: Vector: 400 (Instruction Access) at [c0000001e0a0b8f0] pc: 322d30312d31302c lr: 322d30312d31302d sp: c0000001e0a0bb70 msr: 8000000040009032 current = 0xc0000001e075b560 paca = 0xc00000000f622400 pid = 5169, comm = iprinit enter ? for help 2:mon> Oops: Kernel access of bad area, sig: 11 [#2] SMP NR_CPUS=128 NUMA pSeries last sysfs file: /sys/devices/pci0000:00/0000:00:01.0/host0/target0:255:255/0:255:255:255/resource_path Modules linked in: NIP: 322d30312d31302c LR: 322d30312d31302d CTR: c000000000375bec REGS: c0000001e0a0b8f0 TRAP: 0400 Tainted: G D (2.6.34-vjj) MSR: 8000000040009032 <EE,ME,IR,DR> CR: 28002484 XER: 20000020 TASK = c0000001e075b560[5169] 'iprinit' THREAD: c0000001e0a08000 CPU: 2 GPR00: 322d30312d31302d c0000001e0a0bb70 c000000000ad07c0 00000000000185a0 GPR04: 0000000000000001 c0000001e0a0bb10 04000affffffffff c0000000006a6700 GPR08: c000000000823383 0000000000000000 0000000000000020 0000000000000000 GPR12: 000000000000d032 c00000000f622400 0000000000000000 0000000010010000 GPR16: 0000000010020000 0000000010020000 0000000010020000 0000000010010000 GPR20: 0000000010010000 0000000000000001 0000000000001000 000000001024cef8 GPR24: c0000001e0a0bdf8 302d31342d31302d 43302d30302d3030 2d30332d44352d34 GPR28: 422d33382d30302d 30302d30302d3030 2d30302d30302d30 302d30302d30302d NIP [322d30312d31302c] 0x322d30312d31302c LR [322d30312d31302d] 0x322d30312d31302d Call Trace: [c0000001e0a0bb70] [322d30312d31302d] 0x322d30312d31302d (unreliable) Instruction dump: XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX XXXXXXXX BUG: Bad page map in process portmap pte:2d30312d45302c43 pmd:c0000003d3621000 addr:00000000ffd81000 vm_flags:00100173 anon_vma:c0000003d6fbb9e0 mapping:(null) index:ffffffdf Call Trace: [c0000001dfb83690] [c000000000013628] .show_stack+0xc0/0x200 (unreliable) [c0000001dfb83760] [c000000000014b50] .dump_stack+0x28/0x3c [c0000001dfb837e0] [c00000000014c3c8] .print_bad_pte+0x1c4/0x20c [c0000001dfb838a0] [c00000000014c4c8] .vm_normal_page+0xb8/0xe4 [c0000001dfb83940] [c000000000150cbc] .copy_page_range+0x500/0x7b4 [c0000001dfb83a90] [c00000000007cfc4] .dup_mm+0x378/0x5b4 [c0000001dfb83b70] [c00000000007db30] .copy_process+0x8a0/0x1020 [c0000001dfb83c70] [c00000000007e7a0] .do_fork+0x1a8/0x438 [c0000001dfb83d80] [c000000000013b5c] .sys_clone+0x90/0xc0 [c0000001dfb83e30] [c000000000008788] .ppc_clone+0x8/0xc BUG: Bad page map in process portmap pte:322d36312d34302d pmd:c0000003d3621000 addr:00000000ffd82000 vm_flags:00100173 anon_vma:c0000003d6fbb9e0 mapping:(null) index:ffffffe0 Call Trace: [c0000001dfb83690] [c000000000013628] .show_stack+0xc0/0x200 (unreliable) [c0000001dfb83760] [c000000000014b50] .dump_stack+0x28/0x3c [c0000001dfb837e0] [c00000000014c3c8] .print_bad_pte+0x1c4/0x20c [c0000001dfb838a0] [c00000000014c4c8] .vm_normal_page+0xb8/0xe4 [c0000001dfb83940] [c000000000150cbc] .copy_page_range+0x500/0x7b4 [c0000001dfb83a90] [c00000000007cfc4] .dup_mm+0x378/0x5b4 [c0000001dfb83b70] [c00000000007db30] .copy_process+0x8a0/0x1020 [c0000001dfb83c70] [c00000000007e7a0] .do_fork+0x1a8/0x438 [c0000001dfb83d80] [c000000000013b5c] .sys_clone+0x90/0xc0 [c0000001dfb83e30] [c000000000008788] .ppc_clone+0x8/0xc swap_dup: Bad swap file entry 5800002198169818 BUG: Bad page map in process portmap pte:2d30312d45302c43 pmd:c0000003d3621000 addr:00000000ffd84000 vm_flags:00100173 anon_vma:c0000003d6fbb9e0 mapping:(null) index:ffffffe2 Call Trace: [c0000001dfb83690] [c000000000013628] .show_stack+0xc0/0x200 (unreliable) [c0000001dfb83760] [c000000000014b50] .dump_stack+0x28/0x3c [c0000001dfb837e0] [c00000000014c3c8] .print_bad_pte+0x1c4/0x20c [c0000001dfb838a0] [c00000000014c4c8] .vm_normal_page+0xb8/0xe4 [c0000001dfb83940] [c000000000150cbc] .copy_page_range+0x500/0x7b4 [c0000001dfb83a90] [c00000000007cfc4] .dup_mm+0x378/0x5b4 [c0000001dfb83b70] [c00000000007db30] .copy_process+0x8a0/0x1020 [c0000001dfb83c70] [c00000000007e7a0] .do_fork+0x1a8/0x438 [c0000001dfb83d80] [c000000000013b5c] .sys_clone+0x90/0xc0 [c0000001dfb83e30] [c000000000008788] .ppc_clone+0x8/0xc BUG: Bad page map in process portmap pte:322d36302d32302d pmd:c0000003d3621000 addr:00000000ffd85000 vm_flags:00100173 anon_vma:c0000003d6fbb9e0 mapping:(null) index:ffffffe3 Call Trace: [c0000001dfb83690] [c000000000013628] .show_stack+0xc0/0x200 (unreliable) [c0000001dfb83760] [c000000000014b50] .dump_stack+0x28/0x3c [c0000001dfb837e0] [c00000000014c3c8] .print_bad_pte+0x1c4/0x20c [c0000001dfb838a0] [c00000000014c4c8] .vm_normal_page+0xb8/0xe4 [c0000001dfb83940] [c000000000150cbc] .copy_page_range+0x500/0x7b4 [c0000001dfb83a90] [c00000000007cfc4] .dup_mm+0x378/0x5b4 [c0000001dfb83b70] [c00000000007db30] .copy_process+0x8a0/0x1020 [c0000001dfb83c70] [c00000000007e7a0] .do_fork+0x1a8/0x438 [c0000001dfb83d80] [c000000000013b5c] .sys_clone+0x90/0xc0 [c0000001dfb83e30] [c000000000008788] .ppc_clone+0x8/0xc swap_dup: Bad swap file entry 5800001818169818 swap_dup: Bad swap file entry 6000001698181698 BUG: Bad page map in process portmap pte:302d30302d30302d pmd:c0000003d3621000 addr:00000000ffd88000 vm_flags:00100173 anon_vma:c0000003d6fbb9e0 mapping:(null) index:ffffffe6 Call Trace: [c0000001dfb83690] [c000000000013628] .show_stack+0xc0/0x200 (unreliable) [c0000001dfb83760] [c000000000014b50] .dump_stack+0x28/0x3c [c0000001dfb837e0] [c00000000014c3c8] .print_bad_pte+0x1c4/0x20c [c0000001dfb838a0] [c00000000014c4c8] .vm_normal_page+0xb8/0xe4 [c0000001dfb83940] [c000000000150cbc] .copy_page_range+0x500/0x7b4 [c0000001dfb83a90] [c00000000007cfc4] .dup_mm+0x378/0x5b4 [c0000001dfb83b70] [c00000000007db30] .copy_process+0x8a0/0x1020 [c0000001dfb83c70] [c00000000007e7a0] .do_fork+0x1a8/0x438 [c0000001dfb83d80] [c000000000013b5c] .sys_clone+0x90/0xc0 [c0000001dfb83e30] [c000000000008788] .ppc_clone+0x8/0xc swap_dup: Bad swap file entry 5800001818169818 swap_dup: Bad swap file entry 6000001698181698 BUG: Bad page map in process portmap pte:302d30302d30302d pmd:c0000003d3621000 addr:00000000ffd8b000 vm_flags:00100173 anon_vma:c0000003d6fbb9e0 mapping:(null) index:ffffffe9 Call Trace: [c0000001dfb83690] [c000000000013628] .show_stack+0xc0/0x200 (unreliable) [c0000001dfb83760] [c000000000014b50] .dump_stack+0x28/0x3c [c0000001dfb837e0] [c00000000014c3c8] .print_bad_pte+0x1c4/0x20c [c0000001dfb838a0] [c00000000014c4c8] .vm_normal_page+0xb8/0xe4 [c0000001dfb83940] [c000000000150cbc] .copy_page_range+0x500/0x7b4 [c0000001dfb83a90] [c00000000007cfc4] .dup_mm+0x378/0x5b4 [c0000001dfb83b70] [c00000000007db30] .copy_process+0x8a0/0x1020 [c0000001dfb83c70] [c00000000007e7a0] .do_fork+0x1a8/0x438 [c0000001dfb83d80] [c000000000013b5c] .sys_clone+0x90/0xc0 [c0000001dfb83e30] [c000000000008788] .ppc_clone+0x8/0xc === output end === The system continues entering the monitor, and when exiting generating new error output, but I just truncated at this point. If necessary, I can try to boot again and attach the complete output. Created attachment 26613 [details]
Original configuration file
Thanks for reporting this, it looks like a real issue. Could you give the patch here a test: http://patchwork.ozlabs.org/patch/54353/ Thanks for the patch. I could not try the system for a long time yet, but it booted correctly and everything seems to be working nicely. Thanks again! |