Bug 5159
Summary: | BUG: soft lockup detected on CPU#0! | ||
---|---|---|---|
Product: | IO/Storage | Reporter: | David Gardner (dgardnr) |
Component: | MD | Assignee: | io_md |
Status: | CLOSED CODE_FIX | ||
Severity: | normal | CC: | alan, astralbat, axboe, bzolnier, sglass |
Priority: | P2 | ||
Hardware: | i386 | ||
OS: | Linux | ||
Kernel Version: | 2.6.13-rc6-mm1 | Subsystem: | |
Regression: | --- | Bisected commit-id: | |
Attachments: |
dmesg output
/var/log/messages |
Description
David Gardner
2005-08-30 12:46:29 UTC
Created attachment 5820 [details]
dmesg output
Created attachment 5821 [details]
/var/log/messages
This problem is still occurring on 2.6.14-rc2-mm1. Methinks this is an IDE crash. PIO transfers are slow, does it happen with NMI watchdog disabled? Some more analysis: - DMA timed out on hdc and transfer mode was switched to PIO and it (falsely?) triggered NMI watchdog - hda and hdc seem to be identical (but hda has clipped capacity to 40GB) so this is quite interesting and may help in debugging the problem [ I'll audit serverworks.c driver later ] Is the DMA timeout bug reproducible? Is it always hdc? No longer reproducible. I've got this bug just now. Here's my output from dmesg: BUG: soft lockup detected on CPU#0! Pid: 27122, comm: ocssd.bin EIP: 0061:[<c0153738>] CPU: 0 EIP is at follow_page+0x128/0x1c0 EFLAGS: 00000246 Not tainted (2.6.16-xen #1) EAX: 00000000 EBX: 004110a0 ECX: 55555555 EDX: e0c61784 ESI: 00000000 EDI: 0000000e EBP: b4d3a000 DS: 007b ES: 007b CR0: 8005003b CR2: b5c400f8 CR3: 04f03000 CR4: 00000640 [<c0153900>] get_user_pages+0x130/0x340 [<c019aeef>] elf_core_dump+0x99f/0xb29 [<c0176ee0>] do_coredump+0x250/0x2d4 [<c012bdf1>] __dequeue_signal+0x71/0xa0 [<c012bea9>] dequeue_signal+0x89/0x110 [<c012e053>] get_signal_to_deliver+0x313/0x3a0 [<c0104dc1>] do_signal+0x71/0x170 [<c012cc74>] kill_proc_info+0x54/0x80 [<c012e16d>] sigprocmask+0x5d/0x120 [<c012e2ba>] sys_rt_sigprocmask+0x8a/0x140 [<c0104ef8>] do_notify_resume+0x38/0x3c [<c0105031>] work_notifysig+0x13/0x1a I've been running this machine without problems for about three weeks. The kernel was compiled manually as part of a Xen install (this happened on a DomU host) While I agree there are similarities, this is really a totally different bug. I recommend you open a new bug report with this informatio so that it gets assigned properly. closing based on comment #7 |