Distribution: Mandrake 10.1 Official Hardware Environment: Laptop P4@2.533Ghz 512MB Software Environment: Ingo Molnar's realtime-preempt patch Problem Description: Frequent kernel crash BUG at boot/init time. Running 2.6.11-rc4-RT-V0.7.39-02 (PREEMPT_RT=y). This BUG is happening in some probabilistic fashion, like 1 on each 3 boots, rendering the whole USB subsystem completely unusable as the most notable consequence. Taken from dmesg: BUG: Unable to handle kernel paging request at virtual address 0811eb68 printing eip: c0127927 *pde = 1e41d067 *pte = 00000000 Oops: 0000 [#1] PREEMPT Modules linked in: natsemi crc32 ohci1394 ieee1394 loop subfs evdev ohci_hcd usbcore video thermal processor fan button battery ac CPU: 0 EIP: 0060:[<c0127927>] Not tainted VLI EFLAGS: 00010082 (2.6.11-rc4-RT-V0.7.39-02.0) EIP is at change_owner+0x1a/0x5a eax: de65c550 ebx: de65c550 ecx: 0811eb68 edx: df180550 esi: df0c3cc8 edi: df180a48 ebp: df0c3cc8 esp: de59ded0 ds: 007b es: 007b ss: 0068 preempt: 00000004 Process IRQ 10 (pid: 1439, threadinfo=de59c000 task=df406000) Stack: de65c550 df0c3cc8 df03ff70 df180550 c0127b5c de65c550 df0c3cc8 c0127d31 00000000 df03ff70 00000286 00000000 de59c000 de558000 00010000 00000001 c01283f5 e00d4400 e00c2cb0 00000001 de55a4d4 e00b0bf6 00000001 00000017 Call Trace: [<c0127b5c>] set_new_owner+0x17/0x2b (20) [<c0127d31>] __up_mutex+0xa4/0x193 (12) [<c01283f5>] up+0x35/0x3d (36) [<e00c2cb0>] highlevel_host_reset+0x3b/0x49 [ieee1394] (8) [<e00b0bf6>] ohci_irq_handler+0x576/0x713 [ohci1394] (12) [<c029ec06>] __sched_text_start+0x5a/0x5d7 (28) [<c012d77a>] __do_IRQ+0xca/0x180 (4) [<c012d634>] handle_IRQ_event+0x5c/0xc8 (36) [<c012dd86>] do_hardirq+0x61/0x112 (48) [<c0110dbd>] do_sched_setscheduler+0x73/0xa0 (4) [<c012de37>] do_irqd+0x0/0x96 (20) [<c012de9d>] do_irqd+0x66/0x96 (4) [<c0127018>] kthread+0x94/0xc8 (28) [<c0126f84>] kthread+0x0/0xc8 (16) [<c0100791>] kernel_thread_helper+0x5/0xb (16) Code: 8b 6c 24 18 83 c4 1c c3 b8 da ff ff ff c3 90 90 c3 55 39 ca 89 c5 57 89 c8 56 53 74 49 8b 8a f8 04 00 00 8d ba f8 04 00 00 39 cf <8b> 19 74 37 8d b0 f8 04 00 00 eb 08 89 d9 8b 1b 39 cf 74 27 39 <6>note: IRQ 10[1439] exited with preempt_count 3 BUG: scheduling while atomic: IRQ 10/0x00000003/1439 caller is do_exit+0x1da/0x34d [<c029f03a>] __sched_text_start+0x48e/0x5d7 (8) [<c01163bb>] exit_notify+0x60b/0x8f4 (24) [<c01148fc>] vprintk+0x101/0x142 (24) [<c011687e>] do_exit+0x1da/0x34d (32) [<c010349f>] do_trap+0x0/0xfe (40) [<c010e8fc>] do_page_fault+0x0/0x524 (48) [<c010e8fc>] do_page_fault+0x0/0x524 (12) [<c010ec42>] do_page_fault+0x346/0x524 (4) [<c029f2ea>] preempt_schedule+0x50/0x6b (80) [<c010fcdb>] try_to_wake_up+0x104/0x106 (20) [<e00b0516>] dma_trm_reset+0x36/0x11e [ohci1394] (24) [<c01106ab>] __wake_up_common+0x35/0x55 (16) [<c010e8fc>] do_page_fault+0x0/0x524 (60) [<c0102d57>] error_code+0x2b/0x30 (8) [<c0127927>] change_owner+0x1a/0x5a (44) [<c0127b5c>] set_new_owner+0x17/0x2b (28) [<c0127d31>] __up_mutex+0xa4/0x193 (12) [<c01283f5>] up+0x35/0x3d (36) [<e00c2cb0>] highlevel_host_reset+0x3b/0x49 [ieee1394] (8) [<e00b0bf6>] ohci_irq_handler+0x576/0x713 [ohci1394] (12) [<c029ec06>] __sched_text_start+0x5a/0x5d7 (28) [<c012d77a>] __do_IRQ+0xca/0x180 (4) [<c012d634>] handle_IRQ_event+0x5c/0xc8 (36) [<c012dd86>] do_hardirq+0x61/0x112 (48) [<c0110dbd>] do_sched_setscheduler+0x73/0xa0 (4) [<c012de37>] do_irqd+0x0/0x96 (20) [<c012de9d>] do_irqd+0x66/0x96 (4) [<c0127018>] kthread+0x94/0xc8 (28) [<c0126f84>] kthread+0x0/0xc8 (16) [<c0100791>] kernel_thread_helper+0x5/0xb (16) prev->state: 2 != TASK_RUNNING?? IRQ 10/1439: BUG in __schedule at kernel/sched.c:3028 [<c029efa9>] __sched_text_start+0x3fd/0x5d7 (8) [<c011687e>] do_exit+0x1da/0x34d (80) [<c010349f>] do_trap+0x0/0xfe (40) [<c010e8fc>] do_page_fault+0x0/0x524 (48) [<c010e8fc>] do_page_fault+0x0/0x524 (12) [<c010ec42>] do_page_fault+0x346/0x524 (4) [<c029f2ea>] preempt_schedule+0x50/0x6b (80) [<c010fcdb>] try_to_wake_up+0x104/0x106 (20) [<e00b0516>] dma_trm_reset+0x36/0x11e [ohci1394] (24) [<c01106ab>] __wake_up_common+0x35/0x55 (16) [<c010e8fc>] do_page_fault+0x0/0x524 (60) [<c0102d57>] error_code+0x2b/0x30 (8) [<c0127927>] change_owner+0x1a/0x5a (44) [<c0127b5c>] set_new_owner+0x17/0x2b (28) [<c0127d31>] __up_mutex+0xa4/0x193 (12) [<c01283f5>] up+0x35/0x3d (36) [<e00c2cb0>] highlevel_host_reset+0x3b/0x49 [ieee1394] (8) [<e00b0bf6>] ohci_irq_handler+0x576/0x713 [ohci1394] (12) [<c029ec06>] __sched_text_start+0x5a/0x5d7 (28) [<c012d77a>] __do_IRQ+0xca/0x180 (4) [<c012d634>] handle_IRQ_event+0x5c/0xc8 (36) [<c012dd86>] do_hardirq+0x61/0x112 (48) [<c0110dbd>] do_sched_setscheduler+0x73/0xa0 (4) [<c012de37>] do_irqd+0x0/0x96 (20) [<c012de9d>] do_irqd+0x66/0x96 (4) [<c0127018>] kthread+0x94/0xc8 (28) [<c0126f84>] kthread+0x0/0xc8 (16) [<c0100791>] kernel_thread_helper+0x5/0xb (16) Steps to reproduce: As said, this occurs on almost every other boot/init time.
Created attachment 4599 [details] Complete dmesg output
Created attachment 4600 [details] Kernel config.gz
As you are using Ingo's patches, please post this to him, in an email, not on here.
I think I encountered the same bug using the 2.6.11 DeMuDi kernel and therefore subscribed to this one in case it is reopened.