Bug 13998
Summary: | fscache+nfs BUG: unable to handle kernel NULL pointer dereference at 0000000000000040 | ||
---|---|---|---|
Product: | File System | Reporter: | joshua (jlevine) |
Component: | Other | Assignee: | David Howells (dhowells) |
Status: | RESOLVED OBSOLETE | ||
Severity: | normal | CC: | akpm, alan, dhowells, esmil, jlevine, nyerup |
Priority: | P1 | ||
Hardware: | All | ||
OS: | Linux | ||
Kernel Version: | 2.6.38.8 | Subsystem: | |
Regression: | No | Bisected commit-id: | |
Attachments: | 2.6.38.8 kernel log with null pointer dereference |
Description
joshua
2009-08-16 17:30:32 UTC
Aug 16 10:14:13 dload04 kernel: [38984.345500] BUG: soft lockup - CPU#3 stuck for 61s! [cachefilesd:6652] Aug 16 10:14:13 dload04 kernel: [38984.345506] Modules linked in: cachefiles autofs4 nfs lockd fscache nfs_acl auth_rpcgss sunrpc iptable_filter ip_tables x_tables usbhid psmouse serio_raw snd_pcm snd_timer snd soundcore snd_page_alloc amd64_edac_mod parport_pc parport i2c_amd8111 shpchp edac_core pcspkr i2c_amd756 amd_rng k8temp 3w_xxxx sata_mv tg3 floppy Aug 16 10:14:13 dload04 kernel: [38984.345506] CPU 3: Aug 16 10:14:13 dload04 kernel: [38984.345506] Modules linked in: cachefiles autofs4 nfs lockd fscache nfs_acl auth_rpcgss sunrpc iptable_filter ip_tables x_tables usbhid psmouse serio_raw snd_pcm snd_timer snd soundcore snd_page_alloc amd64_edac_mod parport_pc parport i2c_amd8111 shpchp edac_core pcspkr i2c_amd756 amd_rng k8temp 3w_xxxx sata_mv tg3 floppy Aug 16 10:14:13 dload04 kernel: [38984.349016] Pid: 6652, comm: cachefilesd Tainted: G D 2.6.31-rc6-iwin-linus-cachefs #1 H8DAR-T Aug 16 10:14:13 dload04 kernel: [38984.349016] RIP: 0010:[<ffffffff8102a156>] [<ffffffff8102a156>] __ticket_spin_lock+0x16/0x20 Aug 16 10:14:13 dload04 kernel: [38984.349016] RSP: 0018:ffff8800785ddcd8 EFLAGS: 00000297 Aug 16 10:14:13 dload04 kernel: [38984.349016] RAX: 0000000000000605 RBX: ffff8800785ddcd8 RCX: 00000000c0000100 Aug 16 10:14:13 dload04 kernel: [38984.349016] RDX: ffff8800785ddd48 RSI: 0000000000000003 RDI: ffff880068324ddc Aug 16 10:14:13 dload04 kernel: [38984.349016] RBP: ffffffff8100ca2e R08: ffff8800785dc000 R09: 00000000ffffffff Aug 16 10:14:13 dload04 kernel: [38984.349016] R10: ffff8801385c4448 R11: 0000000000000000 R12: 0000000000013e40 Aug 16 10:14:13 dload04 kernel: [38984.349016] R13: 0000000000013e40 R14: 0000000000013e40 R15: 0000000000013e40 Aug 16 10:14:13 dload04 kernel: [38984.349016] FS: 00007f6a7da576e0(0000) GS:ffffc90000600000(0000) knlGS:0000000000000000 Aug 16 10:14:13 dload04 kernel: [38984.349016] CS: 0010 DS: 0000 ES: 0000 CR0: 000000008005003b Aug 16 10:14:13 dload04 kernel: [38984.349016] CR2: 00007f20e8604cf0 CR3: 00000000bde80000 CR4: 00000000000006a0 Aug 16 10:14:13 dload04 kernel: [38984.349016] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000 Aug 16 10:14:13 dload04 kernel: [38984.349016] DR3: 0000000000000000 DR6: 00000000ffff0ff0 DR7: 0000000000000400 Aug 16 10:14:13 dload04 kernel: [38984.349016] Call Trace: Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffff8104a52c>] ? __cond_resched+0x1c/0x50 Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffff814da3c9>] ? _spin_lock+0x9/0x10 Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffffa01ae56c>] ? fscache_withdraw_cache+0x10c/0x3d0 [fscache] Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffff810cff72>] ? release_pages+0x1d2/0x210 Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffff814da3c9>] ? _spin_lock+0x9/0x10 Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffffa023b950>] ? cachefiles_daemon_unbind+0xc0/0xf0 [cachefiles] Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffffa023ba27>] ? cachefiles_daemon_release+0x37/0x110 [cachefiles] Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffff811071de>] ? __fput+0xee/0x210 Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffff81107587>] ? fput+0x17/0x20 Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffff81103f28>] ? filp_close+0x58/0x90 Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffff81051dac>] ? put_files_struct+0xcc/0xe0 Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffff81051e10>] ? exit_files+0x50/0x60 Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffff810529df>] ? do_exit+0x17f/0x800 Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffff814dcd1e>] ? do_page_fault+0x1ae/0x350 Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffff8105309e>] ? do_group_exit+0x3e/0xb0 Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffff81053122>] ? sys_exit_group+0x12/0x20 Aug 16 10:14:13 dload04 kernel: [38984.349016] [<ffffffff8100be82>] ? system_call_fastpath+0x16/0x1b Aug 16 10:14:16 dload04 varnishd[5325]: Child (5326) not responding to ping, killing it. The patch referenced here: http://bugzilla.kernel.org/show_bug.cgi?id=13469 Was applied prior to the build. Thank you, Joshua Assigned to David. This appears to still be an issue in kernel 2.6.38.8. I can reproduce this with an NFS backend, using cachefilesd to register a cache on both XFS and EXT3 file systems. See attached kernel log for details, and feel free to request additional details. Created attachment 63212 [details]
2.6.38.8 kernel log with null pointer dereference
If this is still seen with modern kernels please re-open |