Bug 7798
Summary: | [sparc64] nfsd randomly crashes | ||
---|---|---|---|
Product: | File System | Reporter: | JKB (mt1) |
Component: | NFS | Assignee: | Trond Myklebust (trondmy) |
Status: | REJECTED DUPLICATE | ||
Severity: | high | ||
Priority: | P2 | ||
Hardware: | i386 | ||
OS: | Linux | ||
Kernel Version: | 2.6.19.1 | Subsystem: | |
Regression: | --- | Bisected commit-id: |
Most recent kernel where this bug did *NOT* occur: I don't know. 2.6.15.1 works fine for me but I don't have test servers enough to test ;-) Distribution: Debian/Etch Hardware Environment: UltraSPARC 60 SMP (two UltraSPARC-II/450 MHz, 1GB) Software Environment: 2.6.19.1 Linux kernel, nfs-kernel-daemon Problem Description: with high nfs load, kernel does oops like this : Dec 20 00:06:12 rayleigh kernel: Unable to handle kernel NULL pointer dereferenc e Dec 20 00:06:12 rayleigh kernel: tsk->{mm,active_mm}->context = 000000000000133b Dec 20 00:06:12 rayleigh kernel: tsk->{mm,active_mm}->pgd = fffff8008790c000 Dec 20 00:06:12 rayleigh kernel: \|/ ____ \|/ Dec 20 00:06:12 rayleigh kernel: "@'/ .. \`@" Dec 20 00:06:12 rayleigh kernel: /_| \__/ |_\ Dec 20 00:06:12 rayleigh kernel: \__U_/ Dec 20 00:06:12 rayleigh kernel: nfsd(3561): Oops [#4] Dec 20 00:06:12 rayleigh kernel: TSTATE: 0000004480009602 TPC: 000000000048c504 TNPC: 000000000048c508 Y: 00000000 Not tainted Using defaults from ksymoops -t elf32-sparc -a sparc Dec 20 00:06:12 rayleigh kernel: g0: ffffffff80000000 g1: 0000000000000006 g2: 0 000000000004000 g3: 000000000000f000 Dec 20 00:06:12 rayleigh kernel: g4: fffff800807175a0 g5: fffff8007fc1dbc0 g6: f ffff800a25bc000 g7: 0000000000000006 Dec 20 00:06:12 rayleigh kernel: o0: 0000000000000000 o1: fffff800800dc0c8 o2: 0 000000000000003 o3: 0000000000000010 Dec 20 00:06:12 rayleigh kernel: o4: 000000000070f3d0 o5: fffff800800dc0e8 sp: f ffff800a25bee51 ret_pc: 000000000048c544 Dec 20 00:06:12 rayleigh kernel: l0: fffff800800dc0c0 l1: 000000000070f280 l2: 0 00000000070f3c0 l3: 00000000101e7678 Dec 20 00:06:12 rayleigh kernel: l4: 00000000007c7800 l5: 0000000000000000 l6: f ffff80087cc6000 l7: 0000000000000150 Dec 20 00:06:12 rayleigh kernel: i0: 0000000000000000 i1: fffff800800e2300 i2: 0 000000000000003 i3: 0000000000000010 Dec 20 00:06:12 rayleigh kernel: i4: fffff800a25bf8a0 i5: fffff800a2561a98 i6: f ffff800a25bef11 i7: 00000000101d2c2c >>PC; 0048c504 <put_page+4/100> <===== >>o4; 0070f3d0 <contig_page_data+150/9c0> >>ret_pc; 0048c544 <put_page+44/100> >>l1; 0070f280 <contig_page_data+0/9c0> >>l2; 0070f3c0 <contig_page_data+140/9c0> >>l3; 101e7678 <__crc_tcf_register_action+d04ea/377729> >>l4; 007c7800 <__log_buf+7e90/8000> >>i7; 101d2c2c <__crc_tcf_register_action+bba9e/377729> Dec 20 00:06:12 rayleigh kernel: Caller[00000000101d2c2c]: nfsd_read_actor+0x74/ 0x140 [nfsd] Dec 20 00:06:12 rayleigh kernel: Caller[0000000000484700]: do_generic_mapping_re ad+0x108/0x4c0 Dec 20 00:06:12 rayleigh kernel: Caller[0000000000484afc]: generic_file_sendfile +0x44/0x60 Dec 20 00:06:12 rayleigh kernel: Caller[00000000101d08bc]: nfsd_vfs_read+0x3a4/0 x3e0 [nfsd] Dec 20 00:06:12 rayleigh kernel: Caller[00000000101d0e40]: nfsd_read+0xa8/0xc0 [ nfsd] Dec 20 00:06:12 rayleigh kernel: Caller[00000000101d8fa0]: nfsd3_proc_read+0xa8/ 0x140 [nfsd] Dec 20 00:06:12 rayleigh kernel: Caller[00000000101cc2c8]: nfsd_dispatch+0x90/0x 220 [nfsd] Dec 20 00:06:12 rayleigh kernel: Caller[000000001018a56c]: svc_process+0x454/0x7 e0 [sunrpc] Dec 20 00:06:12 rayleigh kernel: Caller[00000000101ccabc]: nfsd+0x184/0x300 [nfs d] Dec 20 00:06:12 rayleigh kernel: Caller[0000000000417b50]: kernel_thread+0x38/0x 60 Dec 20 00:06:12 rayleigh kernel: Caller[0000000010189a20]: __svc_create_thread+0 x148/0x220 [sunrpc] Dec 20 00:06:12 rayleigh kernel: Instruction DUMP: 01000000 01000000 9de3bf40 <c45e0000> 03000010 a0100018 92062008 84088001 0ac08037 Trace; 101d2c2c <__crc_tcf_register_action+bba9e/377729> Trace; 00484700 <do_generic_mapping_read+100/4c0> Trace; 00484afc <generic_file_sendfile+3c/60> Trace; 101d08bc <__crc_tcf_register_action+b972e/377729> Trace; 101d0e40 <__crc_tcf_register_action+b9cb2/377729> Trace; 101d8fa0 <__crc_tcf_register_action+c1e12/377729> Trace; 101cc2c8 <__crc_tcf_register_action+b513a/377729> Trace; 1018a56c <__crc_tcf_register_action+733de/377729> Trace; 101ccabc <__crc_tcf_register_action+b592e/377729> Trace; 00417b50 <kernel_thread+30/60> Trace; 10189a20 <__crc_tcf_register_action+72892/377729> Code; 0048c4f8 <lru_add_drain_all+18/20> 00000000 <_PC>: Code; 0048c4f8 <lru_add_drain_all+18/20> 0: 01 00 00 00 nop Code; 0048c4fc <lru_add_drain_all+1c/20> 4: 01 00 00 00 nop Code; 0048c500 <put_page+0/100> 8: 9d e3 bf 40 save %sp, -192, %sp Code; 0048c504 <put_page+4/100> c: c4 5e 00 00 inconnu Code; 0048c508 <put_page+8/100> 10: 03 00 00 10 sethi %hi(0x4000), %g1 Code; 0048c50c <put_page+c/100> 14: a0 10 00 18 mov %i0, %l0 Code; 0048c510 <put_page+10/100> 18: 92 06 20 08 add %i0, 8, %o1 Code; 0048c514 <put_page+14/100> 1c: 84 08 80 01 and %g2, %g1, %g2 Code; 0048c518 <put_page+18/100> 20: 0a c0 80 37 inconnu This kernel was built with gcc-4.1 (from debian). But I have tried to build the same kernel with gcc-3.4 and I obtain a similar oops (not exactly the same). I have replaced nfs-kernel-daemon by nfs-user-deamon and I can use this server. Regards, JKB