Bug 14681

Summary: Unable to handle paging request at 0000341b.
Product: File System Reporter: Niels Thykier (niels)
Component: XFSAssignee: XFS Guru (xfs-masters)
Status: CLOSED UNREPRODUCIBLE    
Severity: normal CC: alan, hch, jrnieder
Priority: P1    
Hardware: All   
OS: Linux   
Kernel Version: 2.6.31-1-686 Subsystem:
Regression: No Bisected commit-id:
Attachments: dmesg from 2.6.31-1-686
Other info about the kernel.
dmesg from 2.6.30-2-686

Description Niels Thykier 2009-11-24 06:41:51 UTC
Created attachment 23915 [details]
dmesg from 2.6.31-1-686

Hi

So far I have only experienced it when I build eclipse in a chroot'ed system. When I just leave the computer to do its "normal" job (e.g. hosting git a couple of git repositories) it does not happen.

~Niels
Comment 1 Niels Thykier 2009-11-24 06:46:50 UTC
Created attachment 23916 [details]
Other info about the kernel.

Hi

I have attached some other information that my distribution also collects on a bug report in case it is important. This bug (as I forgot to mention in the original message) is a forward of the Debian bug 554272 [1].

~Niels

[1] http://bugs.debian.org/cgi-bin/bugreport.cgi?bug=554272
Comment 2 Niels Thykier 2009-11-24 06:52:08 UTC
Created attachment 23917 [details]
dmesg from 2.6.30-2-686

I have attached the log from 2.6.30-2-686 where I originally experienced a problem (The log refers to it as a "dereference of NULL pointer" though), in case it helps you locate the issue.

The log here contains two separate "dereferences" from two separate build attempts.

~Niels
Comment 3 Christoph Hellwig 2010-02-09 22:24:50 UTC
Is this in any way reproducible?  I've never seen a NULL pointer dereference in generic_file_aio_read (which actually is common code used by most filesystems before)
Comment 4 Niels Thykier 2010-02-14 21:57:50 UTC
Hi

Personally I do find it strange that my other machine do not show any signs of this issue. My affected machine did however reliably (often, not always) reproduce it (or the NULL pointer issue with the previous kernel). Both my machines uses xfs, though my unaffected machine also have an lvm/lvm2 layer beneath the xfs.

Since I did not use my affected machine for anything, I decided to take it offline last week. Though I do not mind booting it up and see if I can still reproduce the issue, if you want that - also let me know if there is anything besides the dmesg log you want for debugging.

~Niels
Comment 5 Niels Thykier 2011-08-28 11:34:11 UTC
Hi,

I have decommission the affected machine and replaced it with a far better machine for my purpose.  My new machine has not suffered this issue or anything like it.
  I still have the old machine lying around if you need extra information; but I am willing to write this off as a "possible hardware defect" and "call it a day".

~Niels
Comment 6 Jonathan Nieder 2012-06-15 14:49:38 UTC
Christoph et al: any idea idea how to track down whether this is a
hardware problem or a symptom of some particular filesystem corruption?

Niels: am I correct in guessing you've already tried memtest86+ and
badblocks?
Comment 7 Niels Thykier 2012-06-15 15:19:13 UTC
Hi,

I cannot remember what I tested (or didn't test) about 3 years ago, so it is entirely possible I forgot to check one of those.
  I cannot reproduce it on any system I am currently using.  So to be honest, I am okay with this being closed this as "unreproducible" ... also partly because I am too lazy to reinstall an OS on it. :)

~Niels