Bug 9241
Summary: | POSIX Access Control Lists cause bogus file system check errors | ||
---|---|---|---|
Product: | File System | Reporter: | Josh Rosen (bjrosen) |
Component: | ext3 | Assignee: | Andrew Morton (akpm) |
Status: | REJECTED UNREPRODUCIBLE | ||
Severity: | high | CC: | agruen, zhseal0 |
Priority: | P1 | ||
Hardware: | All | ||
OS: | Linux | ||
Kernel Version: | 2.6.23.1 | Subsystem: | |
Regression: | --- | Bisected commit-id: | |
Attachments: |
Snapshot of the error screen
Working .config file .config for unbootable kernel |
Description
Josh Rosen
2007-10-27 18:17:25 UTC
Created attachment 13292 [details]
Snapshot of the error screen
Created attachment 13293 [details]
Working .config file
A kernel built from this kernel boots.
Created attachment 13294 [details]
.config for unbootable kernel
A kernel built with this .config file won't boot.
You're saying that this problem does not occur in 2.6.23-6.fc8 and that it does occur in 2.6.23.1? That seems a bit odd. Maybe RH changed something, and changed their userspace to suit, but that doesn't sound likely. Anyway, hopefully Andreas will know what's going on here. It's possible that 2.6.23-6.fc8 was based on a release candidate and not the final 2.6.23. The problem occurs in 2.6.23 as well as 2.6.23.1, the 2.6.22.x kernels work fine so there must have been a change that was introduced in 2.6.23. The failure occurs while checking filesystems in the initrd, while the filesystem check in the running system shows no problems, right? Which messages does the failing e2fsck spit out? I assume that it's the same fsck version in both cases? Yes the failure occurs in the initrd, there are no failures when I run e2fsck. The only error messages that I have from the initrd are what you see on the snapshot that I attached. The fsck error message(s) must be on the part of the log that scrolled out. Can you attach a serial console to preserve that output, or add a sleep after the fsck so that you'll see those messages? Please explain the procedure for setting up a serial console, give me step by step instruction including the name of a terminal emulator (it's been years since I've used one). I assume I need a to connect a serial cable to another box, set up a terminal emulator of some sort, and give some switch to the boot loader. Is there an easier way? (over ethernet perhaps?). It's a reporting problem, the error message fails to give any information about which partition has the problem or what the problem is. E2fsck reported that all of the partitions were clean when I ran it without switches, however when I ran it with the -f switch it found problems with some of the attribute counts. After I fixed the problems I was able to boot. So the main problem isn't with 2.6.23, it found the file system errors, it's with 2.6.22.x and earlier which didn't find a problem. The problem with 2.6.23 is that it needs better error reporting. At the very least it should specify which partition has the file system problem, it would be better if it also specfied what the problems are. I failed to reproduce, sorry. I'm not running Fedora and I'm not using SELinux which exercises extended attributes quite a bit though, so it may just be that the error doesn't trigger here. (That's still quite unlikely since so far; there is only this one bug report about it.) Some testing advise: first, before running the suspect kernel version, make sure that the filesystem is consistent actually (e2fsck -f). Second, the initrd expectedly is doing nothing more than a filesystem check. If you don't know how to debug the problem directly in the initrd (serial console or simply by modifying the linuxrc script in the initrd, for example), you may as well boot into a rescue system and run e2fsck on the filesystem(s) in question from there. Can the bug still be reproduced with 2.6.23.x? An interesting data point in addition to the e2fsck error messages would be the inode size used (i.e., the output of ``/sbin/tune2fs -l /dev/sda4 | grep "Inode size"''). Finally, in case we won't manage to clearly track this issue down here, a more appropriate place to report the problem for getting more debugging help would be the Fedora community, who know about the specifics of the Fedora initrd, for example. |