Bug 8978
Summary: | Strange locking problems when hw RAID in Degraded status | ||
---|---|---|---|
Product: | File System | Reporter: | Peter (tuharsky) |
Component: | ext3 | Assignee: | Andrew Morton (akpm) |
Status: | REJECTED INSUFFICIENT_DATA | ||
Severity: | normal | CC: | alan, htejun, neilb, protasnb |
Priority: | P1 | ||
Hardware: | All | ||
OS: | Linux | ||
Kernel Version: | 2.6.21.7 | Subsystem: | |
Regression: | --- | Bisected commit-id: | |
Attachments: | syslog |
Description
Peter
2007-09-03 22:53:42 UTC
Is there nothing of interest in the logs? Well, RAID problems are back, so debugging possible. I'll try latest kernel. The buggy one is 2.6.21.7 Kernel dosen't tell anything to kern.log when the controller starts beeping. The startup (dmesg) is here -see attachment 12690 [details]
Created attachment 12691 [details]
syslog
Well, these samba oplock breaks are suspicious.
However, I cannot guarantee, that exactly these oplock errors are the merit. I looked at old logs and some oplock problems were there before, however they seemed a bit different. That was Debian Sarge with older samba release, so the error codes and syntax could have changed.. I don't understand what I'm seeing in your logs. How come there's a pile of ata errors coming out when you say the problem is with the aacraid controller? These are subject of separate bug 8979, that is resolved aj a problem of old smartd version. The kernel dosen't show up anything interesting when RAID enters "Degraded" state. Peter, any updates? Have you tried other kernel levels, newer ones or falling back to the one that used to work for you? I won't be surprised if the controller itself was going bad. Well, until the bug 9017 persists, it's quite impossible to debug this problem, because the symptoms are pretty same (file locking problems). After the bug 9017 resolved, I could try removing a harddrive from raid and see what will happen with recent kernel, but it dosen't make any sense any sooner. |