Bug 5117
Summary: | Panic when accessing scsi-tapedrives with 4G-remap | ||
---|---|---|---|
Product: | IO/Storage | Reporter: | Johan van Baarlen (JF) |
Component: | SCSI | Assignee: | Mike Anderson (andmike) |
Status: | REJECTED UNREPRODUCIBLE | ||
Severity: | blocking | CC: | protasnb |
Priority: | P2 | ||
Hardware: | i386 | ||
OS: | Linux | ||
Kernel Version: | 2.6.12.5 | Subsystem: | |
Regression: | --- | Bisected commit-id: |
Description
Johan van Baarlen
2005-08-23 12:53:34 UTC
Begin forwarded message: Date: Tue, 23 Aug 2005 12:53:38 -0700 From: bugme-daemon@kernel-bugs.osdl.org To: bugme-new@lists.osdl.org Subject: [Bugme-new] [Bug 5117] New: Panic when accessing scsi-tapedrives with 4G-remap http://bugzilla.kernel.org/show_bug.cgi?id=5117 Summary: Panic when accessing scsi-tapedrives with 4G-remap Kernel Version: 2.6.12.5 Could this purely be a highmem problem? Is the zero-copy DMA feature of st.c known to work OK with x86 highmem? Reply-To: osst@riede.org On 08/25/2005 08:24:17 PM, Andrew Morton wrote: > > Subject: [Bugme-new] [Bug 5117] New: Panic when accessing scsi-tapedrives > with 4G-remap > > > http://bugzilla.kernel.org/show_bug.cgi?id=5117 > > Summary: Panic when accessing scsi-tapedrives with 4G-remap > Kernel Version: 2.6.12.5 > > > Could this purely be a highmem problem? Is the zero-copy DMA feature of > st.c known to work OK with x86 highmem? I don't know about Kai, but I don't have the hardware to test that... Sorry, Willem Riede. Reply-To: Kai.Makisara@kolumbus.fi On Thu, 25 Aug 2005, Andrew Morton wrote: > > > Begin forwarded message: > > Date: Tue, 23 Aug 2005 12:53:38 -0700 > From: bugme-daemon@kernel-bugs.osdl.org > To: bugme-new@lists.osdl.org > Subject: [Bugme-new] [Bug 5117] New: Panic when accessing scsi-tapedrives with 4G-remap > > > http://bugzilla.kernel.org/show_bug.cgi?id=5117 > > Summary: Panic when accessing scsi-tapedrives with 4G-remap > Kernel Version: 2.6.12.5 > > > Could this purely be a highmem problem? Is the zero-copy DMA feature of > st.c known to work OK with x86 highmem? It is _not_ known _not_ to work ;-) I.e., I have received neither any success nor any failure reports. I have not tested it because I don't have any machine with enough memory (and I have not hacked a kernel to use highmem with 512 MB of memory). I hope someone seeing this thread and using highmem with tape can comment on this subject. Reply-To: Kai.Makisara@kolumbus.fi On Fri, 26 Aug 2005, Kai Makisara wrote: > On Thu, 25 Aug 2005, Andrew Morton wrote: > > > > > > > Begin forwarded message: > > > > Date: Tue, 23 Aug 2005 12:53:38 -0700 > > From: bugme-daemon@kernel-bugs.osdl.org > > To: bugme-new@lists.osdl.org > > Subject: [Bugme-new] [Bug 5117] New: Panic when accessing scsi-tapedrives with 4G-remap > > > > > > http://bugzilla.kernel.org/show_bug.cgi?id=5117 > > > > Summary: Panic when accessing scsi-tapedrives with 4G-remap > > Kernel Version: 2.6.12.5 > > > > > > Could this purely be a highmem problem? Is the zero-copy DMA feature of > > st.c known to work OK with x86 highmem? > > It is _not_ known _not_ to work ;-) I.e., I have received neither any > success nor any failure reports. I have not tested it because I don't have > any machine with enough memory (and I have not hacked a kernel to use > highmem with 512 MB of memory). I hope someone seeing this thread and > using highmem with tape can comment on this subject. > OK. I booted my test i386 machine with highmem=384m and did some tests. I also added a counter to st.c to count the highmem pages used for zero-copy DMA. I could not get dd to use highmem but with tar that succeeded. No extra messages were found in syslog during these tests. BUT, at the same time I remembered that the system in the Bugzilla report was Athlon64 running FC3 x86_64. The x86_64 kernel does not have highmem. Reply-To: osst@riede.org On 08/28/2005 06:40:04 AM, Kai Makisara wrote: > On Fri, 26 Aug 2005, Kai Makisara wrote: > > > On Thu, 25 Aug 2005, Andrew Morton wrote: > > > > > Could this purely be a highmem problem? Is the zero-copy DMA feature of > > > st.c known to work OK with x86 highmem? > > > > It is _not_ known _not_ to work ;-) I.e., I have received neither any > > success nor any failure reports. I have not tested it because I don't have > > > any machine with enough memory (and I have not hacked a kernel to use > > highmem with 512 MB of memory). I hope someone seeing this thread and > > using highmem with tape can comment on this subject. > > > OK. I booted my test i386 machine with highmem=384m and did some tests. I > also added a counter to st.c to count the highmem pages used for zero-copy > DMA. I could not get dd to use highmem but with tar that succeeded. No > extra messages were found in syslog during these tests. > > BUT, at the same time I remembered that the system in the Bugzilla report > was Athlon64 running FC3 x86_64. The x86_64 kernel does not have highmem. True. I do have a machine running FC4 x86_64, but it doesn't have 4G of memory, so it doesn't do the 4G-remap that the bug report refers to. Don't know if there is any way to make it do so (other than buy more memory). Regards, Willem Riede. Reply-To: arjan@infradead.org > > OK. I booted my test i386 machine with highmem=384m and did some tests. I > > also added a counter to st.c to count the highmem pages used for zero-copy > > DMA. I could not get dd to use highmem but with tar that succeeded. No > > extra messages were found in syslog during these tests. > > > > BUT, at the same time I remembered that the system in the Bugzilla report > > was Athlon64 running FC3 x86_64. The x86_64 kernel does not have highmem. > > True. I do have a machine running FC4 x86_64, but it doesn't have 4G of > memory, so it doesn't do the 4G-remap that the bug report refers to. > > Don't know if there is any way to make it do so (other than buy more memory). note that some (tyan) amd64 motherboards have a really broken bios wrt remap, and the only way to get those systems stable is to disable the memory remap in that bios. If you don't all kinds of funky stuff can and does happen. Please don't get me started on broken bioses... I had a lot of fun with 4G on a Asus A8V (Via K8T800), without remap, 3.5G are available and everything works - with remap, all PCI-cards and onboard devices stop functioning. Nice. The A8N is much better, at least all onboard devices work without a hitch :-) But back to the topic at hand: I've just flashed it to 1008-bios, and you can probably guess it did make a difference - now I can't access anything on the second scsi-controller anymore. cat /proc/scsi/scsi shows all devices, but as soon as I try to do mt -f /dev/st1 status (or /dev/st2), the driver seems to lock up and only a full reboot gets it going again. Without remap, everything still works fine. Later today I'm gonna try some other controllers and initially only test one at a time, to see if I can pinpoint the troublemaker a bit more precise. Been a while, but we're up to 2.6.13.2 by now... Didn't see a panic on this kernel, but something is definitely not right. The driver locks up completely way too often - doing a simple 'mt -f /dev/st0 status' right after a warm or cold reboot is enough to send it into non-responsive hell, no way to break, no errormessages, no nothing. Rest of the system is running, I can still logon, access network, disks, etcetera, just no responses on any of the controllers. Happens without the 3G-remap too, so probably a slightly different issue, but still a very bad one. Does anybody have a clue how to start debugging this? This renders the system unusable, which is a very bad thing. If more info required, PLEASE tell me how to produce it so there is at least a glimpse of resolving this matter. 2.6.13.3, still happening... This is getting critical, I need that machine stable! Anybody got a clue? As far as I am aware this really isn't a sym2 problem. Reassigning to tape What is the current state of this problem, any update please. Thanks. Closing the bug, no recent activity. Johan if you still have the problem with recent kernels please reopen. |