Bug 7507
Summary: | pata_it821x: system freezes | ||
---|---|---|---|
Product: | IO/Storage | Reporter: | Martin von Gagern (Martin.vGagern) |
Component: | Serial ATA | Assignee: | Alan (alan) |
Status: | REJECTED UNREPRODUCIBLE | ||
Severity: | normal | CC: | alan, bunk, htejun, kernel, protasnb, vonami |
Priority: | P2 | ||
Hardware: | i386 | ||
OS: | Linux | ||
Kernel Version: | 2.6.20-rc5 | Subsystem: | |
Regression: | --- | Bisected commit-id: | |
Attachments: |
manual screen copy after modprobe pata_it821x noraid=1
kernel messages from 2.6.22 for boot and modprobe pata_it821x noraid=1 console log of 2.6.23-rc2 for insmod pata_it821x.ko noraid=1 dmesg after booting with "init=/bin/bash" and pata_it821x 2.6.23-rc6 with libata.pata_dma=0 for pata_it821x 2.6.23-rc6 with debug output added to pata_it821x |
Description
Martin von Gagern
2006-11-12 23:47:39 UTC
Created attachment 9481 [details]
manual screen copy after modprobe pata_it821x noraid=1
I wrote down the messages on a sheet of paper and just typed them into a file.
There might be minor errors, but basically this should be an (almost) exact
screen dump.
Still freezes for 2.6.20-rc5 Martin, have you tried with 2.6.22-rc7 or current git? Thanks. Created attachment 11978 [details] kernel messages from 2.6.22 for boot and modprobe pata_it821x noraid=1 (In reply to comment #3) > Martin, have you tried with 2.6.22-rc7 or current git? Just tried 2.6.22, problem still exists. Attaching a dmesg and console copy. Timestamps 161.051733 through 161.714039 are printed to console directly after the call to "modprobe pata_it821x noraid=1", with the cursor stying in the last line just after the colon. Using a serial console allowed me to copy the output this time. pata_it821x and it821x received related updates recently. Can you please give a shot at 2.6.23-rc2? Thanks. Created attachment 12332 [details] console log of 2.6.23-rc2 for insmod pata_it821x.ko noraid=1 There are some changes in my setup for this run. The kernel is pretty much a "make defconfig". I configured an initramfs containing a busybox, so I could make my PIIX drivers modules and run these tests without loading them. (In reply to comment #5) > Can you please give a shot at 2.6.23-rc2? Done that, but the output sadly looks pretty much the same, except for some changed numbers due to the missing driver for my PIIX SATA controller. Not even the warning message I got in bug 7506 comment 9 now. It's very likely that I ran into this bug. My hardware is similar: ASUS P5WD2 motherboard + onboard ITE8211F controller. Just tested 2.6.23-rc5 and the system locked up at boot. After booting with "init=/bin/bash" I was able to grap dmesg (attached below). The system freezed after ~30 seconds but that was enough to save dmesg. Created attachment 12693 [details]
dmesg after booting with "init=/bin/bash" and pata_it821x
(In reply to comment #7) > It's very likely that I ran into this bug. Comparing your log to mine I see it recognizes the disk partitions and even mounts one file system for you. This is a lot farther than I ever got. Even more interesting for me might be the indication where mine stopped: while recognizing the partitions. My disks were part of a software raid once, and probably haven't been repartitioned since then, so the partition table won't work well for a single disk. I guess I'll repartition them using the old it821x driver and see whether this fixes the problem for pata_it821x. I'll let you know the outcome once I can reboot. (In reply to comment #9) > I guess I'll repartition them using the old it821x driver and see whether > this fixes the problem for pata_it821x. No luck, sadly. (In reply to comment #9) > Comparing your log to mine I see it recognizes the disk partitions and even > mounts one file system for you. This is a lot farther than I ever got. Actually, no. The hard drive is connected to the onboard Intel ICH7 family IDE controller which is handled by the ata_piix driver - see the lines 47.502632 - 47.905235. It works fine. My DVD drive is attached to the ITE8211F controller (lines 48.233996 - 48.930191). When I enable the controller in BIOS _and_ attach a CD or DVD drive to it I get lockups (note: even without mounting CD/DVD disks). With no devices attached to it the system works fine. Unfortunately Ubuntu is shipped with pata_it821x enabled by default so I get all sorts of headaches. Good news! I just tried 2.6.23-rc6 and it works. I guess the patch from Jeff Norden (found in 'git log -p drivers/ata/pata_it821x.c') fixed it. Also there is a thread at https://bugzilla.redhat.com/show_bug.cgi?id=242229 where all debugging goes. Martin, could you test 2.6.23-rc6? (In reply to comment #12) > Martin, could you test 2.6.23-rc6? Tried that, same result than before. :( > https://bugzilla.redhat.com/show_bug.cgi?id=242229 Looks pretty FC-specific most of the time. I'll try this pata_dma param patch tomorrow if I find the time, to see whether DMA is the problem for me here as well. I would think so, as bug #7506 seems to be related to DMA as well. Created attachment 12811 [details]
2.6.23-rc6 with libata.pata_dma=0 for pata_it821x
OK, it definitely is a DMA issue, although not fixed in vanilla rc6.
I applied linux-2.6-libata-pata_dma-param.patch to my kernel and booted with libata.pata_dma=0. With this setup I could load pata_it821x successfully, console log attached.
Created attachment 12816 [details] 2.6.23-rc6 with debug output added to pata_it821x I read some more of that Red Hat bugzilla and added some debugging code to the module. See https://bugzilla.redhat.com/show_bug.cgi?id=242229#c47 for my post of this log and the following comment for the source modifications I applied. What can be seen from this log here is that it821x_passthru_bmdma_start gets called and returns, then the system hangs for 30 seconds, then the freeze routine gets called, issues iowrite8, and then the system hangs for good. Hopefully more experienced people can see even more. Tell me when any added debugging code could give you further insights here. Any updates on this problem please. Thanks. As mentioned in bug 7506 comment 15, I figured out that one of the two drives I was using in this setup was probably defective. So while at last test (around 2.6.25.0 iirc) the system would still crash when the driver was loaded with this defective device installed, this might say nothing about working hardware. Sorry for not performing tests with single drives before. |