Bug 3062 - aic7xxx doesn't support ACPI suspend
Summary: aic7xxx doesn't support ACPI suspend
Status: CLOSED PATCH_ALREADY_AVAILABLE
Alias: None
Product: SCSI Drivers
Classification: Unclassified
Component: Other (show other bugs)
Hardware: i386 Linux
: P2 normal
Assignee: Hannes Reinecke
URL:
Keywords:
Depends on:
Blocks: 7216
  Show dependency tree
 
Reported: 2004-07-13 18:14 UTC by Nathan Bryant
Modified: 2007-10-22 01:11 UTC (History)
5 users (show)

See Also:
Kernel Version: 2.6.8-rc2
Subsystem:
Regression: ---
Bisected commit-id:


Attachments
aic7xxx_swsusp.patch (800 bytes, patch)
2004-07-16 15:01 UTC, Nathan Bryant
Details | Diff
dmesg (922.34 KB, text/plain)
2007-05-29 11:02 UTC, Gabriel C
Details
lspci (22.75 KB, text/plain)
2007-05-29 11:04 UTC, Gabriel C
Details
aic7xxx-add-suspend-resume (19.93 KB, patch)
2007-10-22 01:09 UTC, Hannes Reinecke
Details | Diff

Description Nathan Bryant 2004-07-13 18:14:15 UTC
Distribution:
Fedora Core 2 with kernel 2.6.7-1.486 RPM from http://people.redhat.com/~arjanv
Hardware Environment:
Gigabyte 8ITXE motherboard, Pentium 4, Adaptec 29160N
Software Environment:
(see above)
Problem Description:

Due to problems in the aic7xxx driver (its I/O task refuses to go to sleep),
system does not suspend to ACPI S3 - it tries, gives up, and resumes normal
operation.
echo 3 > /proc/acpi/sleep produces the following:
PM: Preparing system for suspend
Stopping tasks: ==========
 stopping tasks failed (1 tasks remaining)
Restarting tasks...<6> Strange, ahc_dv_0 not stopped
 done


Steps to reproduce:
echo 3 > /proc/acpi/sleep produces the following:
Comment 1 Nathan Bryant 2004-07-14 10:20:18 UTC
As Arjan's 2.6.7-1.486 RPM is based on 2.6.8-RC1, the cause might be the following:

http://marc.theaimsgroup.com/?l=linux-scsi&m=108306129820558&w=2

This change is present in 2.6.8-RC1 mainline
Comment 2 Nathan Bryant 2004-07-16 15:01:10 UTC
Created attachment 3385 [details]
aic7xxx_swsusp.patch

If I apply this patch (attached), I am now able to enter ACPI S3 suspend, but
resume is still broken: The first time I try to  access the hard disk after
resuming, the kernel prints "Kernel panic: Loop 1"
Comment 3 Nathan Bryant 2004-07-22 19:23:30 UTC
Comment on attachment 3385 [details]
aic7xxx_swsusp.patch

I'm floating a patch on the mailing lists; see
http://marc.theaimsgroup.com/?l=linux-scsi&m=109054640414945&w=2
Comment 4 Diego Calleja 2006-09-28 09:15:37 UTC
has this been fixed in recent kernels?
Comment 5 Nathan Bryant 2006-09-28 09:27:19 UTC
Not that I know of. Last time I checked, the scsi midlayer in general is also
missing suspend/resume support. Specifically, you need to resolve all
outstanding transactions (quiesce) every device on the SCSI bus before you can
put the bus adapter itself to sleep, so you have to implement the suspend/resume
driver model in sd, scd drivers et cetera. I made patches for this and for
aic7xxx back in the 2.6.12 timeframe but I was having trouble getting the
full-time linux-scsi maintainers to take an interest in them. If someone else is
interested in advocating for this functionality and resurrecting the patches, I
can probably dig my work out and freshen it up.
Comment 6 Nathan Bryant 2006-09-28 09:42:01 UTC
One also needs to write code in the SD driver that handles spin-up/spin-down in
a clean way. I got this working on my specific hard drive but if I remember
correctly, some people were worried about the effect of the code on all
configurations. I think I was sending an explicit spinup command. Getting
everyone to agree upon logic to cleanly reinitialize, and spin up if necessary,
a SCSI hard drive after resume is probably the major remaining issue here. Of
course resume doesn't work at all right now so anything we do is probably better
than nothing ;)
Comment 7 Joachim Reichelt 2007-04-19 01:53:43 UTC
I got a crash on resume using SUSE 10.x having a scsi-only system on my 
aha2940UW, kernel 2.6.18
There is no disk activity after the finished resume op.
X11 etc. are just as before suspend.
Then the aic7xxx locks up with a lot of messages, so fast that I cannot read.
Comment 8 Gabriel C 2007-05-29 11:01:07 UTC
err PING ?

This is still broken in latest 2.6.22-rc3 ( with or without scsi-misc git
patches ) on my Dell Precision 530 MT ( suspend to whatever is broken ) I always
get the same error(s) after resume.

I attach my lspci output and I managed to log all the errors the card dumps
after resume so I attach the broken dmesg as well.

Does someone care at all about this issues ?

Comment 9 Gabriel C 2007-05-29 11:02:56 UTC
Created attachment 11611 [details]
dmesg
Comment 10 Gabriel C 2007-05-29 11:04:07 UTC
Created attachment 11612 [details]
lspci
Comment 11 Rafael J. Wysocki 2007-05-30 11:32:59 UTC
Unfortunatly, this seems to require someone with deep SCSI knowledge to fix a
couple of drivers.

That said, we have suspend support in SATA drivers, so the SCSI midlayer must
have been updated.
Comment 12 Nathan Bryant 2007-05-30 11:52:52 UTC
Some work does appear to have been done on the scsi midlayer, but I don't know
if this has been merged yet or if it's in a state that works for more than just
SATA. Of course, the SCSI midlayer needs to be sorted out before it makes sense
to merge any changes for SCSI HBA drivers. See here:

http://lwn.net/Articles/157057/
Comment 13 Gabriel C 2007-05-31 04:01:34 UTC
Looks like some parts are merged other not ( but maybe done in some other way ?! ).

I guess a good idea is to ask the patch author :)

Anyway if someone has some patch for this issues , whatever experimental or not
I can test it.
Comment 14 Natalie Protasevich 2007-09-22 18:38:35 UTC
Any updates on this bug, do the problems still exist with latest kernel?
Maybe Nathan and Rafael can outline what needs to be done still and whether those are "projects" that need to be announced so someone (if not you) takes ownership.
Thanks. 
Comment 15 Gabriel C 2007-09-22 18:47:01 UTC
Yes the problems still exists in 2.6.23-rc7 and latest -mm kernel.
Comment 16 Nathan Bryant 2007-09-24 08:49:07 UTC
Problem still exists but I've taken this bug as far as I can on my own...
Comment 17 Hannes Reinecke 2007-10-22 01:09:15 UTC
Created attachment 13227 [details]
aic7xxx-add-suspend-resume

Patch to add suspend/resume support to aic7xxx.
Comment 18 Hannes Reinecke 2007-10-22 01:11:10 UTC
With the above patch it should work. We only have to take care to save some extra PCI registers (which the PCI core doesn't know about). And as we're lazy we're doing a full SCSI bus reset on resume, so we don't have to worry about the internal state of the aic7xxx anyway.
Patch has been tested by Jens Axboe and accepted in scsi-misc, so we can close this.

Note You need to log in before you can comment on or make changes to this bug.