Bug 34762 - highpoint Rocket Raid 622 presents no drives.
Summary: highpoint Rocket Raid 622 presents no drives.
Status: RESOLVED WILL_NOT_FIX
Alias: None
Product: IO/Storage
Classification: Unclassified
Component: Serial ATA (show other bugs)
Hardware: All Linux
: P1 normal
Assignee: Jeff Garzik
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2011-05-09 22:54 UTC by Peter Silva
Modified: 2012-08-23 15:47 UTC (History)
3 users (show)

See Also:
Kernel Version: 2.6.38 (ubuntu stock 11.04)
Subsystem:
Regression: No
Bisected commit-id:


Attachments
add-TTI-0622-to-ahci.patch (451 bytes, patch)
2011-05-10 09:23 UTC, Tejun Heo
Details | Diff
kern.log extract shows boot of kernel with initial patch applied. (223.73 KB, application/octet-stream)
2011-05-13 02:38 UTC, Peter Silva
Details
mounted /dev/sdg as /mnt, and ran bonnie. (1.44 KB, text/x-log)
2011-05-13 10:07 UTC, Peter Silva
Details
better kernel log. (123.58 KB, text/x-log)
2011-05-13 11:05 UTC, Peter Silva
Details

Description Peter Silva 2011-05-09 22:54:21 UTC
(result of email with tejun heo)

It is a eSATA III (6 GB/s) controller that does not present any disks.   I am willing to "play" with the controller and disks, and apply some kernel patches for testing purposes (on a testing kernel of some kind.) I would like to obtain a replacement for the proprietary blob driver (support ended at ubuntu 9.10.) as I am running linux 2.6.38 (ubuntu 11.04 stock kernel)

Just JBOD is fine, just want to ignore the RAID and use btrfs. 

root@alu:~# lspci -nn
00:00.0 Host bridge [0600]: Advanced Micro Devices [AMD] RS780 Host Bridge [1022:9600]
00:02.0 PCI bridge [0604]: Advanced Micro Devices [AMD] RS780 PCI to PCI bridge (ext gfx port 0) [1022:9603]
00:04.0 PCI bridge [0604]: Advanced Micro Devices [AMD] RS780 PCI to PCI bridge (PCIE port 0) [1022:9604]
00:06.0 PCI bridge [0604]: Advanced Micro Devices [AMD] RS780 PCI to PCI bridge (PCIE port 2) [1022:9606]
00:11.0 SATA controller [0106]: ATI Technologies Inc SB7x0/SB8x0/SB9x0 SATA Controller [AHCI mode] [1002:4391]
00:12.0 USB Controller [0c03]: ATI Technologies Inc SB7x0/SB8x0/SB9x0 USB OHCI0 Controller [1002:4397]
00:12.1 USB Controller [0c03]: ATI Technologies Inc SB7x0 USB OHCI1 Controller [1002:4398]
00:12.2 USB Controller [0c03]: ATI Technologies Inc SB7x0/SB8x0/SB9x0 USB EHCI Controller [1002:4396]
00:13.0 USB Controller [0c03]: ATI Technologies Inc SB7x0/SB8x0/SB9x0 USB OHCI0 Controller [1002:4397]
00:13.1 USB Controller [0c03]: ATI Technologies Inc SB7x0 USB OHCI1 Controller [1002:4398]
00:13.2 USB Controller [0c03]: ATI Technologies Inc SB7x0/SB8x0/SB9x0 USB EHCI Controller [1002:4396]
00:14.0 SMBus [0c05]: ATI Technologies Inc SBx00 SMBus Controller [1002:4385] (rev 3c)
00:14.1 IDE interface [0101]: ATI Technologies Inc SB7x0/SB8x0/SB9x0 IDE Controller [1002:439c]
00:14.2 Audio device [0403]: ATI Technologies Inc SBx00 Azalia (Intel HDA) [1002:4383]
00:14.3 ISA bridge [0601]: ATI Technologies Inc SB7x0/SB8x0/SB9x0 LPC host controller [1002:439d]
00:14.4 PCI bridge [0604]: ATI Technologies Inc SBx00 PCI to PCI Bridge [1002:4384]
00:14.5 USB Controller [0c03]: ATI Technologies Inc SB7x0/SB8x0/SB9x0 USB OHCI2 Controller [1002:4399]
00:18.0 Host bridge [0600]: Advanced Micro Devices [AMD] Family 10h Processor HyperTransport Configuration [1022:1200]
00:18.1 Host bridge [0600]: Advanced Micro Devices [AMD] Family 10h Processor Address Map [1022:1201]
00:18.2 Host bridge [0600]: Advanced Micro Devices [AMD] Family 10h Processor DRAM Controller [1022:1202]
00:18.3 Host bridge [0600]: Advanced Micro Devices [AMD] Family 10h Processor Miscellaneous Control [1022:1203]
00:18.4 Host bridge [0600]: Advanced Micro Devices [AMD] Family 10h Processor Link Control [1022:1204]
01:00.0 VGA compatible controller [0300]: nVidia Corporation GT215 [GeForce GT 240] [10de:0ca3] (rev a2)
01:00.1 Audio device [0403]: nVidia Corporation High Definition Audio Controller [10de:0be4] (rev a1)
02:00.0 RAID bus controller [0104]: HighPoint Technologies, Inc. Device [1103:0622] (rev 01)
02:00.1 IDE interface [0101]: Marvell Technology Group Ltd. Device [1b4b:91a4] (rev 11)
03:00.0 Ethernet controller [0200]: Atheros Communications AR8131 Gigabit Ethernet [1969:1063] (rev c0)
04:07.0 Multimedia video controller [0400]: Conexant Systems, Inc. CX23880/1/2/3 PCI Video and Audio Decoder [14f1:8800] (rev 05)
04:07.1 Multimedia controller [0480]: Conexant Systems, Inc. CX23880/1/2/3 PCI Video and Audio Decoder [Audio Port] [14f1:8801] (rev 05)
04:07.2 Multimedia controller [0480]: Conexant Systems, Inc. CX23880/1/2/3 PCI Video and Audio Decoder [MPEG Port] [14f1:8802] (rev 05)
04:07.4 Multimedia controller [0480]: Conexant Systems, Inc. CX23880/1/2/3 PCI Video and Audio Decoder [IR Port] [14f1:8804] (rev 05)
root@alu:~# 

I have agreement with supplier of my unit to RMA for an SI3132 if I cannot get 
this working, so would like to try for a week or two, but beyond that, will likely RMA.
Comment 1 Tejun Heo 2011-05-10 09:23:24 UTC
Created attachment 57122 [details]
add-TTI-0622-to-ahci.patch

Can you please try the attached patch and post the kernel boot log?

Thank you.
Comment 2 Tejun Heo 2011-05-10 09:25:07 UTC
Cc'ing Mark just in case.  I'm shooting in the dark and just added an unknown Highpoint PCI ID as an ahci device.  Mark, I know your contact with marvell is limited but if you know somebody there, please ask them to talk to us.  We really need to sort out this marvell thing.

Thank you.
Comment 3 Peter Silva 2011-05-13 02:38:58 UTC
Created attachment 57692 [details]
kern.log extract shows boot of kernel with initial patch applied.

This is great!  the disks now shows up as /dev/sdf on lsscsi.
I did mkfs.btrfs on it,  what sort of torture test should I apply to ensure that kernel really understands the controller?

Please note, the original patch did not apply cleanly, to ubuntu stock kernel.  It was just offset by a few lines, so did it manually.  I assume ubuntu must have some patches, please provide instructions as to what tree I should be pulling from.
Comment 4 Peter Silva 2011-05-13 02:47:19 UTC
sorry, it shows up as /dev/sdg, not f.  c,d,e & f are being access through an on-board controller I am using to work around not being able to use the rr622.  I put a single disk on the rr622 to be able to test.  That disk shows up as /dev/sdg
Comment 5 Tejun Heo 2011-05-13 08:12:15 UTC
Hmmm... the marvell device @ pci 02:00.1 doesn't seem to be an AHCI controller at all and /dev/sdg is not connected to that device.

You have a port multiplier connected to one of the ports coming off AHCI controller @ pci 00:11.0, which looks like the south bridge.  The PMP is Silicon Image 3726 and there are four devices attached to it.

 ata7: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
 ata7.15: Port Multiplier 1.1, 0x1095:0x3726 r23, 6 ports, feat 0x1/0x9
 ...
 ata7.05: SATA link up 1.5 Gbps (SStatus 113 SControl 320)
 ata7.00: ATA-8: WDC WD5000AAKS-00C8A0, 12.01C02, max UDMA/133
 ata7.00: 976773168 sectors, multi 0: LBA48 NCQ (depth 31/32), AA
 ata7.00: configured for UDMA/133
 ata7.01: ATA-8: WDC WD5000AAKS-00C8A0, 12.01C02, max UDMA/133
 ata7.01: 976773168 sectors, multi 0: LBA48 NCQ (depth 31/32), AA
 ata7.01: configured for UDMA/133
 ata7.02: ATA-8: ST31000524NS, SN12, max UDMA/133
 ata7.02: 1953525168 sectors, multi 0: LBA48 NCQ (depth 31/32)
 ata7.02: configured for UDMA/133
 ata7.03: ATA-8: ST31000524NS, SN12, max UDMA/133
 ata7.03: 1953525168 sectors, multi 0: LBA48 NCQ (depth 31/32)
 ata7.03: configured for UDMA/133

ata7.03 is sd 6:3:0:0, which in turn is /dev/sdg.

So, the marvell controller still isn't working and it doesn't seem to be an ahci controller.
Comment 6 Peter Silva 2011-05-13 10:07:10 UTC
Created attachment 57712 [details]
mounted /dev/sdg as /mnt, and ran bonnie.

Is this good torture?
Comment 7 Tejun Heo 2011-05-13 10:13:10 UTC
Peter, the patch didn't do anything.  SBx00 ahci + SIMG3726 is a known working configuration (well, 3726 is known to be flaky too) and doesn't have anything to do with the marvell controller.  I don't know why it didn't work before tho.
Comment 8 Peter Silva 2011-05-13 11:05:23 UTC
Created attachment 57722 [details]
better kernel log.

Sorry, original log had before and after boots.  second boot in the log, from May 12th is with the patch applied. I updated the attachment to remove the may 11th boot without the patch.  so now, /dev/sdg is different!

The RAID tower is a Sans digital tr8m.  It has 8 drive bays, and two PC Link connections. I think that each PC-link connection is for one four-bay module in the RAID unit.  

After booting may 12th, I see:

alu% lsscsi
[0:0:0:0]    disk    ATA      ST3120026AS      3.05  /dev/sda
[2:0:0:0]    disk    ATA      WDC WD1600JS-55N 10.0  /dev/sdb
[4:0:0:0]    disk    ATA      WDC WD5000AAKS-0 12.0  /dev/sdc
[4:1:0:0]    disk    ATA      WDC WD5000AAKS-0 12.0  /dev/sdd
[4:2:0:0]    disk    ATA      ST31000524NS     SN12  /dev/sde
[4:3:0:0]    disk    ATA      ST31000524NS     SN12  /dev/sdf
[6:0:0:0]    disk    ATA      WDC WD7500AAKS-0 30.0  /dev/sdg
[8:0:0:0]    cd/dvd  HL-DT-ST DVDRAM GSA-4082B A201  /dev/sr0
[8:0:1:0]    cd/dvd  HL-DT-ST DVD-ROM GDR8162B 0015  /dev/sr1
[10:0:0:0]   disk    TRUSTED  Mass Storage     1.00  /dev/sdh
alu% 

There are:
via PC link 1 (pm): two WD5000, and two ST31000.  they show up on [4:...]
via PC Link 2 (rr622): a single WD7500.  [6:...]  

I think ata6 is the highpoint rocket raid (rr622).  If it was through port multiplier, it would be 4:4.
Comment 9 Peter Silva 2011-05-14 04:54:46 UTC
very confused now... reverted to original stock ubuntu kernel, cabled using configuration as designed rr622 ESATA plugged into PC Links 1 and 2.  all five drives present themselves in both bays.


May 14 00:40:19 alu kernel: [    4.548948]  sdc: unknown partition table
May 14 00:40:19 alu kernel: [    4.549087] sd 8:0:0:0: [sdc] Attached SCSI disk
May 14 00:40:19 alu kernel: [    4.870184] ata10.03: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
May 14 00:40:19 alu kernel: [    4.870200] ata10.04: hard resetting link
May 14 00:40:19 alu kernel: [    5.220160] ata10.04: SATA link down (SStatus 0 SControl 320)
May 14 00:40:19 alu kernel: [    5.220186] ata10.05: hard resetting link
May 14 00:40:19 alu kernel: [    5.570186] ata10.05: SATA link up 1.5 Gbps (SStatus 113 SControl 320)
May 14 00:40:19 alu kernel: [    5.582178] ata10.00: ATA-8: WDC WD5000AAKS-00C8A0, 12.01C02, max UDMA/133
May 14 00:40:19 alu kernel: [    5.582181] ata10.00: 976773168 sectors, multi 0: LBA48 NCQ (depth 31/32), AA
May 14 00:40:19 alu kernel: [    5.583209] ata10.00: configured for UDMA/133
May 14 00:40:19 alu kernel: [    5.599177] ata10.01: ATA-8: WDC WD5000AAKS-00C8A0, 12.01C02, max UDMA/133
May 14 00:40:19 alu kernel: [    5.599179] ata10.01: 976773168 sectors, multi 0: LBA48 NCQ (depth 31/32), AA
May 14 00:40:19 alu kernel: [    5.600188] ata10.01: configured for UDMA/133
May 14 00:40:19 alu kernel: [    5.601658] ata10.02: ATA-8: ST31000524NS, SN12, max UDMA/133
May 14 00:40:19 alu kernel: [    5.601661] ata10.02: 1953525168 sectors, multi 0: LBA48 NCQ (depth 31/32)
May 14 00:40:19 alu kernel: [    5.603378] ata10.02: configured for UDMA/133
May 14 00:40:19 alu kernel: [    5.604847] ata10.03: ATA-8: ST31000524NS, SN12, max UDMA/133
May 14 00:40:19 alu kernel: [    5.604849] ata10.03: 1953525168 sectors, multi 0: LBA48 NCQ (depth 31/32)
May 14 00:40:19 alu kernel: [    5.606546] ata10.03: configured for UDMA/133
May 14 00:40:19 alu kernel: [    5.606580] ata10: EH complete
May 14 00:40:19 alu kernel: [    5.606729] scsi 9:0:0:0: Direct-Access     ATA      WDC WD5000AAKS-0 12.0 PQ: 0 ANSI: 5
May 14 00:40:19 alu kernel: [    5.606868] sd 9:0:0:0: [sdd] 976773168 512-byte logical blocks: (500 GB/465 GiB)
May 14 00:40:19 alu kernel: [    5.606876] sd 9:0:0:0: Attached scsi generic sg5 type 0
May 14 00:40:19 alu kernel: [    5.606899] sd 9:0:0:0: [sdd] Write Protect is off
May 14 00:40:19 alu kernel: [    5.606901] sd 9:0:0:0: [sdd] Mode Sense: 00 3a 00 00
May 14 00:40:19 alu kernel: [    5.606915] sd 9:0:0:0: [sdd] Write cache: disabled, read cache: enabled, doesn't support DPO or FUA


The only difference I can think of, is that, on one of the boots I entered <Ctrl-H> to enter the RAID unit BIOS.  I created some raidsets, then exited.
Comment 10 Tejun Heo 2011-05-14 10:39:10 UTC
Yeap, you're right.  In the last log, 11.0 is working properly as an AHCI controller.  In the earlier log, the controller was just spitting out 0xffffffff's which often is an indication that the card is not well seated in the PCI slot.  But, it's also possible that the controller changes its programming interface depending on the controller mode in use.  Can you please play with different settings and find out what's going on?

Thanks.
Comment 11 Peter Silva 2011-05-14 14:14:05 UTC
I guess the card moving around is possible, the case has been slid back and forth, and cables plugged in that might have shifted it very slightly.

The card is screwed in to the bracket, and the BIOS utility has always worked properly.  Would the BIOS work if the card were not properly seated?  

OK, on boot, I go into the rr622's BIOS menu, after it does it's scanning devices... for a few minutes...    

I have menu (via Ctrl-H)

Create Delete Add/Remove Spare Settings View Initialize

5 disks show up. the two I originally created a mirrored raidset show up as 'configured' the other three show up as New.

Create:
  no suitable disks

Delete:
  don't dare.

Add/Remove Spare:
  offers me to do remove one of the 'configured' disks from their 1 disk JBOD sets. no thanks.

Settings
  Select boot device
  staggerred spinup: Disabled.

View
  shows the two disks each configured as individual JBODS called rr622_0 and rr622_1

Initialize

  shows me the three 'New' disks.
  presumably puts some sort of header on the drive?

didn't touch it.


so, before I submitted the bug, I created two jbod devices using this BIOS menu, but I recall rebooting and never seeing the devices at the OS level.

Since the rr622 started working, I have done a good half-dozen boots, and the disks always appear as individual sata units.

Can you suggest some other settings to tweak?
Comment 12 Tejun Heo 2011-05-17 07:27:36 UTC
Hmm... not really.  The earlier log definitely seemed like the hardware was having some problem but I would really appreciate if you can try different RAID / JBOD settings and verify that they all work.  Not having enough info and given the earlier weird failure, I'm a bit reluctant to forward the patch upstream.

Thanks.
Comment 13 Peter Silva 2011-05-17 11:46:25 UTC
The raidsets defined in the bios don't "work." I have two jbod raid sets that show up in the BIOS, but 5 disks are present, and they all look the same at the OS level.  The RAID stuff I didn't want isn't working. That's perfect.  I never wanted the RAID functions, which I believe are implemented in the host-based driver, rather than in firmware/hardware.  The thing that bugs me is that I don't know why it didn't present disks in the first place.  I agree that there is nothing to forward at this point. I'm OK with closing if we don't think we can figure out why it suddenly started working.
Comment 14 Tejun Heo 2011-05-19 13:50:12 UTC
Yeah, the RAID is most likely just BIOS / driver thing.  At hardware, it's always just multiple disks attached to a single controller, but I just want to make sure that it works under different BIOS configurations.  Can you please test with JBOD and different RAID configurations and see whether the disks come up regardless of configuration?
Comment 15 Peter Silva 2011-06-19 13:42:14 UTC
I booted a 2.6.39 kernel from the ubuntu PPA set for natty.   With
that kernel, no drives are presented.


On Thu, May 19, 2011 at 9:50 AM,  <bugzilla-daemon@bugzilla.kernel.org> wrote:
> https://bugzilla.kernel.org/show_bug.cgi?id=34762
>
>
>
>
>
> --- Comment #14 from Tejun Heo <tj@kernel.org>  2011-05-19 13:50:12 ---
> Yeah, the RAID is most likely just BIOS / driver thing.  At hardware, it's
> always just multiple disks attached to a single controller, but I just want
> to
> make sure that it works under different BIOS configurations.  Can you please
> test with JBOD and different RAID configurations and see whether the disks
> come
> up regardless of configuration?
>
> --
> Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email
> ------- You are receiving this mail because: -------
> You reported the bug.
>
Comment 16 Tejun Heo 2011-06-23 14:16:19 UTC
The patch hasn't been committed to the kernel yet so...  Has it been working fine with the patched kernel?  Have you tried different configs from BIOS?
Comment 17 Peter Silva 2011-06-25 16:28:05 UTC
The patch did nothing.  At some point, magically, the controller
started presenting the drives to the OS.
when booted on a kernel without the patch, the drives remain visible.
 No change in observed behaviour.  the kernel is ubuntu natty standard
2.6.38.2.  When I boot a 2.6.39 from ubuntu kernel ppa, I can see none
of the drives attached to that controller.
Comment 18 Tejun Heo 2011-06-26 08:54:19 UTC
Peter, without the patch, there's no way the drives connected to the highpoint controller are visible to the operating system.  The controller doesn't have a class code with standard programming interface.  If the ahci driver doesn't have the PCI ID, it just won't detect it.  This patch not being in upstream, trying different/newer distro kernels isn't likely to help either (unless a distro adds the patch itself), so I strongly suggest verifying the test setup and re-test the patch.

Thanks.
Comment 19 Alan 2012-08-23 14:05:13 UTC
Not sure there is anything we can do. It seems it was only supported by some kind of Ubuntu added proprietary driver. One for Ubuntu
Comment 20 Peter Silva 2012-08-23 15:47:35 UTC
This bug report was opened because It works on 2.6.38 using normal open
source drivers.  Something was done to actively break working support,
using open drivers in the >= 2.6.39/3.0 tree.

I have a server accessing these disks, with it's kernel pinned to 2.6.38,
and no proprietary driver in use.

p.s. fwiw... my account is not recoverable on kernel.org.   tried changing
password, but even after pw change, it remains disabled.

On Thu, Aug 23, 2012 at 10:05 AM, <bugzilla-daemon@bugzilla.kernel.org>wrote:

> https://bugzilla.kernel.org/show_bug.cgi?id=34762
>
>
> Alan <alan@lxorguk.ukuu.org.uk> changed:
>
>            What    |Removed                     |Added
>
> ----------------------------------------------------------------------------
>              Status|NEW                         |RESOLVED
>                  CC|                            |alan@lxorguk.ukuu.org.uk
>          Resolution|                            |WILL_NOT_FIX
>
>
>
>
> --- Comment #19 from Alan <alan@lxorguk.ukuu.org.uk>  2012-08-23 14:05:13
> ---
> Not sure there is anything we can do. It seems it was only supported by
> some
> kind of Ubuntu added proprietary driver. One for Ubuntu
>
> --
> Configure bugmail: https://bugzilla.kernel.org/userprefs.cgi?tab=email
> ------- You are receiving this mail because: -------
> You reported the bug.
>

Note You need to log in before you can comment on or make changes to this bug.