Bug 199663
| Summary: | Can't find root with scsi_blk_mq | | |
|---|---|---|---|
| Product: | IO/Storage | Reporter: | Randy DuCharme (radio.ad5gb) |
| Component: | Other | Assignee: | io_other |
| Status: | RESOLVED INVALID | | |
| Severity: | normal | CC: | tom.leiming |
| Priority: | P1 | | |
| Hardware: | x86-64 | | |
| OS: | Linux | | |
| Kernel Version: | 4.16.7 | Subsystem: | |
| Regression: | No | Bisected commit-id: | |
| Attachments: | dmesg output, lspci -k output | | |
Description
Randy DuCharme
2018-05-09 05:27:38 UTC
Could you provide your root disk info? And the whole dmesg info.

Like which kind of drive, what is the driver, ...

Created attachment 275933 [details]
dmesg output

Created attachment 275935 [details]
lspci -k output
(In reply to Lei Ming from comment #1)
> Could you provide your root disk info? And the whole dmesg info.
>
> Like which kind of drive, what is the driver, ...

I discovered I could get it to boot by manually mounting /dev/sdc3 to /new_root/ and exiting the emergency shell. It takes nearly 30 seconds however for any of the sdX devices to become available and visible in /dev with blk-mq enabled. I've tried preloading all modules with grub, and in the initramfs but that didn't seem to help any. Odd.

System's primary drive is a Seagate 2TB Barracuda and I have 2 RAID-0 arrays configured on an LSI9260-8I controller with 3 Western Digital Raptors in each.

I've attached the dmesg -T and lspci -k outputs hoping it'll help

From the following dmesg log you provided, looks everything is fine.

Maybe some blk-mq sysfs related change breaks userspace, could you test if commit bf9ae8c5325c0070d0 is good?

If yes, please test commit fa70d2e2c4a0a54ced9 further and see if it is introduced by Mike's change.

[Fri May 11 17:24:06 2018] scsi 0:2:0:0: Direct-Access LSI MR9260-8i 2.13 PQ: 0 ANSI: 5
[Fri May 11 17:24:06 2018] scsi 0:2:1:0: Direct-Access LSI MR9260-8i 2.13 PQ: 0 ANSI: 5
[Fri May 11 17:24:10 2018] random: crng init done
[Fri May 11 17:24:20 2018] scsi 1:0:0:0: Direct-Access ATA ST2000DM001-1CH1 CC29 PQ: 0 ANSI: 5
[Fri May 11 17:24:20 2018] sd 1:0:0:0: [sdc] 3907029168 512-byte logical blocks: (2.00 TB/1.82 TiB)
[Fri May 11 17:24:20 2018] sd 1:0:0:0: [sdc] 4096-byte physical blocks
[Fri May 11 17:24:20 2018] sd 0:2:0:0: [sda] 2927099904 512-byte logical blocks: (1.50 TB/1.36 TiB)
[Fri May 11 17:24:20 2018] sd 0:2:0:0: [sda] 4096-byte physical blocks
[Fri May 11 17:24:20 2018] sd 1:0:0:0: [sdc] Write Protect is off
[Fri May 11 17:24:20 2018] sd 1:0:0:0: [sdc] Mode Sense: 00 3a 00 00
[Fri May 11 17:24:20 2018] sd 1:0:0:0: [sdc] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[Fri May 11 17:24:20 2018] sd 0:2:1:0: [sdb] 2927099904 512-byte logical blocks: (1.50 TB/1.36 TiB)
[Fri May 11 17:24:20 2018] sd 0:2:0:0: [sda] Write Protect is off
[Fri May 11 17:24:20 2018] sd 0:2:1:0: [sdb] 4096-byte physical blocks
[Fri May 11 17:24:20 2018] sd 0:2:0:0: [sda] Mode Sense: 1f 00 00 08
[Fri May 11 17:24:20 2018] sd 0:2:1:0: [sdb] Write Protect is off
[Fri May 11 17:24:20 2018] sd 0:2:1:0: [sdb] Mode Sense: 1f 00 00 08
[Fri May 11 17:24:20 2018] sd 0:2:0:0: [sda] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[Fri May 11 17:24:20 2018] sd 0:2:1:0: [sdb] Write cache: enabled, read cache: enabled, doesn't support DPO or FUA
[Fri May 11 17:24:20 2018] sdb: sdb1
[Fri May 11 17:24:20 2018] sd 0:2:1:0: [sdb] Attached SCSI disk
[Fri May 11 17:24:20 2018] scsi 3:0:0:0: CD-ROM PIONEER BD-RW BDR-209D 1.20 PQ: 0 ANSI: 5
[Fri May 11 17:24:20 2018] sda: sda1
[Fri May 11 17:24:20 2018] sd 0:2:0:0: [sda] Attached SCSI disk
[Fri May 11 17:24:20 2018] sdc: sdc1 sdc2 sdc3 sdc4
[Fri May 11 17:24:20 2018] sd 1:0:0:0: [sdc] Attached SCSI disk
[Fri May 11 17:24:20 2018] sr 3:0:0:0: [sr0] scsi3-mmc drive: 125x/125x writer cd/rw xa/form2 cdda tray
[Fri May 11 17:24:20 2018] cdrom: Uniform CD-ROM driver Revision: 3.20
[Fri May 11 17:24:20 2018] sr 3:0:0:0: Attached scsi CD-ROM sr0
[Fri May 11 17:24:21 2018] ata5: SATA link down (SStatus 0 SControl 300)
[Fri May 11 17:24:21 2018] ata7: SATA link down (SStatus 0 SControl 300)
[Fri May 11 17:24:28 2018] EXT4-fs (sdc3): mounted filesystem with ordered data mode. Opts: (null)

(In reply to Lei Ming from comment #5)
> From the following dmesg log you provided, looks everything is fine.
>
> Maybe some blk-mq sysfs related change breaks userspace, could you test
> if commit bf9ae8c5325c0070d0 is good?
>
> If yes, please test commit fa70d2e2c4a0a54ced9 further and see if it is
> introduced by Mike's change.

I'll surely test but will not be able to until Sunday afternoon. I'll not be back at the machine until then. Will post my findings. Thanks!!!

So far nothing I've done with those commits has been very helpful. I've made some filesystem changes to get some more space so I can install some more tools. I'm a fairly seasoned C/C++ developer (nearly 20 years) but have little experience with Linux kernel hacking (save a little Android work) and debugging but I guess I'll try to jump into it and see if I can figure it out. Clearly it's not a widespread issue. Any links or resources you can point me to to "fast track" getting up to speed with Linux kernel internals and debugging would be welcome. I've got a few and found a few more. This'll likely disappear before I get anywhere with it tho'.

(In reply to Randy DuCharme from comment #7)
> So far nothing I've done with those commits has been very helpful. I've
> made some filesystem changes to get some more space so I can install some
> more tools. I'm a fairly seasoned C/C++ developer (nearly 20 years) but
> have little experience with Linux kernel hacking (save a little Android
> work) and debugging but I guess I'll try to jump into it and see if I can
> figure it out. Clearly it's not a widespread issue. Any links or resources
> you can point me to to "fast track" getting up to speed with Linux kernel
> internals and debugging would be welcome. I've got a few and found a few
> more. This'll likely disappear before I get anywhere with it tho'.

Actually, one approach I suggest is to use 'git bisect' to find which commit causes this issue. And it is pretty effective.

Thanks,

So, it would seem I've wasted some people's time. Problem is not a kernel problem at all. I migrated the system to a larger hard disk and somehow in the process resolved the issue - yet it's odd that Arch release kernels prior to 4.16.7 didn't exhibit the problem on the other disk. I wish I knew what happened but many apologies for the wasted bandwidth and time. I'm marking this resolved/invalid.
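
For anyone reading this report later with a similar "devices show up late" symptom: the delay discussed above can be read straight out of `dmesg -T` output by looking for large jumps between consecutive timestamps. Below is a minimal sketch, not something used in this report. The names `parse`, `gaps`, and `SAMPLE` are made up for illustration, the sample lines are copied from the dmesg excerpt above, and the script assumes English day and month names in the timestamps.

```python
import re
from datetime import datetime

# Matches the human-readable prefix printed by `dmesg -T`, e.g.
# "[Fri May 11 17:24:06 2018] scsi 0:2:0:0: ..."
LINE_RE = re.compile(
    r"^\[(?P<ts>[A-Z][a-z]{2} [A-Z][a-z]{2} +\d{1,2} "
    r"\d{2}:\d{2}:\d{2} \d{4})\] (?P<msg>.*)$"
)


def parse(lines):
    """Return (timestamp, message) pairs; lines without a timestamp are skipped."""
    entries = []
    for line in lines:
        m = LINE_RE.match(line.strip())
        if m:
            ts = datetime.strptime(m.group("ts"), "%a %b %d %H:%M:%S %Y")
            entries.append((ts, m.group("msg")))
    return entries


def gaps(entries, threshold_s=5):
    """Return (seconds, message_before, message_after) for each jump >= threshold_s."""
    found = []
    for (t0, m0), (t1, m1) in zip(entries, entries[1:]):
        delta = (t1 - t0).total_seconds()
        if delta >= threshold_s:
            found.append((delta, m0, m1))
    return found


# A few lines taken from the dmesg excerpt in this report.
SAMPLE = [
    "[Fri May 11 17:24:06 2018] scsi 0:2:1:0: Direct-Access LSI MR9260-8i 2.13 PQ: 0 ANSI: 5",
    "[Fri May 11 17:24:10 2018] random: crng init done",
    "[Fri May 11 17:24:20 2018] scsi 1:0:0:0: Direct-Access ATA ST2000DM001-1CH1 CC29 PQ: 0 ANSI: 5",
    "[Fri May 11 17:24:21 2018] ata7: SATA link down (SStatus 0 SControl 300)",
    "[Fri May 11 17:24:28 2018] EXT4-fs (sdc3): mounted filesystem with ordered data mode. Opts: (null)",
]

if __name__ == "__main__":
    for seconds, before, after in gaps(parse(SAMPLE)):
        print(f"{seconds:.0f}s between {before!r} and {after!r}")
```

On these sample lines the script flags two jumps: 10 seconds before the ATA disk is reported, and 7 seconds before the EXT4 mount of sdc3. Note that `dmesg -T` timestamps have one-second resolution, so this only gives a rough picture of where boot time goes.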