Bug 216394 - NVME device not found with 5.19.* - globally duplicate IDs for nsid 1
Summary: NVME device not found with 5.19.* - globally duplicate IDs for nsid 1
Status: NEW
Alias: None
Product: IO/Storage
Classification: Unclassified
Component: NVMe (show other bugs)
Hardware: Other Linux
: P1 normal
Assignee: IO/NVME Virtual Default Assignee
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2022-08-22 12:19 UTC by Shyamin Ayesh
Modified: 2023-02-21 20:33 UTC (History)
5 users (show)

See Also:
Kernel Version: 5.19
Subsystem:
Regression: No
Bisected commit-id:


Attachments
Possible test patch for Lexar NM610 duplicate ids (536 bytes, patch)
2022-08-22 12:20 UTC, Shyamin Ayesh
Details | Diff

Description Shyamin Ayesh 2022-08-22 12:19:03 UTC
After upgrading to linux-5.19.2 a second identical NVME device is no longer found/visible. Only one of the 2 devices is visible in the /dev tree. Also can reproduce the same bug in 5.18.* versions and downgrading to 5.17.* fixes the issue atm.


The 5.18* / 5.19* kernel throws the error: "globally duplicate IDs for nsid 1"

➜  ~ sudo dmesg| grep nvme
[    1.145069] nvme nvme0: pci function 0000:02:00.0
[    1.145201] nvme nvme1: pci function 0000:05:00.0
[    1.148053] nvme nvme1: missing or invalid SUBNQN field.
[    1.149823] nvme nvme0: missing or invalid SUBNQN field.
[    1.157321] nvme nvme1: allocated 64 MiB host memory buffer.
[    1.157960] nvme nvme1: 4/0/0 default/read/poll queues
[    1.159831]  nvme1n1: p1 p2 p3 p4
[    1.159932] nvme nvme0: allocated 64 MiB host memory buffer.
[    1.160277] nvme nvme0: 4/0/0 default/read/poll queues
[    1.160530] nvme nvme0: globally duplicate IDs for nsid 1
[    1.160532] nvme nvme0: VID:DID 1d97:2263 model:Lexar 250GB SSD firmware:V2.18.0
➜  ~

➜  ~ lspci -nnk | grep "Non-Volatile memory controller"
02:00.0 Non-Volatile memory controller [0108]: Shenzhen Longsys Electronics Co., Ltd. SM2263EN/SM2263XT-based OEM SSD [1d97:2263] (rev 03)
05:00.0 Non-Volatile memory controller [0108]: Shenzhen Longsys Electronics Co., Ltd. SM2263EN/SM2263XT-based OEM SSD [1d97:2263] (rev 03)
➜  ~
Comment 1 Shyamin Ayesh 2022-08-22 12:20:48 UTC
Created attachment 301627 [details]
Possible test patch for Lexar NM610 duplicate ids
Comment 2 Keith Busch 2022-08-22 14:01:01 UTC
Patch look appropriate. Would you care to send it to the mailing list?
Comment 3 Shyamin Ayesh 2022-08-26 10:02:07 UTC
(In reply to Keith Busch from comment #2)
> Patch look appropriate. Would you care to send it to the mailing list?
Comment 4 Shyamin Ayesh 2022-08-26 10:02:33 UTC
(In reply to Keith Busch from comment #2)
> Patch look appropriate. Would you care to send it to the mailing list?

I just sent the patch to linux-nvme mailing list.
Comment 6 James Bober 2022-10-23 11:58:34 UTC
I recently upgraded to Ubuntu 22.10 with the 5.19.* kernel and now I can only see one side of my Intel F640 flash card.  This card was fine with the 5.15.* Kernel.

lspci -nnk | grep "Non-Volatile memory controller"
03:00.0 Non-Volatile memory controller [0108]: Intel Corporation NVMe Datacenter SSD [3DNAND, Beta Rock Controller] [8086:0a54]
04:00.0 Non-Volatile memory controller [0108]: Intel Corporation NVMe Datacenter SSD [3DNAND, Beta Rock Controller] [8086:0a54]
05:00.0 Non-Volatile memory controller [0108]: Samsung Electronics Co Ltd NVMe SSD Controller SM981/PM981/PM983 [144d:a808]

j$ sudo dmesg |grep nvme
[sudo] password for jbober:
[    1.492483] nvme nvme0: pci function 0000:03:00.0
[    1.492555] nvme nvme1: pci function 0000:04:00.0
[    1.492673] nvme nvme2: pci function 0000:05:00.0
[    1.497696] nvme nvme0: 4/0/0 default/read/poll queues
[    1.499203]  nvme0n1: p1 p9
[    1.499219] nvme nvme2: missing or invalid SUBNQN field.
[    1.499243] nvme nvme2: Shutdown timeout set to 8 seconds
[    1.501225] nvme nvme1: 4/0/0 default/read/poll queues
[    1.501551] nvme nvme1: globally duplicate IDs for nsid 1
[    1.501567] nvme nvme1: VID:DID 8086:0a54 model:7335943:ICDPC5ED2ORA6.4T firmware:QDV1RD22
[    1.507257] nvme nvme2: 4/0/0 default/read/poll queues
[    1.514392]  nvme2n1: p1 p2 p3 p4
[    6.353622] Adding 2097148k swap on /dev/nvme2n1p2.  Priority:-2 extents:1 across:2097148k SSDscFS

sudo nvme list
Node                  SN                   Model                                    Namespace Usage                      Format           FW Rev  
--------------------- -------------------- ---------------------------------------- --------- -------------------------- ---------------- --------
/dev/nvme0n1          PHLE7314008Y6P4BGN-1 7335943:ICDPC5ED2ORA6.4T                 1           3.20  TB /   3.20  TB    512   B +  0 B   QDV1RD22
/dev/nvme2n1          S59BNJ0N105797V      Samsung SSD 970 EVO Plus 250GB           1         123.71  GB / 250.06  GB    512   B +  0 B   2B2QEXM7
Comment 7 Shyamin Ayesh 2023-02-01 05:10:28 UTC
Above mentioned patch didn't fix the problem. Can someone please help me ?
Comment 8 Mike K 2023-02-21 20:33:53 UTC
Seems I have found another afflicted make/model. My symptoms are identical to the others.

lspci -nnk | grep "Non-Volatile"
01:00.0 Non-Volatile memory controller [0108]: Realtek Semiconductor Co., Ltd. Device [10ec:5763] (rev 01)
22:00.0 Non-Volatile memory controller [0108]: Realtek Semiconductor Co., Ltd. Device [10ec:5763] (rev 01)

For the record, these are supposed to be TEAM TM8FP6002T, so I guess they just rebranded a Realtek part.

Note You need to log in before you can comment on or make changes to this bug.