Created attachment 301415 [details] device information dmesg show an infinite stream of errors: 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: can't find device of ID00e5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: PCIe Bus Error: severity=Corrected, type=Physical Layer, (Receiver ID) 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: device [8086:9d15] error status/mask=00000001/00002000 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: [ 0] RxErr 7月 14 13:14:31 uos kernel: pcieport 0000:00:1c.5: AER: Corrected error received: 0000:00:1c.5
Created attachment 301416 [details] System startup logs
Created attachment 301417 [details] hwinfo
Created attachment 301418 [details] dmidecode
Created attachment 301419 [details] lshw
Created attachment 301420 [details] lspci
Created attachment 301421 [details] lspci_have_pcie_aspm=off
Created attachment 301422 [details] cmdline
Created attachment 301423 [details] uname
Created attachment 301424 [details] os
I have a repaired patch: https://patchwork.kernel.org/project/linux-pci/patch/20220713112612.6935-1-limanyi@uniontech.com/ diff --git a/drivers/pci/pcie/aspm.c b/drivers/pci/pcie/aspm.c index a96b7424c9bc..b173d3c75ae7 100644 --- a/drivers/pci/pcie/aspm.c +++ b/drivers/pci/pcie/aspm.c @@ -1359,6 +1359,7 @@ void pcie_no_aspm(void) if (!aspm_force) { aspm_policy = POLICY_DEFAULT; aspm_disabled = 1; + aspm_support_enabled = false; } } Maybe there is a better fix for this issue.
Related commit: 8b8bae901ce23 3c076351c4027 387d37577fdd0 https://lkml.org/lkml/2011/11/10/467
Passing pcie_aspm=off to the kernel command line solves this problem.
the kernel command line does not have "pcie_aspm=off", lspci -nnvvk is 00:1c.5 PCI bridge [0604]: Intel Corporation Sunrise Point-LP PCI Express Root Port #6 [8086:9d15] (rev f1) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Interrupt: pin B routed to IRQ 124 Bus: primary=00, secondary=03, subordinate=03, sec-latency=0 I/O behind bridge: 0000c000-0000cfff Memory behind bridge: df100000-df1fffff Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort+ <SERR- <PERR- BridgeCtl: Parity- SERR+ NoISA- VGA- MAbort- >Reset- FastB2B- PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn- Capabilities: [40] Express (v2) Root Port (Slot+), MSI 00 DevCap: MaxPayload 256 bytes, PhantFunc 0 ExtTag- RBE+ DevCtl: Report errors: Correctable+ Non-Fatal+ Fatal+ Unsupported+ /* have aer function */ RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop- MaxPayload 128 bytes, MaxReadReq 128 bytes DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq- AuxPwr+ TransPend- LnkCap: Port #6, Speed 8GT/s, Width x1, ASPM L1, Exit Latency L0s <1us, L1 <16us ClockPM- Surprise- LLActRep+ BwNot+ ASPMOptComp+ LnkCtl: ASPM L1 Enabled; RCB 64 bytes Disabled- CommClk+ ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt- LnkSta: Speed 2.5GT/s, Width x1, TrErr- Train+ SlotClk+ DLActive+ BWMgmt+ ABWMgmt- SltCap: AttnBtn- PwrCtrl- MRL- AttnInd- PwrInd- HotPlug- Surprise- Slot #9, PowerLimit 10.000W; Interlock- NoCompl+ SltCtl: Enable: AttnBtn- PwrFlt- MRL- PresDet- CmdCplt- HPIrq- LinkChg- Control: AttnInd Unknown, PwrInd Unknown, Power- Interlock- SltSta: Status: AttnBtn- PowerFlt- MRL- CmdCplt- PresDet+ Interlock- Changed: MRL- PresDet- LinkState- RootCtl: ErrCorrectable- ErrNon-Fatal- ErrFatal- PMEIntEna+ CRSVisible- RootCap: CRSVisible- RootSta: PME ReqID 0000, PMEStatus- PMEPending- DevCap2: Completion Timeout: Range ABC, TimeoutDis+, LTR+, OBFF Not Supported ARIFwd+ DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis+, LTR+, OBFF Disabled ARIFwd- LnkCtl2: Target Link Speed: 2.5GT/s, EnterCompliance- SpeedDis- Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS- Compliance De-emphasis: -6dB LnkSta2: Current De-emphasis Level: -3.5dB, EqualizationComplete-, EqualizationPhase1- EqualizationPhase2-, EqualizationPhase3-, LinkEqualizationRequest- Capabilities: [80] MSI: Enable+ Count=1/1 Maskable- 64bit- Address: fee00278 Data: 0000 Capabilities: [90] Subsystem: ASUSTeK Computer Inc. Sunrise Point-LP PCI Express Root Port [1043:1c3d] Capabilities: [a0] Power Management version 3 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+) Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME- Capabilities: [100 v1] Advanced Error Reporting UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol- UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol- UESvrt: DLP+ SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol- CESta: RxErr+ BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr- CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+ AERCap: First Error Pointer: 00, GenCap- CGenEn- ChkCap- ChkEn- Capabilities: [140 v1] Access Control Services ACSCap: SrcValid+ TransBlk+ ReqRedir+ CmpltRedir+ UpstreamFwd- EgressCtrl- DirectTrans- ACSCtl: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd- EgressCtrl- DirectTrans- Capabilities: [220 v1] #19 Kernel driver in use: pcieport the kernel command line has "pcie_aspm=off", lspci -nnvvk is 00:1c.5 PCI bridge [0604]: Intel Corporation Sunrise Point-LP PCI Express Root Port #6 [8086:9d15] (rev f1) (prog-if 00 [Normal decode]) Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- Stepping- SERR- FastB2B- DisINTx+ Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort- >SERR- <PERR- INTx- Latency: 0 Interrupt: pin B routed to IRQ 124 Bus: primary=00, secondary=03, subordinate=03, sec-latency=0 I/O behind bridge: 0000c000-0000cfff Memory behind bridge: df100000-df1fffff Secondary status: 66MHz- FastB2B- ParErr- DEVSEL=fast >TAbort- <TAbort- <MAbort+ <SERR- <PERR- BridgeCtl: Parity- SERR+ NoISA- VGA- MAbort- >Reset- FastB2B- PriDiscTmr- SecDiscTmr- DiscTmrStat- DiscTmrSERREn- Capabilities: [40] Express (v2) Root Port (Slot+), MSI 00 DevCap: MaxPayload 256 bytes, PhantFunc 0 ExtTag- RBE+ DevCtl: Report errors: Correctable- Non-Fatal- Fatal- Unsupported- /* no aer function */ RlxdOrd- ExtTag- PhantFunc- AuxPwr- NoSnoop- MaxPayload 128 bytes, MaxReadReq 128 bytes DevSta: CorrErr+ UncorrErr- FatalErr- UnsuppReq- AuxPwr+ TransPend- LnkCap: Port #6, Speed 8GT/s, Width x1, ASPM L1, Exit Latency L0s <1us, L1 <16us ClockPM- Surprise- LLActRep+ BwNot+ ASPMOptComp+ LnkCtl: ASPM L1 Enabled; RCB 64 bytes Disabled- CommClk+ ExtSynch- ClockPM- AutWidDis- BWInt- AutBWInt- LnkSta: Speed 2.5GT/s, Width x1, TrErr- Train- SlotClk+ DLActive+ BWMgmt+ ABWMgmt- SltCap: AttnBtn- PwrCtrl- MRL- AttnInd- PwrInd- HotPlug- Surprise- Slot #9, PowerLimit 10.000W; Interlock- NoCompl+ SltCtl: Enable: AttnBtn- PwrFlt- MRL- PresDet- CmdCplt- HPIrq- LinkChg- Control: AttnInd Unknown, PwrInd Unknown, Power- Interlock- SltSta: Status: AttnBtn- PowerFlt- MRL- CmdCplt- PresDet+ Interlock- Changed: MRL- PresDet- LinkState- RootCtl: ErrCorrectable- ErrNon-Fatal- ErrFatal- PMEIntEna- CRSVisible- RootCap: CRSVisible- RootSta: PME ReqID 0000, PMEStatus- PMEPending- DevCap2: Completion Timeout: Range ABC, TimeoutDis+, LTR+, OBFF Not Supported ARIFwd+ DevCtl2: Completion Timeout: 50us to 50ms, TimeoutDis+, LTR+, OBFF Disabled ARIFwd- LnkCtl2: Target Link Speed: 2.5GT/s, EnterCompliance- SpeedDis- Transmit Margin: Normal Operating Range, EnterModifiedCompliance- ComplianceSOS- Compliance De-emphasis: -6dB LnkSta2: Current De-emphasis Level: -3.5dB, EqualizationComplete-, EqualizationPhase1- EqualizationPhase2-, EqualizationPhase3-, LinkEqualizationRequest- Capabilities: [80] MSI: Enable+ Count=1/1 Maskable- 64bit- Address: fee00278 Data: 0000 Capabilities: [90] Subsystem: ASUSTeK Computer Inc. Sunrise Point-LP PCI Express Root Port [1043:1c3d] Capabilities: [a0] Power Management version 3 Flags: PMEClk- DSI- D1- D2- AuxCurrent=0mA PME(D0+,D1-,D2-,D3hot+,D3cold+) Status: D0 NoSoftRst- PME-Enable- DSel=0 DScale=0 PME- Capabilities: [100 v1] Advanced Error Reporting UESta: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol- UEMsk: DLP- SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF- MalfTLP- ECRC- UnsupReq- ACSViol- UESvrt: DLP+ SDES- TLP- FCP- CmpltTO- CmpltAbrt- UnxCmplt- RxOF+ MalfTLP+ ECRC- UnsupReq- ACSViol- CESta: RxErr+ BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr- CEMsk: RxErr- BadTLP- BadDLLP- Rollover- Timeout- NonFatalErr+ AERCap: First Error Pointer: 00, GenCap- CGenEn- ChkCap- ChkEn- Capabilities: [140 v1] Access Control Services ACSCap: SrcValid+ TransBlk+ ReqRedir+ CmpltRedir+ UpstreamFwd- EgressCtrl- DirectTrans- ACSCtl: SrcValid- TransBlk- ReqRedir- CmpltRedir- UpstreamFwd- EgressCtrl- DirectTrans- Capabilities: [220 v1] #19 Kernel driver in use: pcieport
acpi_pci_root_add() /* drivers/acpi/pci_root.c */ \--negotiate_os_control(root, &no_aspm); \--calculate_support(); \--if (pcie_aspm_support_enabled()) /* if is true, this issue occurs */ \--support |= OSC_PCI_ASPM_SUPPORT | OSC_PCI_CLOCK_PM_SUPPORT;