Bug 37232

Summary: [3.0-rc1][NMI/DMAR][iwlagn] errors - PCI system error (SERR) for reason a1 on CPU 0
Product: Drivers Reporter: Maciej Rutecki (maciej.rutecki)
Component: PCIAssignee: drivers_pci (drivers_pci)
Status: CLOSED UNREPRODUCIBLE    
Severity: normal CC: florian, maciej.rutecki, rjw, shawn.starr
Priority: P1    
Hardware: All   
OS: Linux   
Kernel Version: 3.0-rc1 Subsystem:
Regression: Yes Bisected commit-id:
Bug Depends on:    
Bug Blocks: 36912    

Description Maciej Rutecki 2011-06-11 16:30:31 UTC
Subject    : [3.0-rc1][NMI/DMAR][iwlagn] errors - PCI system error (SERR) for reason a1 on CPU 0
Submitter  : Shawn Starr <shawn.starr@rogers.com>
Date       : 2011-06-03 4:56
Message-ID : 4478124.OuFNgaPiHS@segfault.sh0n.net
References : http://marc.info/?l=linux-kernel&m=130707737616116&w=2

This entry is being used for tracking a regression from 2.6.39. Please don't
close it until the problem is fixed in the mainline.
Comment 1 Rafael J. Wysocki 2011-06-19 13:45:27 UTC
On Sunday, June 19, 2011, Shawn Starr wrote:
> On Sunday, June 12, 2011 10:30:54 PM Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a summary report
> > of recent regressions.
> > 
> > The following bug entry is on the current list of known regressions
> > from 2.6.39.  Please verify if it still should be listed and let the
> > tracking team know (either way).
> > 
> This continues in -rc3 (3.0-0.rc3.git5.1.fc16.x86_64)
> 
> [   22.675931] NMI: PCI system error (SERR) for reason a1 on CPU 0.
> [  163.002909] DMAR:[DMA Write] Request device [03:00.0] fault addr ffe48000 
> [  163.002911] DMAR:[fault reason 05] PTE Write access is not set
> 
> 03:00.0 Network controller: Intel Corporation PRO/Wireless 5100 AGN [Shiloh] 
> Network Connection
> 
> I assume both of those are separate issues, but they continue in any case.
> 
> > 
> > Bug-Entry   : http://bugzilla.kernel.org/show_bug.cgi?id=37232
> > Subject             : [3.0-rc1][NMI/DMAR][iwlagn] errors - PCI system error
> (SERR) 
> for
> > reason a1 on CPU 0 Submitter        : Shawn Starr <shawn.starr@XXXXX>
> > Date                : 2011-06-03 4:56 (10 days old)
Comment 2 Rafael J. Wysocki 2011-06-29 19:30:15 UTC
On Tuesday, June 28, 2011, Shawn Starr wrote:
> On Sunday, June 26, 2011 11:23:44 PM Rafael J. Wysocki wrote:
> > This message has been generated automatically as a part of a summary report
> > of recent regressions.
> > 
> > The following bug entry is on the current list of known regressions
> > from 2.6.39.  Please verify if it still should be listed and let the
> > tracking team know (either way).
> > 
> This continues in -rc4, although Fedora has a patch to try to fix some DMAR 
> errors, it doesn't seem to work for me.
> 
> 03:00.0 Network controller: Intel Corporation PRO/Wireless 5100 AGN [Shiloh] 
> Network Connection
> 
> [ 1046.002195] DMAR:[DMA Write] Request device [03:00.0] fault addr ffda6000 
> [ 1046.002198] DMAR:[fault reason 05] PTE Write access is not set
> [ 1048.189224] DRHD: handling fault status reg 2
> [ 1048.189240] DMAR:[DMA Write] Request device [03:00.0] fault addr ffd94000 
> [ 1048.189244] DMAR:[fault reason 05] PTE Write access is not set
> 
> $ uname -a
> Linux segfault.sh0n.net 3.0-0.rc4.git0.2.fc16.x86_64 #1 SMP Wed Jun 22 
> 16:27:16 UTC 2011 x86_64 x86_64 x
> 
> Perhaps someone from Intel can comment on why this broke recently? It's not 
> causing me issues, just noise in logs/dmesg.
> 
> > 
> > Bug-Entry   : http://bugzilla.kernel.org/show_bug.cgi?id=37232
> > Subject             : [3.0-rc1][NMI/DMAR][iwlagn] errors - PCI system error
> (SERR) 
> for
> > reason a1 on CPU 0 Submitter        : Shawn Starr <shawn.starr@rogers.com>
> > Date                : 2011-06-03 4:56 (24 days old)
Comment 3 Florian Mickler 2011-07-02 19:57:30 UTC
On Wed, 29 Jun 2011 23:57:57 -0400
Shawn Starr <shawn.starr@rogers.com> wrote:
> 
> Hello Rafael/Linux devs,
> 
> You can close this bug now, regression is gone now. Both the SERR and the
> DMAR  (coming from iwlagn)
> errors are resolved.
> 
> Working in: -rc5 (Fedora rawhide 3.0-0.rc5.git0.1.fc16.x86_64)
> 
> Thanks,
> Shawn.



Closing on the assumption that this was a vanilla kernel...