Bug 61521 - WARNING: CPU:0 PID 1 at /home/raj/source/linux-3.11.1/arch/x86/kernel/smp.c:124 native_smp_send_reschedule+0x4b/0x50()
Summary: WARNING: CPU:0 PID 1 at /home/raj/source/linux-3.11.1/arch/x86/kernel/smp.c:1...
Status: NEW
Alias: None
Product: Timers
Classification: Unclassified
Component: Interval Timers (show other bugs)
Hardware: All Linux
: P1 normal
Assignee: timers_interval-timers
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2013-09-16 20:38 UTC by Raj Upadhyaya
Modified: 2015-08-15 10:37 UTC (History)
4 users (show)

See Also:
Kernel Version: 3.11.1
Subsystem:
Regression: No
Bisected commit-id:


Attachments
Picture of stacktrace. (479.21 KB, image/jpeg)
2013-09-16 20:38 UTC, Raj Upadhyaya
Details
A picture of a newer stacktrace from 3.12.0-rc1 (511.04 KB, image/jpeg)
2013-09-22 14:30 UTC, Raj Upadhyaya
Details
Picture of a stacktrace, Kernel 3.13.4 (1.16 MB, image/jpeg)
2014-02-24 20:52 UTC, antu456
Details
Kernel Stack Trace (3.12 MB, image/jpeg)
2014-04-06 23:18 UTC, Sumit Khanna
Details

Description Raj Upadhyaya 2013-09-16 20:38:27 UTC
Created attachment 108631 [details]
Picture of stacktrace.

For me to reproduce this error, I reboot, and pick the kernel I just built.

It doesn't seem to happen in linux-3.11, but I have changed a ton of things.
I took a picture of the stacktrace as it didn't seem saved anywhere.

I copied my config file to pastebin


http://pastebin.com/81Bnp4NP



Any ideas?

Regards,
Raj A. Upadhyaya
Comment 1 Raj Upadhyaya 2013-09-22 14:30:46 UTC
Created attachment 109241 [details]
A picture of a newer stacktrace from 3.12.0-rc1
Comment 2 Raj Upadhyaya 2013-09-22 14:38:50 UTC
I have also experienced this in linux 3.12.0-rc1 which I just downloaded and built.

Here is my config file.
http://pastebin.com/fwEHes8f


Please let me know if I am opening up the ticket correctly.

email:   raj@upadhyaya.com


Regards,

Raj A. Upadhyaya
Comment 3 antu456 2014-02-24 20:50:42 UTC
I have experienced this error too, I'm using Linux 3.13.4. I've attached a picture of the stacktrace. 

The error does not occur anymore if i disable these options:

--- IOMMU Hardware Support
[ ]  Support for Intel IOMMU using DMA Remapping Devices
[ ]  Enable Intel DMA Remapping Devices by default
[ ]  Support for Interrupt Remapping

Hardware: 
Gigabyte Z87X-D3H 
Intel Xeon E3-1240v3
16 GB RAM
GeForce GTX 560 Ti
GeForce GT 610
Comment 4 antu456 2014-02-24 20:52:20 UTC
Created attachment 127331 [details]
Picture of a stacktrace, Kernel 3.13.4
Comment 5 Sumit Khanna 2014-04-06 23:17:43 UTC
I have the same problem on an IBM IntelliStation Z Pro Type 9228
Comment 6 Sumit Khanna 2014-04-06 23:18:15 UTC
Created attachment 131591 [details]
Kernel Stack Trace
Comment 7 xerofoify 2014-06-24 22:26:29 UTC
I can trace your bug to these lines.
	if (unlikely(cpu_is_offline(cpu))) {
		WARN_ON(1);
		return;
	}
Seems that due to your CPU being offline at this point , it goes
into the WARN_ON which warns but doesn't reschedule. There are 
two issues here.
1. Your CPU core is offline and therefore needs to be online before this
function
2.WARN_ON is there but we need to reschedule to another core or we panic
as it seems in your trace attachment. 
Cheers Nick
Comment 8 Tiago Gehring 2015-01-29 17:10:14 UTC
I'm experiencing exactly the same problem with kernels 3.18.4 and 3.19-rc6. Everything worked fine with kernels 3.14 through 3.17. 

I tried to change some kernel options (including disabling all "IOMMU Hardware Support" options as suggested above) to no avail.

Hardware:

Dell Latitude E7440
Intel i7-4600U / 8 GB RAM

lspci output:

00:00.0 Host bridge: Intel Corporation Haswell-ULT DRAM Controller (rev 0b)
00:02.0 VGA compatible controller: Intel Corporation Haswell-ULT Integrated Graphics Controller (rev 0b)
00:03.0 Audio device: Intel Corporation Haswell-ULT HD Audio Controller (rev 0b)
00:14.0 USB controller: Intel Corporation 8 Series USB xHCI HC (rev 04)
00:16.0 Communication controller: Intel Corporation 8 Series HECI #0 (rev 04)
00:16.3 Serial controller: Intel Corporation 8 Series HECI KT (rev 04)
00:19.0 Ethernet controller: Intel Corporation Ethernet Connection I218-LM (rev 04)
00:1b.0 Audio device: Intel Corporation 8 Series HD Audio Controller (rev 04)
00:1c.0 PCI bridge: Intel Corporation 8 Series PCI Express Root Port 1 (rev e4)
00:1c.3 PCI bridge: Intel Corporation 8 Series PCI Express Root Port 4 (rev e4)
00:1c.4 PCI bridge: Intel Corporation 8 Series PCI Express Root Port 5 (rev e4)
00:1d.0 USB controller: Intel Corporation 8 Series USB EHCI #1 (rev 04)
00:1f.0 ISA bridge: Intel Corporation 8 Series LPC Controller (rev 04)
00:1f.2 RAID bus controller: Intel Corporation 82801 Mobile SATA Controller [RAID mode] (rev 04)
00:1f.3 SMBus: Intel Corporation 8 Series SMBus Controller (rev 04)
02:00.0 Network controller: Intel Corporation Wireless 7260 (rev 73)
03:00.0 SD Host controller: O2 Micro, Inc. SD/MMC Card Reader Controller (rev 01)
Comment 10 Tiago Gehring 2015-08-15 10:37:48 UTC
Coming back to this after some some time now, I can report that I still get the same crash with mainline kernel 4.1 but only when my extern USB 3.0 HD is attached. I didn't see it happening anymore without it beign attached but now it consistently crashes whenever I forgot to detach my hard drive during boot.

Note You need to log in before you can comment on or make changes to this bug.