Bug 101731 - [HSW/i915 PM] Oops 0002 in intel_set_pipe_timings after S4 resume
Status: RESOLVED CODE_FIX
Product: Drivers
Classification: Unclassified
Component: Video(DRI - Intel)
Hardware: x86-64 Linux
Importance: P1 normal
Assignee: intel-gfx-bugs@lists.freedesktop.org
Reported: 2015-07-19 19:59 UTC by Jens
Modified: 2015-10-07 11:14 UTC

Kernel Version: 4.2.0-994-generic #201507172200
Regression: No

Description Jens 2015-07-19 19:59:13 UTC
Hardware name: MSI MS-7817/CSM-B85M-E45 (MS-7817), BIOS V10.8 12/11/2014
Haswell chipset.

S4 resume works fine (multiple times) on 3.19.0+ (git 89271faca1..) under load while compiling a kernel on 4 cores.

I upgraded to 4.2.0 (see Kernel Version above) to test i915 against a current kernel. Sleep mode worked fine, but an S4 suspend followed by a resume results in an Oops:

[  179.204936] PM: Syncing filesystems ... done.
[  179.207302] Freezing user space processes ... (elapsed 0.001 seconds) done.
[  179.208855] PM: Marking nosave pages: [mem 0x00000000-0x00000fff]
[  179.208856] PM: Marking nosave pages: [mem 0x00058000-0x00058fff]
[  179.208857] PM: Marking nosave pages: [mem 0x0009f000-0x000fffff]
[  179.208859] PM: Marking nosave pages: [mem 0xc5355000-0xc535bfff]
[  179.208859] PM: Marking nosave pages: [mem 0xc579f000-0xc5d2ffff]
[  179.208873] PM: Marking nosave pages: [mem 0xd8dac000-0xd8e42fff]
[  179.208875] PM: Marking nosave pages: [mem 0xd8e93000-0xd9ffefff]
[  179.208917] PM: Marking nosave pages: [mem 0xda000000-0xffffffff]
[  179.209289] PM: Basic memory bitmaps created
[  179.209328] PM: Preallocating image memory... done (allocated 443082 pages)
[  179.350181] PM: Allocated 1772328 kbytes in 0.14 seconds (12659.48 MB/s)
[  179.350183] Freezing remaining freezable tasks ... (elapsed 0.001 seconds) done.
[  179.351540] Suspending console(s) (use no_console_suspend to debug)
[  179.351945] parport_pc 00:07: disabled
[  179.352112] serial 00:06: disabled
[  179.352124] i8042 kbd 00:05: System wakeup enabled by ACPI
[  179.369747] PM: freeze of devices complete after 18.203 msecs
[  179.380092] PM: late freeze of devices complete after 10.333 msecs
[  179.380641] PM: noirq freeze of devices complete after 0.545 msecs
[  179.380844] ACPI: Preparing to enter system sleep state S4
[  179.381073] PM: Saving platform NVS memory
[  179.381521] Disabling non-boot CPUs ...
[  179.381650] Broke affinity for irq 26
[  179.382663] smpboot: CPU 1 is now offline
[  179.392352] Broke affinity for irq 16
[  179.392354] Broke affinity for irq 23
[  179.392356] Broke affinity for irq 26
[  179.393369] smpboot: CPU 2 is now offline
[  179.412370] Broke affinity for irq 16
[  179.412372] Broke affinity for irq 23
[  179.412374] Broke affinity for irq 26
[  179.413386] smpboot: CPU 3 is now offline
[  179.436300] PM: Creating hibernation image:
[  179.440017] PM: Need to copy 441830 pages
[  179.440017] PM: Normal pages needed: 441830 + 1024, available pages: 1619848
[  179.440017] PM: Restoring platform NVS memory
[  179.440017] Enabling non-boot CPUs ...
[  179.440017] x86: Booting SMP configuration:
[  179.440017] smpboot: Booting Node 0 Processor 1 APIC 0x2
[  179.448128]  cache: parent cpu1 should not be sleeping
[  179.448196] CPU1 is up
[  179.448216] smpboot: Booting Node 0 Processor 2 APIC 0x4
[  179.460147]  cache: parent cpu2 should not be sleeping
[  179.460214] CPU2 is up
[  179.460234] smpboot: Booting Node 0 Processor 3 APIC 0x6
[  179.472171]  cache: parent cpu3 should not be sleeping
[  179.472236] CPU3 is up
[  179.474695] ACPI: Waking up from system sleep state S4
[  179.476055] PM: noirq restore of devices complete after 0.896 msecs
[  179.553709] PM: early restore of devices complete after 77.644 msecs
[  179.594723] usb usb1: root hub lost power or was reset
[  179.594724] usb usb2: root hub lost power or was reset
[  179.594956] usb usb3: root hub lost power or was reset
[  179.595024] usb usb4: root hub lost power or was reset
[  179.596923] rtc_cmos 00:02: System wakeup disabled by ACPI
[  179.596944] i8042 kbd 00:05: System wakeup disabled by ACPI
[  179.597313] serial 00:06: activated
[  179.597721] parport_pc 00:07: activated
[  179.598839] ehci-pci 0000:00:1a.0: cache line size of 64 is not supported
[  179.598927] ehci-pci 0000:00:1d.0: cache line size of 64 is not supported
[  179.600013] sd 0:0:0:0: [sda] Starting disk
[  179.600029] sd 1:0:0:0: [sdb] Starting disk
[  179.600058] sd 3:0:0:0: [sdc] Starting disk
[  179.916054] usb 4-1: reset high-speed USB device number 2 using ehci-pci
[  179.916058] usb 3-1: reset high-speed USB device number 2 using ehci-pci
[  179.932073] ata3: SATA link up 1.5 Gbps (SStatus 113 SControl 300)
[  179.936055] ata1: SATA link up 3.0 Gbps (SStatus 123 SControl 300)
[  179.938773] ata3.00: configured for UDMA/100
[  179.939467] ata1.00: configured for UDMA/133
[  179.940051] ata2: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[  179.940068] ata4: SATA link up 6.0 Gbps (SStatus 133 SControl 300)
[  179.941487] ata4.00: configured for UDMA/133
[  179.941637] ata2.00: supports DRM functions and may not be fully accessible
[  179.941912] ata2.00: failed to get NCQ Send/Recv Log Emask 0x1
[  179.942476] ata2.00: supports DRM functions and may not be fully accessible
[  179.942685] ata2.00: failed to get NCQ Send/Recv Log Emask 0x1
[  179.942844] ata2.00: configured for UDMA/133
[  180.076075] usb 1-1: reset low-speed USB device number 2 using xhci_hcd
[  180.353345] usb 1-1: ep 0x81 - rounding interval to 64 microframes, ep desc says 80 microframes
[  180.355580] PM: restore of devices complete after 760.986 msecs
[  180.355696] PM: Image restored successfully.
[  180.355713] PM: Basic memory bitmaps freed
[  180.355715] Restarting tasks ... 
[  180.355857] pci_bus 0000:04: Allocating resources
[  180.355871] pci 0000:03:00.0: PCI bridge to [bus 04]
[  180.355876] pci 0000:03:00.0:   bridge window [io  0x3000-0x3fff]
[  180.355883] pci 0000:03:00.0:   bridge window [mem 0xdf600000-0xdf7fffff]
[  180.355888] pci 0000:03:00.0:   bridge window [mem 0xdf800000-0xdf9fffff 64bit pref]
[  180.356527] done.
[  180.356530] video LNXVIDEO:00: Restoring backlight state
[  180.518566] BUG: unable to handle kernel paging request at 0000000011870370
[  180.518587] IP: [<ffffffffc01c5a70>] intel_set_pipe_timings+0x130/0x330 [i915]
[  180.518621] PGD 0 
[  180.518626] Oops: 0002 [#1] SMP 
[  180.518635] Modules linked in: pci_stub vboxpci(OE) vboxnetadp(OE) vboxnetflt(OE) vboxdrv(OE) intel_rapl iosf_mbi x86_pkg_temp_thermal intel_powerclamp coretemp kvm_intel snd_hda_codec_realtek snd_hda_codec_generic snd_hda_codec_hdmi bnep kvm rfcomm snd_hda_intel bluetooth snd_hda_codec input_leds snd_hda_core serio_raw snd_hwdep snd_pcm snd_seq_midi lpc_ich snd_seq_midi_event snd_rawmidi snd_seq 8250_fintek snd_seq_device snd_timer snd tpm_infineon soundcore intel_smartconnect shpchp mei_me mei mac_hid parport_pc ppdev nls_iso8859_1 lp parport jitterentropy_rng drbg ansi_cprng dm_crypt hid_generic usbhid hid mxm_wmi crct10dif_pclmul crc32_pclmul ghash_clmulni_intel aesni_intel i915 aes_x86_64 lrw gf128mul glue_helper ablk_helper i2c_algo_bit cryptd drm_kms_helper drm ahci r8169 libahci mii wmi video
[  180.518835] CPU: 0 PID: 1349 Comm: Xorg Tainted: G           OE   4.2.0-994-generic #201507172200
[  180.518856] Hardware name: MSI MS-7817/CSM-B85M-E45 (MS-7817), BIOS V10.8 12/11/2014
[  180.518874] task: ffff8800d7c94b00 ti: ffff880212434000 task.ti: ffff880212434000
[  180.518891] RIP: 0010:[<ffffffffc01c5a70>]  [<ffffffffc01c5a70>] intel_set_pipe_timings+0x130/0x330 [i915]
[  180.518922] RSP: 0018:ffff880212437b30  EFLAGS: 00010212
[  180.518935] RAX: 0000000011870370 RBX: ffff8802117cbc0c RCX: ffff880214e25f6d
[  180.518952] RDX: ffffffffffff5c6c RSI: 0000000000000000 RDI: ffff8802117ce000
[  180.518968] RBP: ffff880212437ba8 R08: ffff8802117bd8e0 R09: ffff8802132c5f00
[  180.518985] R10: 0000000000000006 R11: 0000000000000005 R12: ffff8802109f0000
[  180.519001] R13: ffff880214e2c000 R14: ffff8802117ce000 R15: ffff880212437c08
[  180.519018] FS:  00007fee248c69c0(0000) GS:ffff88021ea00000(0000) knlGS:0000000000000000
[  180.519037] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
[  180.519050] CR2: 0000000011870370 CR3: 00000000d6ffa000 CR4: 00000000001407f0
[  180.519067] DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
[  180.519083] DR3: 0000000000000000 DR6: 00000000fffe0ff0 DR7: 0000000000000400
[  180.519099] Stack:
[  180.519104]  ffffffffc01dd1d0 ffff8802109f0000 0000000000000000 0000000000080041
[  180.519123]  0000000000000001 ffff880212437ba8 ffff880211871800 0000000000000000
[  180.519142]  fffffffffff7ffbe ffff8800d7c94b00 ffff8802117ce000 ffff8800d8111c00
[  180.519160] Call Trace:
[  180.519175]  [<ffffffffc01dd1d0>] ? haswell_crtc_enable+0x90/0xbd0 [i915]
[  180.519199]  [<ffffffffc01e26be>] intel_atomic_commit+0x57e/0x780 [i915]
[  180.519224]  [<ffffffffc007b4a7>] drm_atomic_commit+0x37/0x60 [drm]
[  180.519242]  [<ffffffffc00dec3e>] drm_atomic_helper_set_config+0x38e/0x400 [drm_kms_helper]
[  180.519266]  [<ffffffffc00694de>] ? _object_find+0x6e/0xb0 [drm]
[  180.519285]  [<ffffffffc006b568>] drm_mode_set_config_internal+0x68/0x100 [drm]
[  180.519307]  [<ffffffffc006f47d>] drm_mode_setcrtc+0xdd/0x4f0 [drm]
[  180.519325]  [<ffffffffc0060863>] drm_ioctl+0x363/0x680 [drm]
[  180.519342]  [<ffffffffc006f3a0>] ? drm_mode_setplane+0x1c0/0x1c0 [drm]
[  180.519359]  [<ffffffff811faccd>] do_vfs_ioctl+0x2cd/0x4b0
[  180.519373]  [<ffffffff811ea555>] ? __sb_end_write+0x35/0x70
[  180.519387]  [<ffffffff811e824c>] ? vfs_write+0x15c/0x190
[  180.519400]  [<ffffffff811faf29>] SyS_ioctl+0x79/0x90
[  180.519413]  [<ffffffff817a7cb2>] entry_SYSCALL_64_fastpath+0x16/0x75
[  180.519428] Code: 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 00 01 47 78 01 00 00 00 00 00 00 00 00 00 01 00 00 00 02 00 00 00 03 00 <00> 00 04 00 00 00 05 00 00 00 06 00 00 00 07 00 00 00 08 00 00 
[  180.519494] RIP  [<ffffffffc01c5a70>] intel_set_pipe_timings+0x130/0x330 [i915]
[  180.519519]  RSP <ffff880212437b30>
[  180.519528] CR2: 0000000011870370
[  180.525700] ---[ end trace 8424f3dd9925311a ]---
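
For readers of the trace above: the number after "Oops:" is the x86 page-fault error code, a small bit field. The sketch below decodes it in Python. The table and the function name decode_oops_code are illustrative helpers, not part of any kernel tooling, and only the five lowest bits are handled.

```python
# Decode the x86 page-fault error code printed in an "Oops: NNNN" line.
# Each entry: (bit mask, meaning if set, meaning if clear or None).
# Assumption: only the five lowest architectural bits are decoded here.
PF_BITS = [
    (0x01, "protection violation", "page not present"),
    (0x02, "write access", "read access"),
    (0x04, "user mode", "kernel mode"),
    (0x08, "reserved bit set in page table", None),
    (0x10, "instruction fetch", None),
]


def decode_oops_code(code_hex):
    """Return a list of human-readable flags for a hex error code string."""
    code = int(code_hex, 16)
    parts = []
    for mask, if_set, if_clear in PF_BITS:
        if code & mask:
            parts.append(if_set)
        elif if_clear is not None:
            parts.append(if_clear)
    return parts


if __name__ == "__main__":
    # The code from this report: "Oops: 0002 [#1] SMP"
    print(decode_oops_code("0002"))
```

For 0002 this yields "page not present", "write access", "kernel mode", which matches the "unable to handle kernel paging request" message and the CR2 address in the log.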

After the oops, a shutdown attempt results in multiple "Bad swap offset entry" and "Bad page map" errors in the kernel log for many running processes.

Is this a known bug? Anything I can do to help fix it?
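
One thing that may help with triage is turning the faulting location from the IP line into something a debugger can resolve. The Python sketch below splits a "symbol+0xoffset/0xsize [module]" location into its parts and formats a gdb "list" expression from it. The names parse_location and gdb_list_expr are hypothetical helpers made up for this example, and resolving the expression to a source line assumes an i915 module built with debug info for exactly this kernel build, which is not confirmed anywhere in this report.

```python
import re

# Matches "symbol+0xOFFSET/0xSIZE [module]" as printed in IP:, RIP: and
# call-trace lines. The module part is optional (core kernel symbols).
LOCATION_RE = re.compile(
    r"(?P<symbol>[A-Za-z_]\w*)\+0x(?P<offset>[0-9a-f]+)/0x(?P<size>[0-9a-f]+)"
    r"(?: \[(?P<module>\w+)\])?"
)


def parse_location(line):
    """Extract symbol, offset, size and module from one log line, or None."""
    m = LOCATION_RE.search(line)
    if m is None:
        return None
    return {
        "symbol": m.group("symbol"),
        "offset": int(m.group("offset"), 16),
        "size": int(m.group("size"), 16),
        "module": m.group("module"),
    }


def gdb_list_expr(loc):
    """Format a gdb 'list' expression for the parsed location."""
    return "list *({}+{:#x})".format(loc["symbol"], loc["offset"])


if __name__ == "__main__":
    line = ("[  180.518587] IP: [<ffffffffc01c5a70>] "
            "intel_set_pipe_timings+0x130/0x330 [i915]")
    loc = parse_location(line)
    print(loc)
    print(gdb_list_expr(loc))
```

The printed expression would be entered in a gdb session opened on the matching module object; that step is left out here because it depends on the local build.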

Thank you!
Comment 1 Jens 2015-07-25 20:22:14 UTC
Not directly reproducible with git checkout as of #201507220314.

However, twice in ten suspend/resume cycles the system froze after resuming and displaying the login screen (X, Ubuntu 14.04). Even netconsole did not show any output (other than the messages that it always shows during boot) so I cannot say whether it was the same problem.
Comment 2 Jani Nikula 2015-10-07 11:14:58 UTC
(In reply to Jens from comment #1)
> Not directly reproducible with git checkout as of #201507220314.
> 
> However, twice in ten suspend/resume cycles the system froze after resuming
> and displaying the login screen (X, Ubuntu 14.04). Even netconsole did not
> show any output (other than the messages that it always shows during boot)
> so I cannot say whether it was the same problem.

Thanks for the follow-up, closing.

Please file any new bug reports at the freedesktop.org bugzilla [1]. Thank you.

[1] https://bugs.freedesktop.org/enter_bug.cgi?product=DRI&component=DRM/Intel
