Bug 92421

Summary: ASRock J1800M recurring kernel hang futex.c:1094 futex_wait+0xf3/0x260()
Product: Memory Management
Component: Other
Reporter: Zouhair (zouhair.s)
Assignee: Deepak S (deepak.s)
Status: NEW
Severity: high
Priority: P1
CC: agk, deepak.s
Hardware: x86-64
OS: Linux
Kernel Version: 3.16.7-ckt2-1
Subsystem:
Regression: No
Bisected commit-id:
Attachments: syslog freezes, dmesg, debian kernel def_config

Description Zouhair 2015-01-31 12:58:31 UTC
Created attachment 165371
syslog freezes

Hi,
I am using a plugged-in USB disk and a 500 GB encrypted disk partition with ext4 file systems. After two or more hours with qBittorrent running and only small amounts of transfer, the mouse freezes. Afterwards I unplug the power cable and find the attached log in /var/log/syslog.

Motherboard: ASRock J1800
http://www.asrock.com/mb/Intel/D1800B-ITX/
Memory: Kingston HyperX Savage 4 GB 1600 MHz DDR3 Non-ECC CL9 DIMM

OS: Debian (jessie) 8.0
3.16.0-4-amd64 #1 SMP Debian 3.16.7-ckt2-1 (2014-12-08) x86_64 GNU/Linux

-- cpuinfo --
processor	: 0
vendor_id	: GenuineIntel
cpu family	: 6
model		: 55
model name	: Intel(R) Celeron(R) CPU  J1800  @ 2.41GHz
stepping	: 3
microcode	: 0x321
cpu MHz		: 1332.881
cache size	: 1024 KB
...
processor	: 1
vendor_id	: GenuineIntel
cpu family	: 6
model		: 55
model name	: Intel(R) Celeron(R) CPU  J1800  @ 2.41GHz
stepping	: 3
microcode	: 0x321
cpu MHz		: 1332.881
cache size	: 1024 KB
...

-- pci info --
00:00.0 Host bridge: Intel Corporation Atom Processor Z36xxx/Z37xxx Series SoC Transaction Register (rev 0c)
00:02.0 VGA compatible controller: Intel Corporation Atom Processor Z36xxx/Z37xxx Series Graphics & Display (rev 0c)
00:13.0 SATA controller: Intel Corporation Device 0f23 (rev 0c)
00:14.0 USB controller: Intel Corporation Atom Processor Z36xxx/Z37xxx Series USB xHCI (rev 0c)
00:1a.0 Encryption controller: Intel Corporation Atom Processor Z36xxx/Z37xxx Series Trusted Execution Engine (rev 0c)
00:1b.0 Audio device: Intel Corporation Atom Processor Z36xxx/Z37xxx Series High Definition Audio Controller (rev 0c)
00:1c.0 PCI bridge: Intel Corporation Device 0f48 (rev 0c)
00:1c.1 PCI bridge: Intel Corporation Device 0f4a (rev 0c)
00:1c.2 PCI bridge: Intel Corporation Device 0f4c (rev 0c)
00:1c.3 PCI bridge: Intel Corporation Device 0f4e (rev 0c)
00:1f.0 ISA bridge: Intel Corporation Atom Processor Z36xxx/Z37xxx Series Power Control Unit (rev 0c)
00:1f.3 SMBus: Intel Corporation Device 0f12 (rev 0c)
04:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL8111/8168/8411 PCI Express Gigabit Ethernet Controller (rev 11)

-- usb info --
Bus 002 Device 001: ID 1d6b:0003 Linux Foundation 3.0 root hub
Bus 001 Device 006: ID 046d:c01e Logitech, Inc. MX518 Optical Mouse
Bus 001 Device 005: ID 045e:00bd Microsoft Corp. Fingerprint Reader
Bus 001 Device 003: ID 05e3:0606 Genesys Logic, Inc. USB 2.0 Hub / D-Link DUB-H4 USB 2.0 Hub
Bus 001 Device 004: ID 0a12:0001 Cambridge Silicon Radio, Ltd Bluetooth Dongle (HCI mode)
Bus 001 Device 007: ID 1058:1042 Western Digital Technologies, Inc. 
Bus 001 Device 002: ID 05e3:0608 Genesys Logic, Inc. USB-2.0 4-Port HUB
Bus 001 Device 001: ID 1d6b:0002 Linux Foundation 2.0 root hub
Comment 1 Zouhair 2015-01-31 12:59:40 UTC
Created attachment 165381
dmesg
Comment 2 Zouhair 2015-01-31 13:00:38 UTC
Created attachment 165391
debian kernel def_config
Comment 3 Zouhair 2015-01-31 13:09:09 UTC
syslog freezes: http://pastebin.com/raw.php?i=X7CDhY71
Comment 4 Zouhair 2015-02-09 08:14:10 UTC
I don't think this is USB storage or md related, because the bug is still present when the files are saved to a SATA disk drive ($HOME/) with an ext4 file system.
Comment 5 Zouhair 2015-02-09 08:18:24 UTC
Last message today before the system hung:
Feb  9 01:59:53 zz kernel: [47219.766885] [drm:vlv_set_rps_idle] *ERROR* timed out waiting for Punit
Comment 6 Deepak S 2015-02-09 08:50:10 UTC
Hi Zouhair,

diff --git a/drivers/gpu/drm/i915/intel_pm.c b/drivers/gpu/drm/i915/intel_pm.c
index 6ece663..94b7a8c 100644
--- a/drivers/gpu/drm/i915/intel_pm.c
+++ b/drivers/gpu/drm/i915/intel_pm.c
@@ -3800,7 +3800,7 @@ static void vlv_set_rps_idle(struct drm_i915_private *dev_priv)
        struct drm_device *dev = dev_priv->dev;

        /* CHV and latest VLV don't need to force the gfx clock */
-       if (IS_CHERRYVIEW(dev) || dev->pdev->revision >= 0xd) {
+       if (IS_CHERRYVIEW(dev) || IS_VALLEYVIEW(dev)) {
                valleyview_set_rps(dev_priv->dev, dev_priv->rps.min_freq_softlimit);
                return;
        }

Thanks
Deepak
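
To make the effect of the one-line change concrete, here is a minimal user-space sketch of the two conditions. It is not kernel code: `struct gpu_info` and both function names are hypothetical stand-ins for `IS_CHERRYVIEW(dev)`, `IS_VALLEYVIEW(dev)` and `dev->pdev->revision`. It assumes the graphics device at 00:02.0 (listed as "rev 0c" in the pci info above) is treated as Valleyview by the driver. With revision 0x0c the old check is false, so the function continues into the code that forces the gfx clock, which is presumably where the "timed out waiting for Punit" error comes from. With the patched check the function returns early.

```c
#include <stdbool.h>
#include <stdint.h>

/* Hypothetical stand-in for the device facts the check reads. */
struct gpu_info {
    bool is_cherryview;    /* models IS_CHERRYVIEW(dev) */
    bool is_valleyview;    /* models IS_VALLEYVIEW(dev) */
    uint8_t pci_revision;  /* models dev->pdev->revision, e.g. 0x0c */
};

/* Condition before the patch: only CHV, or a device with PCI
 * revision >= 0xd, takes the early return and skips forcing
 * the gfx clock. */
static bool takes_early_return_old(const struct gpu_info *g)
{
    return g->is_cherryview || g->pci_revision >= 0xd;
}

/* Condition after the patch: CHV and every VLV device take the
 * early return, regardless of PCI revision. */
static bool takes_early_return_new(const struct gpu_info *g)
{
    return g->is_cherryview || g->is_valleyview;
}
```

For a device described as `{ .is_cherryview = false, .is_valleyview = true, .pci_revision = 0x0c }`, `takes_early_return_old()` is false and `takes_early_return_new()` is true, so only the patched kernel would skip the gfx clock forcing on this board.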
Comment 7 Zouhair 2015-02-09 09:20:45 UTC
Is this fixed in 3.18.6, or does it need the patch too?
Comment 8 Deepak S 2015-02-09 09:35:28 UTC
You're not seeing the issue with 3.18.6?
Comment 9 Zouhair 2015-02-09 09:39:30 UTC
I have not built version 3.18.6 yet.
Comment 10 Zouhair 2015-02-09 16:58:13 UTC
Using 3.18.6, the last message from syslog:
Feb  9 17:52:40 zz kernel: [13062.461766] [drm:vlv_set_rps_idle] *ERROR* timed out waiting for Punit

The system has not hung yet.
Comment 11 Zouhair 2015-02-09 17:32:26 UTC
Intel D1800 Linux 3.18.6 dmesg:
http://pastebin.com/raw.php?i=rnsndwtg
Comment 12 Zouhair 2015-02-10 07:41:21 UTC
kernel:[20773.649540] NMI watchdog: BUG: soft lockup - CPU#0 stuck for 22s! [TorrentSpeedMon:4473]
Feb  9 20:01:06 zz kernel: [20773.649540] NMI watchdog: BUG: soft lockup - CPU#0 stuck for 22s! [TorrentSpeedMon:4473]
Feb  9 20:01:06 zz kernel: [20773.649547] Modules linked in: rfcomm nfsv3 xt_iprange xt_multiport cfg80211 xt_REDIRECT iptable_nat nf_nat_ipv4 nf_nat xt_tcpudp xt_owner nf_conntrack_ipv4 nf_defrag_ipv4 xt_conntrack bnep binfmt_misc nf_conntrack iptable_filter ip_tables nfnetlink_log x_tables nfnetlink nfsd auth_rpcgss oid_registry nfs_acl nfs lockd grace fscache sunrpc hid_generic usbhid nls_utf8 nls_cp437 btusb vfat fat bluetooth intel_powerclamp intel_rapl coretemp kvm_intel rfkill snd_hda_codec_hdmi kvm snd_hda_codec_realtek snd_hda_codec_generic crct10dif_pclmul crc32_pclmul evdev iTCO_wdt ghash_clmulni_intel iTCO_vendor_support cryptd serio_raw i2c_i801 pcspkr snd_hda_intel snd_hda_controller efivars i915 snd_hda_codec battery video snd_hwdep drm_kms_helper i2c_designware_platform shpchp button drm i2c_algo_bit i2c_designware_core lpc_ich snd_pcm mfd_core snd_timer snd soundcore xhci_pci xhci_hcd usbcore usb_common iosf_mbi processor fuse parport_pc ppdev lp parport autofs4 ext4 crc16 mbcache jbd2 sg sr_mod cdrom sd_mod thermal crc32c_intel r8169 fan mii thermal_sys i2c_hid hid i2c_core ahci libahci sdhci_acpi sdhci mmc_core libata scsi_mod
Feb  9 20:01:06 zz kernel: [20773.649648] CPU: 0 PID: 4473 Comm: TorrentSpeedMon Tainted: G        W      3.18.6 #2
Feb  9 20:01:06 zz kernel: [20773.649651] Hardware name: To Be Filled By O.E.M. To Be Filled By O.E.M./D1800M, BIOS P1.30 05/21/2014
Feb  9 20:01:06 zz kernel: [20773.649654] task: ffff880167192390 ti: ffff8801695b0000 task.ti: ffff8801695b0000
Feb  9 20:01:06 zz kernel: [20773.649656] RIP: 0010:[<ffffffff815745ea>]  [<ffffffff815745ea>] _raw_spin_lock+0x3a/0x50
Feb  9 20:01:06 zz kernel: [20773.649665] RSP: 0018:ffff8801695b3c58  EFLAGS: 00000212
Feb  9 20:01:06 zz kernel: [20773.649667] RAX: 000000000000248a RBX: 0000000000000024 RCX: 000000000000d687
Feb  9 20:01:06 zz kernel: [20773.649669] RDX: 000000000000d706 RSI: 00000000a5babb2d RDI: 0000000000000000
Feb  9 20:01:06 zz kernel: [20773.649671] RBP: ffff8801695b3cf0 R08: ffff8801695b3cf0 R09: 000000000000031d
Feb  9 20:01:06 zz kernel: [20773.649674] R10: 0000000000000000 R11: 0000000000000000 R12: ffffffff810c04d4
Feb  9 20:01:06 zz kernel: [20773.649676] R13: 00000000ffffffff R14: ffffffff810bfd4e R15: 0000000000000000
Feb  9 20:01:06 zz kernel: [20773.649679] FS:  00007f384affd700(0000) GS:ffff88017fc00000(0000) knlGS:0000000000000000
Feb  9 20:01:06 zz kernel: [20773.649681] CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
Feb  9 20:01:06 zz kernel: [20773.649683] CR2: 00007fc09decd000 CR3: 0000000162c4a000 CR4: 00000000001007f0
Feb  9 20:01:06 zz kernel: [20773.649685] Stack:
Feb  9 20:01:06 zz kernel: [20773.649687]  0000000002c50b5c ffffffff810e5c7e 0006d75d00000000 ffff8801695b3d40
Feb  9 20:01:06 zz kernel: [20773.649692]  00000000000109b0 000000000000031c 0000000000000000 ffff8801695b3cf8
Feb  9 20:01:06 zz kernel: [20773.649696]  0000000000000000 0000000000000000 0000000000000000 ffffc90000675144
Feb  9 20:01:06 zz kernel: [20773.649700] Call Trace:
Feb  9 20:01:06 zz kernel: [20773.649707]  [<ffffffff810e5c7e>] ? futex_wait_setup+0x9e/0x120
Feb  9 20:01:06 zz kernel: [20773.649711]  [<ffffffff810e5df2>] ? futex_wait+0xf2/0x270
Feb  9 20:01:06 zz kernel: [20773.649715]  [<ffffffff810e4e91>] ? futex_wake+0x71/0x150
Feb  9 20:01:06 zz kernel: [20773.649719]  [<ffffffff810e6e41>] ? do_futex+0xe1/0xab0
Feb  9 20:01:06 zz kernel: [20773.649724]  [<ffffffff8109acd7>] ? try_to_wake_up+0x1d7/0x360
Feb  9 20:01:06 zz kernel: [20773.649729]  [<ffffffff810acfae>] ? __wake_up_common+0x5e/0xa0
Feb  9 20:01:06 zz kernel: [20773.649733]  [<ffffffff8157400f>] ? _raw_spin_unlock_irq+0x1f/0x50
Feb  9 20:01:06 zz kernel: [20773.649738]  [<ffffffff81215e59>] ? SyS_epoll_ctl+0xd9/0xc10
Feb  9 20:01:06 zz kernel: [20773.649741]  [<ffffffff810e7886>] ? SyS_futex+0x76/0x170
Feb  9 20:01:06 zz kernel: [20773.649746]  [<ffffffff81574bed>] ? system_call_fastpath+0x16/0x1b
Feb  9 20:01:06 zz kernel: [20773.649748] Code: bd 4a b2 ff b8 00 00 01 00 f0 0f c1 03 89 c2 c1 ea 10 66 39 c2 89 d1 75 08 5b c3 66 0f 1f 44 00 00 b8 00 80 00 00 eb 08 90 f3 90 <83> e8 01 74 f1 0f b7 13 66 39 ca 75 f1 5b c3 0f 1f 80 00 00 00
Comment 13 Zouhair 2015-02-10 09:00:35 UTC
(In reply to Deepak S from comment #6)
> Hi Zouhair,
> 
> diff --git a/drivers/gpu/drm/i915/intel_pm.c b/drivers/gpu/drm/i915/intel_pm.c
> index 6ece663..94b7a8c 100644
> --- a/drivers/gpu/drm/i915/intel_pm.c
> +++ b/drivers/gpu/drm/i915/intel_pm.c
> @@ -3800,7 +3800,7 @@ static void vlv_set_rps_idle(struct drm_i915_private *dev_priv)
>         struct drm_device *dev = dev_priv->dev;
> 
>         /* CHV and latest VLV don't need to force the gfx clock */
> -       if (IS_CHERRYVIEW(dev) || dev->pdev->revision >= 0xd) {
> +       if (IS_CHERRYVIEW(dev) || IS_VALLEYVIEW(dev)) {
>                 valleyview_set_rps(dev_priv->dev, dev_priv->rps.min_freq_softlimit);
>                 return;
>         }
> 
> Thanks
> Deepak

The patch is not compatible with intel_pm.c in Debian linux-3.16.7-ckt2 :\
Comment 14 Zouhair 2015-02-10 09:01:25 UTC
(In reply to Zouhair from comment #13)
> The patch is not compatible with intel_pm.c in Debian linux-3.16.7-ckt2 :\

Debian linux-3.16.7-ckt2 intel_pm.c: http://pastebin.com/FKG7B3UM