Bug 46031 - kswapd0 moving to uninterruptible sleep (STAT D)
Summary: kswapd0 moving to uninterruptible sleep (STAT D)
Status: RESOLVED INVALID
Alias: None
Product: IO/Storage
Classification: Unclassified
Component: SCSI (show other bugs)
Hardware: All Linux
: P1 normal
Assignee: linux-scsi@vger.kernel.org
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2012-08-16 12:40 UTC by Markus Hetzmannseder
Modified: 2012-09-19 08:01 UTC (History)
0 users

See Also:
Kernel Version: 3.5.2
Subsystem:
Regression: No
Bisected commit-id:


Attachments

Description Markus Hetzmannseder 2012-08-16 12:40:12 UTC
Hi,

I have a hangup problem with my litle server. The Hardware is a Dell Poweredge SC1430 with mirrored harddrives conntected on the PERC 5/i Adapter, it uses the megaraid/megasas scsi driver.

The problem occurs specially at heavy diskIO like update of the file name database.

The system is running in x86_PAE mode with 8GB RAM installed. So far I have tried out kernel 3.1.4 3.6.0-rc1 and now running 3.5.2 version.

According to kernel.log its allways the kswapd0 process which starts to hang in STAT D mode. After that more and more processes are hitting STAT D and the system is getting practically unusable. In that state a login over the network is still possible. A normal reboot is not working anymore (keeps waiting to kill some processes) only a reboot -f is doing the job.

When the error accurs the /proc/sys/kernel/tainted has state 512

In the attachment I add all the kern.log output I got so far.

In the kern.log I see something like this:

-----------------------------------------------------------------
Aug 16 11:49:57 servername kernel: [ 7361.062388] WARNING: at fs/jbd/journal.c:469 __log_start_commit+0x6b/0x7e()
Aug 16 11:49:57 servername kernel: [ 7361.062391] Hardware name: PowerEdge SC1430
Aug 16 11:49:57 servername kernel: [ 7361.062393] jbd: bad log_start_commit: 2168023832 2168023832 0 0
Aug 16 11:49:57 servername kernel: [ 7361.062395] Modules linked in: ppdev lp bluetooth rfkill mperf cpufreq_conservative cpufreq_userspace cpufreq_powersave cpufreq_stats nfsd exportfs nfs lockd fscache auth_rpcgss nfs_acl sunrpc fuse loop psmouse lpc_ich mfd_core i5000_edac edac_core serio_raw evdev tpm_tis pcspkr tpm shpchp hid_generic coretemp rng_core dcdbas tpm_bios i5k_amb pci_hotplug microcode parport_pc processor button parport thermal_sys usbhid hid uhci_hcd sg sr_mod tg3 cdrom ehci_hcd libphy usbcore usb_common sd_mod crc_t10dif [last unloaded: scsi_wait_scan]
Aug 16 11:49:57 servername kernel: [ 7361.062454] Pid: 46, comm: kswapd0 Not tainted 3.5.2 #1
Aug 16 11:49:57 servername kernel: [ 7361.062456] Call Trace:
Aug 16 11:49:57 servername kernel: [ 7361.062464]  [<c1023a3d>] ? warn_slowpath_common+0x6a/0x7b
Aug 16 11:49:57 servername kernel: [ 7361.062468]  [<c11575ae>] ? __log_start_commit+0x6b/0x7e
Aug 16 11:49:57 servername kernel: [ 7361.062472]  [<c1023ab4>] ? warn_slowpath_fmt+0x28/0x2c
Aug 16 11:49:57 servername kernel: [ 7361.062476]  [<c11575ae>] ? __log_start_commit+0x6b/0x7e
Aug 16 11:49:57 servername kernel: [ 7361.062480]  [<c1157625>] ? log_start_commit+0x1b/0x22
Aug 16 11:49:57 servername kernel: [ 7361.062484]  [<c110fa0a>] ? ext3_evict_inode+0xbe/0x1cc
Aug 16 11:49:57 servername kernel: [ 7361.062489]  [<c10d4a6a>] ? evict+0x8a/0x126
Aug 16 11:49:57 servername kernel: [ 7361.062492]  [<c10d4e72>] ? dispose_list+0x2e/0x37
Aug 16 11:49:57 servername kernel: [ 7361.062496]  [<c10d50fa>] ? prune_icache_sb+0x27f/0x287
Aug 16 11:49:57 servername kernel: [ 7361.062501]  [<c10c5f21>] ? prune_super+0xa2/0xf5
Aug 16 11:49:57 servername kernel: [ 7361.062506]  [<c109f8bb>] ? shrink_slab+0x1b7/0x254
Aug 16 11:49:57 servername kernel: [ 7361.062509]  [<c10a16fe>] ? kswapd+0x54f/0x805
Aug 16 11:49:57 servername kernel: [ 7361.062515]  [<c103ad7d>] ? wake_up_bit+0x56/0x56
Aug 16 11:49:57 servername kernel: [ 7361.062519]  [<c10a11af>] ? try_to_free_pages+0xd5/0xd5
Aug 16 11:49:57 servername kernel: [ 7361.062522]  [<c103aa1f>] ? kthread+0x68/0x6d
Aug 16 11:49:57 servername kernel: [ 7361.062526]  [<c103a9b7>] ? kthread_freezable_should_stop+0x45/0x45
Aug 16 11:49:57 servername kernel: [ 7361.062531]  [<c1346b7e>] ? kernel_thread_helper+0x6/0xd
Aug 16 11:49:57 servername kernel: [ 7361.062534] ---[ end trace 7f2284fed89c7a03 ]---
Aug 16 12:33:17 servername kernel: [ 9960.684081] INFO: task acroread:3117 blocked for more than 120 seconds.
Aug 16 12:33:17 servername kernel: [ 9960.684116] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 16 12:33:17 servername kernel: [ 9960.684162] acroread        D 00000000     0  3117   3115 0x00000000
Aug 16 12:33:17 servername kernel: [ 9960.684179]  f0ef69a0 00200082 00000001 00000000 c6b6ddac 00000002 39abe377 c1514dc0
Aug 16 12:33:17 servername kernel: [ 9960.684186]  c6b6ddac c2c0dd38 c1514dc0 c1514dc0 f0ef69a0 c1514dc0 0101b7ba 00000020
Aug 16 12:33:17 servername kernel: [ 9960.684192]  c10d7899 c2c0ddb0 009e8d67 00000000 da7ff09c c6b6ddac 0000000b ce221700
Aug 16 12:33:17 servername kernel: [ 9960.684199] Call Trace:
Aug 16 12:33:17 servername kernel: [ 9960.684210]  [<c10d7899>] ? mntput_no_expire+0x15/0xf1
Aug 16 12:33:17 servername kernel: [ 9960.684215]  [<c1110214>] ? search_dirblock+0x5f/0x93
Aug 16 12:33:17 servername kernel: [ 9960.684221]  [<c103aeef>] ? prepare_to_wait+0x14/0x52
Aug 16 12:33:17 servername kernel: [ 9960.684225]  [<c10d4106>] ? __wait_on_freeing_inode+0x6e/0x88
Aug 16 12:33:17 servername kernel: [ 9960.684229]  [<c103ada6>] ? autoremove_wake_function+0x29/0x29
Aug 16 12:33:17 servername kernel: [ 9960.684232]  [<c10d4155>] ? find_inode_fast+0x35/0x6d
Aug 16 12:33:17 servername kernel: [ 9960.684236]  [<c10d54a8>] ? iget_locked+0x2f/0xd5
Aug 16 12:33:17 servername kernel: [ 9960.684240]  [<c110ce15>] ? ext3_iget+0x18/0x332
Aug 16 12:33:17 servername kernel: [ 9960.684243]  [<c1111e0c>] ? ext3_lookup+0x5d/0x9b
Aug 16 12:33:17 servername kernel: [ 9960.684248]  [<c10cb8b8>] ? __lookup_hash+0x8f/0xa8
Aug 16 12:33:17 servername kernel: [ 9960.684251]  [<c10cb8fd>] ? lookup_slow+0x2c/0x78
Aug 16 12:33:17 servername kernel: [ 9960.684255]  [<c10cccde>] ? walk_component+0x48/0xe8
Aug 16 12:33:17 servername kernel: [ 9960.684259]  [<c10cdc9a>] ? path_lookupat+0xa4/0x2a6
Aug 16 12:33:17 servername kernel: [ 9960.684264]  [<c109a79a>] ? free_hot_cold_page_list+0x4a/0x60
Aug 16 12:33:17 servername kernel: [ 9960.684268]  [<c10cdeb7>] ? do_path_lookup+0x1b/0x85
Aug 16 12:33:17 servername kernel: [ 9960.684271]  [<c10ce88c>] ? user_path_at_empty+0x3d/0x65
Aug 16 12:33:17 servername kernel: [ 9960.684277]  [<c10adb55>] ? handle_mm_fault+0x118/0x129
Aug 16 12:33:17 servername kernel: [ 9960.684281]  [<c10ce8bf>] ? user_path_at+0xb/0xe
Aug 16 12:33:17 servername kernel: [ 9960.684284]  [<c10c75ab>] ? vfs_fstatat+0x3d/0x63
Aug 16 12:33:17 servername kernel: [ 9960.684287]  [<c10c768d>] ? vfs_stat+0x10/0x12
Aug 16 12:33:17 servername kernel: [ 9960.684290]  [<c10c769e>] ? sys_stat64+0xf/0x23
Aug 16 12:33:17 servername kernel: [ 9960.684295]  [<c1343c4b>] ? spurious_fault+0xe5/0xe5
Aug 16 12:33:17 servername kernel: [ 9960.684299]  [<c1346613>] ? sysenter_do_call+0x12/0x22
Aug 16 12:35:17 servername kernel: [10080.684102] INFO: task acroread:3117 blocked for more than 120 seconds.
Aug 16 12:35:17 servername kernel: [10080.684138] "echo 0 > /proc/sys/kernel/hung_task_timeout_secs" disables this message.
Aug 16 12:35:17 servername kernel: [10080.684183] acroread        D 00000000     0  3117   3115 0x00000000
Aug 16 12:35:17 servername kernel: [10080.684200]  f0ef69a0 00200082 00000001 00000000 c6b6ddac 00000002 39abe377 c1514dc0
Aug 16 12:35:17 servername kernel: [10080.684207]  c6b6ddac c2c0dd38 c1514dc0 c1514dc0 f0ef69a0 c1514dc0 0101b7ba 00000020
Aug 16 12:35:17 servername kernel: [10080.684214]  c10d7899 c2c0ddb0 009e8d67 00000000 da7ff09c c6b6ddac 0000000b ce221700
Aug 16 12:35:17 servername kernel: [10080.684220] Call Trace:
Aug 16 12:35:17 servername kernel: [10080.684231]  [<c10d7899>] ? mntput_no_expire+0x15/0xf1
Aug 16 12:35:17 servername kernel: [10080.684237]  [<c1110214>] ? search_dirblock+0x5f/0x93
Aug 16 12:35:17 servername kernel: [10080.684243]  [<c103aeef>] ? prepare_to_wait+0x14/0x52
Aug 16 12:35:17 servername kernel: [10080.684247]  [<c10d4106>] ? __wait_on_freeing_inode+0x6e/0x88
Aug 16 12:35:17 servername kernel: [10080.684251]  [<c103ada6>] ? autoremove_wake_function+0x29/0x29
Aug 16 12:35:17 servername kernel: [10080.684254]  [<c10d4155>] ? find_inode_fast+0x35/0x6d
Aug 16 12:35:17 servername kernel: [10080.684258]  [<c10d54a8>] ? iget_locked+0x2f/0xd5
Aug 16 12:35:17 servername kernel: [10080.684261]  [<c110ce15>] ? ext3_iget+0x18/0x332
Aug 16 12:35:17 servername kernel: [10080.684265]  [<c1111e0c>] ? ext3_lookup+0x5d/0x9b
Aug 16 12:35:17 servername kernel: [10080.684269]  [<c10cb8b8>] ? __lookup_hash+0x8f/0xa8
Aug 16 12:35:17 servername kernel: [10080.684273]  [<c10cb8fd>] ? lookup_slow+0x2c/0x78
Aug 16 12:35:17 servername kernel: [10080.684276]  [<c10cccde>] ? walk_component+0x48/0xe8
Aug 16 12:35:17 servername kernel: [10080.684280]  [<c10cdc9a>] ? path_lookupat+0xa4/0x2a6
Aug 16 12:35:17 servername kernel: [10080.684285]  [<c109a79a>] ? free_hot_cold_page_list+0x4a/0x60
Aug 16 12:35:17 servername kernel: [10080.684289]  [<c10cdeb7>] ? do_path_lookup+0x1b/0x85
Aug 16 12:35:17 servername kernel: [10080.684292]  [<c10ce88c>] ? user_path_at_empty+0x3d/0x65
Aug 16 12:35:17 servername kernel: [10080.684298]  [<c10adb55>] ? handle_mm_fault+0x118/0x129
Aug 16 12:35:17 servername kernel: [10080.684302]  [<c10ce8bf>] ? user_path_at+0xb/0xe
Aug 16 12:35:17 servername kernel: [10080.684305]  [<c10c75ab>] ? vfs_fstatat+0x3d/0x63
Aug 16 12:35:17 servername kernel: [10080.684308]  [<c10c768d>] ? vfs_stat+0x10/0x12
Aug 16 12:35:17 servername kernel: [10080.684311]  [<c10c769e>] ? sys_stat64+0xf/0x23
Aug 16 12:35:17 servername kernel: [10080.684316]  [<c1343c4b>] ? spurious_fault+0xe5/0xe5
Aug 16 12:35:17 servername kernel: [10080.684320]  [<c1346613>] ? sysenter_do_call+0x12/0x22
--------------------------------------------------------------

Any hints how to get the system back in a stable mode?

Markus
Comment 1 Markus Hetzmannseder 2012-08-16 12:44:35 UTC
Full kern.log is to big for attachment, you can find it under:
http://oban.stoch.uni-linz.ac.at/kern.log
Comment 2 Anonymous Emailer 2012-08-16 12:46:19 UTC
Reply-To: James.Bottomley@HansenPartnership.com

[Cc of more relevant fs lists added]
On Thu, 2012-08-16 at 12:40 +0000, bugzilla-daemon@bugzilla.kernel.org
wrote:
> https://bugzilla.kernel.org/show_bug.cgi?id=46031
> 
>            Summary: kswapd0 moving to uninterruptible sleep (STAT D)
>            Product: IO/Storage
>            Version: 2.5
>     Kernel Version: 3.5.2
>           Platform: All
>         OS/Version: Linux
>               Tree: Mainline
>             Status: NEW
>           Severity: normal
>           Priority: P1
>          Component: SCSI
>         AssignedTo: linux-scsi@vger.kernel.org
>         ReportedBy: Markus.Hetzmannseder@jku.at
>         Regression: No
> 
> 
> Hi,
> 
> I have a hangup problem with my litle server. The Hardware is a Dell
> Poweredge
> SC1430 with mirrored harddrives conntected on the PERC 5/i Adapter, it uses
> the
> megaraid/megasas scsi driver.
> 
> The problem occurs specially at heavy diskIO like update of the file name
> database.
> 
> The system is running in x86_PAE mode with 8GB RAM installed. So far I have
> tried out kernel 3.1.4 3.6.0-rc1 and now running 3.5.2 version.
> 
> According to kernel.log its allways the kswapd0 process which starts to hang
> in
> STAT D mode. After that more and more processes are hitting STAT D and the
> system is getting practically unusable. In that state a login over the
> network
> is still possible. A normal reboot is not working anymore (keeps waiting to
> kill some processes) only a reboot -f is doing the job.
> 
> When the error accurs the /proc/sys/kernel/tainted has state 512
> 
> In the attachment I add all the kern.log output I got so far.
> 
> In the kern.log I see something like this:
> 
> -----------------------------------------------------------------
> Aug 16 11:49:57 servername kernel: [ 7361.062388] WARNING: at
> fs/jbd/journal.c:469 __log_start_commit+0x6b/0x7e()
> Aug 16 11:49:57 servername kernel: [ 7361.062391] Hardware name: PowerEdge
> SC1430
> Aug 16 11:49:57 servername kernel: [ 7361.062393] jbd: bad log_start_commit:
> 2168023832 2168023832 0 0

This would seem to be the root cause unless there's some clue in the log
you cut (the fs people may ask for more).

This either indicates a bug in the ext4 journalling layer or that
there's some type of hardware problem like memory corruption.  Hopefully
the fs people can provide more insight.  It looks like kswapd is trying
to do direct reclaim and the write fails for some reason (which could be
the HW problem) and then it doesn't recover from the error.

James


> Aug 16 11:49:57 servername kernel: [ 7361.062395] Modules linked in: ppdev lp
> bluetooth rfkill mperf cpufreq_conservative cpufreq_userspace
> cpufreq_powersave
> cpufreq_stats nfsd exportfs nfs lockd fscache auth_rpcgss nfs_acl sunrpc fuse
> loop psmouse lpc_ich mfd_core i5000_edac edac_core serio_raw evdev tpm_tis
> pcspkr tpm shpchp hid_generic coretemp rng_core dcdbas tpm_bios i5k_amb
> pci_hotplug microcode parport_pc processor button parport thermal_sys usbhid
> hid uhci_hcd sg sr_mod tg3 cdrom ehci_hcd libphy usbcore usb_common sd_mod
> crc_t10dif [last unloaded: scsi_wait_scan]
> Aug 16 11:49:57 servername kernel: [ 7361.062454] Pid: 46, comm: kswapd0 Not
> tainted 3.5.2 #1
> Aug 16 11:49:57 servername kernel: [ 7361.062456] Call Trace:
> Aug 16 11:49:57 servername kernel: [ 7361.062464]  [<c1023a3d>] ?
> warn_slowpath_common+0x6a/0x7b
> Aug 16 11:49:57 servername kernel: [ 7361.062468]  [<c11575ae>] ?
> __log_start_commit+0x6b/0x7e
> Aug 16 11:49:57 servername kernel: [ 7361.062472]  [<c1023ab4>] ?
> warn_slowpath_fmt+0x28/0x2c
> Aug 16 11:49:57 servername kernel: [ 7361.062476]  [<c11575ae>] ?
> __log_start_commit+0x6b/0x7e
> Aug 16 11:49:57 servername kernel: [ 7361.062480]  [<c1157625>] ?
> log_start_commit+0x1b/0x22
> Aug 16 11:49:57 servername kernel: [ 7361.062484]  [<c110fa0a>] ?
> ext3_evict_inode+0xbe/0x1cc
> Aug 16 11:49:57 servername kernel: [ 7361.062489]  [<c10d4a6a>] ?
> evict+0x8a/0x126
> Aug 16 11:49:57 servername kernel: [ 7361.062492]  [<c10d4e72>] ?
> dispose_list+0x2e/0x37
> Aug 16 11:49:57 servername kernel: [ 7361.062496]  [<c10d50fa>] ?
> prune_icache_sb+0x27f/0x287
> Aug 16 11:49:57 servername kernel: [ 7361.062501]  [<c10c5f21>] ?
> prune_super+0xa2/0xf5
> Aug 16 11:49:57 servername kernel: [ 7361.062506]  [<c109f8bb>] ?
> shrink_slab+0x1b7/0x254
> Aug 16 11:49:57 servername kernel: [ 7361.062509]  [<c10a16fe>] ?
> kswapd+0x54f/0x805
> Aug 16 11:49:57 servername kernel: [ 7361.062515]  [<c103ad7d>] ?
> wake_up_bit+0x56/0x56
> Aug 16 11:49:57 servername kernel: [ 7361.062519]  [<c10a11af>] ?
> try_to_free_pages+0xd5/0xd5
> Aug 16 11:49:57 servername kernel: [ 7361.062522]  [<c103aa1f>] ?
> kthread+0x68/0x6d
> Aug 16 11:49:57 servername kernel: [ 7361.062526]  [<c103a9b7>] ?
> kthread_freezable_should_stop+0x45/0x45
> Aug 16 11:49:57 servername kernel: [ 7361.062531]  [<c1346b7e>] ?
> kernel_thread_helper+0x6/0xd
> Aug 16 11:49:57 servername kernel: [ 7361.062534] ---[ end trace
> 7f2284fed89c7a03 ]---
> Aug 16 12:33:17 servername kernel: [ 9960.684081] INFO: task acroread:3117
> blocked for more than 120 seconds.
> Aug 16 12:33:17 servername kernel: [ 9960.684116] "echo 0 >
> /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> Aug 16 12:33:17 servername kernel: [ 9960.684162] acroread        D 00000000  
>  0  3117   3115 0x00000000
> Aug 16 12:33:17 servername kernel: [ 9960.684179]  f0ef69a0 00200082 00000001
> 00000000 c6b6ddac 00000002 39abe377 c1514dc0
> Aug 16 12:33:17 servername kernel: [ 9960.684186]  c6b6ddac c2c0dd38 c1514dc0
> c1514dc0 f0ef69a0 c1514dc0 0101b7ba 00000020
> Aug 16 12:33:17 servername kernel: [ 9960.684192]  c10d7899 c2c0ddb0 009e8d67
> 00000000 da7ff09c c6b6ddac 0000000b ce221700
> Aug 16 12:33:17 servername kernel: [ 9960.684199] Call Trace:
> Aug 16 12:33:17 servername kernel: [ 9960.684210]  [<c10d7899>] ?
> mntput_no_expire+0x15/0xf1
> Aug 16 12:33:17 servername kernel: [ 9960.684215]  [<c1110214>] ?
> search_dirblock+0x5f/0x93
> Aug 16 12:33:17 servername kernel: [ 9960.684221]  [<c103aeef>] ?
> prepare_to_wait+0x14/0x52
> Aug 16 12:33:17 servername kernel: [ 9960.684225]  [<c10d4106>] ?
> __wait_on_freeing_inode+0x6e/0x88
> Aug 16 12:33:17 servername kernel: [ 9960.684229]  [<c103ada6>] ?
> autoremove_wake_function+0x29/0x29
> Aug 16 12:33:17 servername kernel: [ 9960.684232]  [<c10d4155>] ?
> find_inode_fast+0x35/0x6d
> Aug 16 12:33:17 servername kernel: [ 9960.684236]  [<c10d54a8>] ?
> iget_locked+0x2f/0xd5
> Aug 16 12:33:17 servername kernel: [ 9960.684240]  [<c110ce15>] ?
> ext3_iget+0x18/0x332
> Aug 16 12:33:17 servername kernel: [ 9960.684243]  [<c1111e0c>] ?
> ext3_lookup+0x5d/0x9b
> Aug 16 12:33:17 servername kernel: [ 9960.684248]  [<c10cb8b8>] ?
> __lookup_hash+0x8f/0xa8
> Aug 16 12:33:17 servername kernel: [ 9960.684251]  [<c10cb8fd>] ?
> lookup_slow+0x2c/0x78
> Aug 16 12:33:17 servername kernel: [ 9960.684255]  [<c10cccde>] ?
> walk_component+0x48/0xe8
> Aug 16 12:33:17 servername kernel: [ 9960.684259]  [<c10cdc9a>] ?
> path_lookupat+0xa4/0x2a6
> Aug 16 12:33:17 servername kernel: [ 9960.684264]  [<c109a79a>] ?
> free_hot_cold_page_list+0x4a/0x60
> Aug 16 12:33:17 servername kernel: [ 9960.684268]  [<c10cdeb7>] ?
> do_path_lookup+0x1b/0x85
> Aug 16 12:33:17 servername kernel: [ 9960.684271]  [<c10ce88c>] ?
> user_path_at_empty+0x3d/0x65
> Aug 16 12:33:17 servername kernel: [ 9960.684277]  [<c10adb55>] ?
> handle_mm_fault+0x118/0x129
> Aug 16 12:33:17 servername kernel: [ 9960.684281]  [<c10ce8bf>] ?
> user_path_at+0xb/0xe
> Aug 16 12:33:17 servername kernel: [ 9960.684284]  [<c10c75ab>] ?
> vfs_fstatat+0x3d/0x63
> Aug 16 12:33:17 servername kernel: [ 9960.684287]  [<c10c768d>] ?
> vfs_stat+0x10/0x12
> Aug 16 12:33:17 servername kernel: [ 9960.684290]  [<c10c769e>] ?
> sys_stat64+0xf/0x23
> Aug 16 12:33:17 servername kernel: [ 9960.684295]  [<c1343c4b>] ?
> spurious_fault+0xe5/0xe5
> Aug 16 12:33:17 servername kernel: [ 9960.684299]  [<c1346613>] ?
> sysenter_do_call+0x12/0x22
> Aug 16 12:35:17 servername kernel: [10080.684102] INFO: task acroread:3117
> blocked for more than 120 seconds.
> Aug 16 12:35:17 servername kernel: [10080.684138] "echo 0 >
> /proc/sys/kernel/hung_task_timeout_secs" disables this message.
> Aug 16 12:35:17 servername kernel: [10080.684183] acroread        D 00000000  
>  0  3117   3115 0x00000000
> Aug 16 12:35:17 servername kernel: [10080.684200]  f0ef69a0 00200082 00000001
> 00000000 c6b6ddac 00000002 39abe377 c1514dc0
> Aug 16 12:35:17 servername kernel: [10080.684207]  c6b6ddac c2c0dd38 c1514dc0
> c1514dc0 f0ef69a0 c1514dc0 0101b7ba 00000020
> Aug 16 12:35:17 servername kernel: [10080.684214]  c10d7899 c2c0ddb0 009e8d67
> 00000000 da7ff09c c6b6ddac 0000000b ce221700
> Aug 16 12:35:17 servername kernel: [10080.684220] Call Trace:
> Aug 16 12:35:17 servername kernel: [10080.684231]  [<c10d7899>] ?
> mntput_no_expire+0x15/0xf1
> Aug 16 12:35:17 servername kernel: [10080.684237]  [<c1110214>] ?
> search_dirblock+0x5f/0x93
> Aug 16 12:35:17 servername kernel: [10080.684243]  [<c103aeef>] ?
> prepare_to_wait+0x14/0x52
> Aug 16 12:35:17 servername kernel: [10080.684247]  [<c10d4106>] ?
> __wait_on_freeing_inode+0x6e/0x88
> Aug 16 12:35:17 servername kernel: [10080.684251]  [<c103ada6>] ?
> autoremove_wake_function+0x29/0x29
> Aug 16 12:35:17 servername kernel: [10080.684254]  [<c10d4155>] ?
> find_inode_fast+0x35/0x6d
> Aug 16 12:35:17 servername kernel: [10080.684258]  [<c10d54a8>] ?
> iget_locked+0x2f/0xd5
> Aug 16 12:35:17 servername kernel: [10080.684261]  [<c110ce15>] ?
> ext3_iget+0x18/0x332
> Aug 16 12:35:17 servername kernel: [10080.684265]  [<c1111e0c>] ?
> ext3_lookup+0x5d/0x9b
> Aug 16 12:35:17 servername kernel: [10080.684269]  [<c10cb8b8>] ?
> __lookup_hash+0x8f/0xa8
> Aug 16 12:35:17 servername kernel: [10080.684273]  [<c10cb8fd>] ?
> lookup_slow+0x2c/0x78
> Aug 16 12:35:17 servername kernel: [10080.684276]  [<c10cccde>] ?
> walk_component+0x48/0xe8
> Aug 16 12:35:17 servername kernel: [10080.684280]  [<c10cdc9a>] ?
> path_lookupat+0xa4/0x2a6
> Aug 16 12:35:17 servername kernel: [10080.684285]  [<c109a79a>] ?
> free_hot_cold_page_list+0x4a/0x60
> Aug 16 12:35:17 servername kernel: [10080.684289]  [<c10cdeb7>] ?
> do_path_lookup+0x1b/0x85
> Aug 16 12:35:17 servername kernel: [10080.684292]  [<c10ce88c>] ?
> user_path_at_empty+0x3d/0x65
> Aug 16 12:35:17 servername kernel: [10080.684298]  [<c10adb55>] ?
> handle_mm_fault+0x118/0x129
> Aug 16 12:35:17 servername kernel: [10080.684302]  [<c10ce8bf>] ?
> user_path_at+0xb/0xe
> Aug 16 12:35:17 servername kernel: [10080.684305]  [<c10c75ab>] ?
> vfs_fstatat+0x3d/0x63
> Aug 16 12:35:17 servername kernel: [10080.684308]  [<c10c768d>] ?
> vfs_stat+0x10/0x12
> Aug 16 12:35:17 servername kernel: [10080.684311]  [<c10c769e>] ?
> sys_stat64+0xf/0x23
> Aug 16 12:35:17 servername kernel: [10080.684316]  [<c1343c4b>] ?
> spurious_fault+0xe5/0xe5
> Aug 16 12:35:17 servername kernel: [10080.684320]  [<c1346613>] ?
> sysenter_do_call+0x12/0x22
> --------------------------------------------------------------
> 
> Any hints how to get the system back in a stable mode?
> 
> Markus
>
Comment 3 Markus Hetzmannseder 2012-09-19 08:01:07 UTC
Thx for the help on it. As far as i could strip down the reasion for the errors, its deffinitly something hardware related. I think there is something wrong with the motherboard. 

Markus

Note You need to log in before you can comment on or make changes to this bug.