Bug 21252 - unable to handle paging request while using two KDE 4 sessions
Status: RESOLVED UNREPRODUCIBLE
Alias: None
Product: Memory Management
Classification: Unclassified
Component: Page Allocator
Hardware: All Linux
Importance: P1 normal
Assignee: Andrew Morton
Reported: 2010-10-27 10:24 UTC by Martin Steigerwald
Modified: 2010-11-06 08:50 UTC
Kernel Version: 2.6.36
Regression: No


Attachments
syslog (321.75 KB, text/plain)
2010-10-27 10:24 UTC, Martin Steigerwald
ps -aux output prior to rebooting the machine (13.85 KB, text/plain)
2010-10-27 10:26 UTC, Martin Steigerwald
some other stuff from proc that might be useful (656.09 KB, application/x-bzip-compressed-tar)
2010-10-27 12:49 UTC, Martin Steigerwald

Description Martin Steigerwald 2010-10-27 10:24:48 UTC
Created attachment 35202
syslog

The bug title is rather unspecific because I do not know exactly where to file this.

This is with:

martin@shambhala:~> cat /proc/version
Linux version 2.6.36-tp42-gtt-vram-apart-v2-dirty (martin@shambhala) (gcc version 4.4.5 (Debian 4.4.5-4) ) #1 PREEMPT Thu Oct 21 12:10:49 CEST 2010

I have only seen it once so far. This is an upstream kernel with a patch from

Bug 28402 - random radeon/kms/drm related freezes with kernel 2.6.34
https://bugs.freedesktop.org/show_bug.cgi?id=28402#c82

While I was using one of two KDE sessions on my ThinkPad T42, the X.org display was suddenly replaced by parts of a backtrace, which I have now dug out of the syslog:

Oct 27 09:25:35 shambhala kernel: BUG: unable to handle kernel paging request at 9a7a7a86
Oct 27 09:25:35 shambhala kernel: IP: [<c10bb66e>] vma_prio_tree_next+0x2e/0x50
Oct 27 09:25:35 shambhala kernel: *pde = 00000000 
Oct 27 09:25:35 shambhala kernel: Oops: 0000 [#1] PREEMPT 
Oct 27 09:25:35 shambhala kernel: last sysfs file: /sys/devices/pci0000:00/0000:00:1e.0/0000:02:02.0/rf_kill
Oct 27 09:25:35 shambhala kernel: Modules linked in: ufs qnx4 hfsplus hfs minix jfs ext2 tp_smapi hdaps thinkpad_ec vboxnetadp vboxnetflt vboxdrv snd_usb_audio snd_hwdep snd_usbmidi_lib sata_sil acpi_cpufreq mperf cpufreq_userspace cpufreq_stats cpufreq_conservative cpufreq_powersave ppdev lp cn ipv6 microcode fuse tun usblp ohci_hcd usb_storage usb_libusual nls_iso8859_15 nls_iso8859_1 nls_cp850 ntfs vfat msdos fat reiserfs isofs udf crc_itu_t smbfs pktcdvd uinput radeon ttm drm_kms_helper drm i2c_algo_bit input_polldev dm_crypt snd_intel8x0m snd_intel8x0 thinkpad_acpi hwmon snd_ac97_codec ac97_bus snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_pcm snd_seq_oss snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq snd_timer pcmcia ipw2200 libipw snd_seq_device rtc_cmos yenta_socket cfg80211 rtc_core pcmcia_rsrc snd snd_page_alloc video joydev intel_agp output pcmcia_core rtc_lib led_class parport_pc evdev agpgart nvram i2c_i801 rfkill lib80211 parport processor battery ac button dm_mod raid10 raid456 async_pq async_xor 
Oct 27 09:25:35 shambhala kernel: xor async_memcpy async_raid6_recov raid6_pq async_tx raid1 raid0 linear md_mod btrfs zlib_deflate crc32c libcrc32c usbhid uhci_hcd ehci_hcd sg usbcore sr_mod thermal cdrom [last unloaded: hdaps]
Oct 27 09:25:35 shambhala kernel:
Oct 27 09:25:35 shambhala kernel: Pid: 319, comm: kswapd0 Tainted: G        W   2.6.36-tp42-gtt-vram-apart-v2-dirty #1 2373CXG/2373CXG
Oct 27 09:25:35 shambhala kernel: EIP: 0060:[<c10bb66e>] EFLAGS: 00010282 CPU: 0
Oct 27 09:25:35 shambhala kernel: EIP is at vma_prio_tree_next+0x2e/0x50
Oct 27 09:25:35 shambhala kernel: EAX: 9a7a7a52 EBX: c42ce1b8 ECX: 00000000 EDX: f71e5dfc
Oct 27 09:25:35 shambhala kernel: ESI: c1f127e0 EDI: 00000000 EBP: f71e5dd0 ESP: f71e5dd0
Oct 27 09:25:35 shambhala kernel: DS: 007b ES: 007b FS: 0000 GS: 00e0 SS: 0068
Oct 27 09:25:35 shambhala kernel: Process kswapd0 (pid: 319, ti=f71e4000 task=f7163e80 task.ti=f71e4000)
Oct 27 09:25:35 shambhala kernel: Stack:
Oct 27 09:25:35 shambhala kernel: f71e5e28 c10ca52a f71e5e18 f71e5e7c f71e5e70 00000000 ffffffff 00000001
Oct 27 09:25:35 shambhala kernel: <0> 00000020 00000000 00000001 cce2a918 00000004 00000000 00000000 f4989cd0
Oct 27 09:25:35 shambhala kernel: <0> 00000000 00000000 00000028 c1f127e0 f71e5f64 c1f127f8 f71e5e90 c10b51d3
Oct 27 09:25:35 shambhala kernel: Call Trace:
Oct 27 09:25:35 shambhala kernel: [<c10ca52a>] ? page_referenced+0xaa/0x2d0
Oct 27 09:25:35 shambhala kernel: [<c10b51d3>] ? shrink_active_list+0x183/0x310
Oct 27 09:25:35 shambhala kernel: [<c10b65a3>] ? shrink_zone+0x3c3/0x5b0
Oct 27 09:25:35 shambhala kernel: [<c10dd92e>] ? __mem_cgroup_largest_soft_limit_node+0x5e/0x150
Oct 27 09:25:35 shambhala kernel: [<c10b6df4>] ? kswapd+0x534/0x790
Oct 27 09:25:35 shambhala kernel: [<c1055460>] ? autoremove_wake_function+0x0/0x50
Oct 27 09:25:35 shambhala kernel: [<c10b68c0>] ? kswapd+0x0/0x790
Oct 27 09:25:35 shambhala kernel: [<c1054f04>] ? kthread+0x74/0x80
Oct 27 09:25:35 shambhala kernel: [<c1054e90>] ? kthread+0x0/0x80
Oct 27 09:25:35 shambhala kernel: [<c100347e>] ? kernel_thread_helper+0x6/0x10
Oct 27 09:25:35 shambhala kernel: Code: 0f 1f 44 00 00 85 c0 74 29 8b 48 30 85 c9 74 15 8b 40 34 85 c0 74 1b 8b 50 28 0f 18 02 90 5d c3 90 8d 74 26 00 8b 40 28 83 e8 28 <8b> 48 34 85 c9 74 e5 89 d0 e8 a4 5c 1a 00 85 c0 74 0c 83 e8 28 
Oct 27 09:25:35 shambhala kernel: EIP: [<c10bb66e>] vma_prio_tree_next+0x2e/0x50 SS:ESP 0068:f71e5dd0
Oct 27 09:25:35 shambhala kernel: CR2: 000000009a7a7a86
Oct 27 09:25:35 shambhala kernel: ---[ end trace f13586feeb79aae5 ]---
Oct 27 09:25:35 shambhala kernel: note: kswapd0[319] exited with preempt_count 1
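
The register dump above is enough to cross-check the faulting access by hand. The following Python sketch is only an illustration and not part of the original report. It takes EAX, CR2 and the "Oops:" error code from the log, assumes that the marked code bytes "<8b> 48 34" decode to mov 0x34(%eax),%ecx (my own reading of the opcode), and decodes the error code with the standard x86 page-fault bit layout.

```python
# Values copied from the first oops in the syslog excerpt.
EAX = 0x9A7A7A52
CR2 = 0x9A7A7A86
ERROR_CODE = 0x0000  # "Oops: 0000"

# The faulting bytes "<8b> 48 34" read as: mov 0x34(%eax),%ecx
# (assumption: my own decoding of the opcode, not stated in the report).
DISPLACEMENT = 0x34


def decode_page_fault(code):
    """Decode the low three bits of an x86 page-fault error code."""
    return {
        "cause": "protection violation" if code & 0x1 else "page not present",
        "access": "write" if code & 0x2 else "read",
        "mode": "user" if code & 0x4 else "kernel",
    }


fault_address = EAX + DISPLACEMENT
info = decode_page_fault(ERROR_CODE)

print(f"EAX + 0x34 = {fault_address:#010x}, CR2 = {CR2:#010x}")
print(f"{info['mode']}-mode {info['access']}, {info['cause']}")
```

If this decoding is right, kswapd tried to read a field at offset 0x34 of a structure whose address (0x9a7a7a52) does not look like the other kernel pointers in the dump, which start with 0xc or 0xf.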

After this, the KDE Plasma desktop of that session no longer responded to any clicks, while other KDE applications on the same desktop and the other KDE session still worked. ps aux | grep plasma-desktop showed the process in D state (uninterruptible sleep). An strace -p on the PID produced no output.

To get the KDE session working again, I tried to kill its only child, ksysguard, but that left it in zombie state. I then ran killall -u on the username, which triggered:

Oct 27 09:52:49 shambhala kernel: BUG: unable to handle kernel paging request at 9a7a7a7e
Oct 27 09:52:49 shambhala kernel: IP: [<c10bb811>] vma_prio_tree_remove+0xd1/0xf0
Oct 27 09:52:49 shambhala kernel: *pde = 00000000 
Oct 27 09:52:49 shambhala kernel: Oops: 0002 [#2] PREEMPT 
Oct 27 09:52:49 shambhala kernel: last sysfs file: /sys/devices/pci0000:00/0000:00:1e.0/0000:02:02.0/rf_kill
Oct 27 09:52:49 shambhala kernel: Modules linked in: ufs qnx4 hfsplus hfs minix jfs ext2 tp_smapi hdaps thinkpad_ec vboxnetadp vboxnetflt vboxdrv snd_usb_audio snd_hwdep snd_usbmidi_lib sata_sil acpi_cpufreq mperf cpufreq_userspace cpufreq_stats cpufreq_conservative cpufreq_powersave ppdev lp cn ipv6 microcode fuse tun usblp ohci_hcd usb_storage usb_libusual nls_iso8859_15 nls_iso8859_1 nls_cp850 ntfs vfat msdos fat reiserfs isofs udf crc_itu_t smbfs pktcdvd uinput radeon ttm drm_kms_helper drm i2c_algo_bit input_polldev dm_crypt snd_intel8x0m snd_intel8x0 thinkpad_acpi hwmon snd_ac97_codec ac97_bus snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_pcm snd_seq_oss snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq snd_timer pcmcia ipw2200 libipw snd_seq_device rtc_cmos yenta_socket cfg80211 rtc_core pcmcia_rsrc snd snd_page_alloc video joydev intel_agp output pcmcia_core rtc_lib led_class parport_pc evdev agpgart nvram i2c_i801 rfkill lib80211 parport processor battery ac button dm_mod raid10 raid456 async_pq async_xor
Oct 27 09:52:49 shambhala kernel: xor async_memcpy async_raid6_recov raid6_pq async_tx raid1 raid0 linear md_mod btrfs zlib_deflate crc32c libcrc32c usbhid uhci_hcd ehci_hcd sg usbcore sr_mod thermal cdrom [last unloaded: hdaps]
Oct 27 09:52:49 shambhala kernel:
Oct 27 09:52:49 shambhala kernel: Pid: 13466, comm: akonadi_nepomuk Tainted: G      D W   2.6.36-tp42-gtt-vram-apart-v2-dirty #1 2373CXG/2373CXG
Oct 27 09:52:49 shambhala kernel: EIP: 0060:[<c10bb811>] EFLAGS: 00210246 CPU: 0
Oct 27 09:52:49 shambhala kernel: EIP is at vma_prio_tree_remove+0xd1/0xf0
Oct 27 09:52:49 shambhala kernel: EAX: c42ce1b8 EBX: 9a7a7a7a ECX: ef61ca78 EDX: c42ce1e0
Oct 27 09:52:49 shambhala kernel: ESI: c42ce1b8 EDI: 00000000 EBP: cb10bd88 ESP: cb10bd78
Oct 27 09:52:49 shambhala kernel: DS: 007b ES: 007b FS: 0000 GS: 00e0 SS: 0068
Oct 27 09:52:49 shambhala kernel: Process akonadi_nepomuk (pid: 13466, ti=cb10a000 task=c356d780 task.ti=cb10a000)
Oct 27 09:52:49 shambhala kernel: Stack:
Oct 27 09:52:49 shambhala kernel: f4989cd0 00000000 c42ce1b8 b1fbd000 cb10bd94 c10c651b cb10a000 cb10bda0
Oct 27 09:52:49 shambhala kernel: <0> c10c67f3 c42ce160 cb10bdc4 c10c13f3 00000000 b1c00000 00000000 c1560aa4
Oct 27 09:52:49 shambhala kernel: <0> d378bf20 f6bf9380 00000000 cb10bde8 c10c88c6 00000000 cb10bdd8 00000000
Oct 27 09:52:49 shambhala kernel: Call Trace:
Oct 27 09:52:49 shambhala kernel: [<c10c651b>] ? __remove_shared_vm_struct+0x3b/0x60
Oct 27 09:52:49 shambhala kernel: [<c10c67f3>] ? unlink_file_vma+0x23/0x50
Oct 27 09:52:49 shambhala kernel: [<c10c13f3>] ? free_pgtables+0x53/0xa0
Oct 27 09:52:49 shambhala kernel: [<c10c88c6>] ? exit_mmap+0xd6/0x180
Oct 27 09:52:49 shambhala kernel: [<c10356eb>] ? mmput+0x3b/0xe0
Oct 27 09:52:49 shambhala kernel: [<c103a2bb>] ? exit_mm+0xdb/0xf0
Oct 27 09:52:49 shambhala kernel: [<c103babd>] ? do_exit+0x5bd/0x710
Oct 27 09:52:49 shambhala kernel: [<c1048d64>] ? __dequeue_signal+0x14/0x120
Oct 27 09:52:49 shambhala kernel: [<c1048ea2>] ? dequeue_signal+0x32/0x190
Oct 27 09:52:49 shambhala kernel: [<c103bc43>] ? do_group_exit+0x33/0xa0
Oct 27 09:52:49 shambhala kernel: [<c104b038>] ? get_signal_to_deliver+0x298/0x4e0
Oct 27 09:52:49 shambhala kernel: [<c108438c>] ? __delayacct_blkio_end+0x4c/0x50
Oct 27 09:52:49 shambhala kernel: [<c1023790>] ? do_page_fault+0x0/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c10023e0>] ? do_signal+0x70/0xaa0
Oct 27 09:52:49 shambhala kernel: [<c10a7d94>] ? unlock_page+0x44/0x50
Oct 27 09:52:49 shambhala kernel: [<c10c3e76>] ? handle_mm_fault+0x4a6/0x870
Oct 27 09:52:49 shambhala kernel: [<c1023946>] ? do_page_fault+0x1b6/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c10e2f3e>] ? vfs_read+0x14e/0x160
Oct 27 09:52:49 shambhala kernel: [<c10e3f70>] ? fget_light+0x90/0xa0
Oct 27 09:52:49 shambhala kernel: [<c1023790>] ? do_page_fault+0x0/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c1002e65>] ? do_notify_resume+0x55/0x70
Oct 27 09:52:49 shambhala kernel: [<c13ff754>] ? work_notifysig+0x13/0x1b
Oct 27 09:52:49 shambhala kernel: [<c13f0000>] ? packet_sendmsg+0x6f0/0x950
Oct 27 09:52:49 shambhala kernel: Code: eb b3 90 8d b4 26 00 00 00 00 8d 50 28 8b 45 f0 e8 45 5d 1a 00 c7 43 34 00 00 00 00 eb 9a 8d 74 26 00 8b 58 28 8d 50 28 8b 48 2c <89> 4b 04 89 19 89 50 28 89 50 2c eb 80 0f 0b eb fe 8d b4 26 00 
Oct 27 09:52:49 shambhala kernel: EIP: [<c10bb811>] vma_prio_tree_remove+0xd1/0xf0 SS:ESP 0068:cb10bd78
Oct 27 09:52:49 shambhala kernel: CR2: 000000009a7a7a7e
Oct 27 09:52:49 shambhala kernel: ---[ end trace f13586feeb79aae6 ]---
Oct 27 09:52:49 shambhala kernel: Fixing recursive fault but reboot is needed!
Oct 27 09:52:49 shambhala kernel: BUG: scheduling while atomic: akonadi_nepomuk/13466/0x00000003
Oct 27 09:52:49 shambhala kernel: Modules linked in: ufs qnx4 hfsplus hfs minix jfs ext2 tp_smapi hdaps thinkpad_ec vboxnetadp vboxnetflt vboxdrv snd_usb_audio snd_hwdep snd_usbmidi_lib sata_sil acpi_cpufreq mperf cpufreq_userspace cpufreq_stats cpufreq_conservative cpufreq_powersave ppdev lp cn ipv6 microcode fuse tun usblp ohci_hcd usb_storage usb_libusual nls_iso8859_15 nls_iso8859_1 nls_cp850 ntfs vfat msdos fat reiserfs isofs udf crc_itu_t smbfs pktcdvd uinput radeon ttm drm_kms_helper drm i2c_algo_bit input_polldev dm_crypt snd_intel8x0m snd_intel8x0 thinkpad_acpi hwmon snd_ac97_codec ac97_bus snd_pcm_oss snd_mixer_oss snd_seq_dummy snd_pcm snd_seq_oss snd_seq_midi snd_rawmidi snd_seq_midi_event snd_seq snd_timer pcmcia ipw2200 libipw snd_seq_device rtc_cmos yenta_socket cfg80211 rtc_core pcmcia_rsrc snd snd_page_alloc video joydev intel_agp output pcmcia_core rtc_lib led_class parport_pc evdev agpgart nvram i2c_i801 rfkill lib80211 parport processor battery ac button dm_mod raid10 raid456 async_pq async_xor 
Oct 27 09:52:49 shambhala kernel: xor async_memcpy async_raid6_recov raid6_pq async_tx raid1 raid0 linear md_mod btrfs zlib_deflate crc32c libcrc32c usbhid uhci_hcd ehci_hcd sg usbcore sr_mod thermal cdrom [last unloaded: hdaps]
Oct 27 09:52:49 shambhala kernel: Pid: 13466, comm: akonadi_nepomuk Tainted: G      D W   2.6.36-tp42-gtt-vram-apart-v2-dirty #1
Oct 27 09:52:49 shambhala kernel: Call Trace:
Oct 27 09:52:49 shambhala kernel: [<c102ba91>] __schedule_bug+0x61/0x70
Oct 27 09:52:49 shambhala kernel: [<c13fd212>] schedule+0x4d2/0x5d0
Oct 27 09:52:49 shambhala kernel: [<c103dfc4>] ? irq_exit+0x34/0x80
Oct 27 09:52:49 shambhala kernel: [<c103bb10>] do_exit+0x610/0x710
Oct 27 09:52:49 shambhala kernel: [<c1038065>] ? kmsg_dump+0x145/0x150
Oct 27 09:52:49 shambhala kernel: [<c13fcc9c>] ? printk+0x1d/0x21
Oct 27 09:52:49 shambhala kernel: [<c1005f18>] oops_end+0x68/0xa0
Oct 27 09:52:49 shambhala kernel: [<c13fcc9c>] ? printk+0x1d/0x21
Oct 27 09:52:49 shambhala kernel: [<c1023456>] no_context+0xc6/0x160
Oct 27 09:52:49 shambhala kernel: [<c1023587>] __bad_area_nosemaphore+0x97/0x130
Oct 27 09:52:49 shambhala kernel: [<c1023637>] bad_area_nosemaphore+0x17/0x20
Oct 27 09:52:49 shambhala kernel: [<c1023aee>] do_page_fault+0x35e/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c10ad1c2>] ? free_pages_prepare+0x102/0x170
Oct 27 09:52:49 shambhala kernel: [<c1023790>] ? do_page_fault+0x0/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c13ffa9d>] error_code+0x65/0x6c
Oct 27 09:52:49 shambhala kernel: [<c10bb811>] ? vma_prio_tree_remove+0xd1/0xf0
Oct 27 09:52:49 shambhala kernel: [<c10c651b>] __remove_shared_vm_struct+0x3b/0x60
Oct 27 09:52:49 shambhala kernel: [<c10c67f3>] unlink_file_vma+0x23/0x50
Oct 27 09:52:49 shambhala kernel: [<c10c13f3>] free_pgtables+0x53/0xa0
Oct 27 09:52:49 shambhala kernel: [<c10c88c6>] exit_mmap+0xd6/0x180
Oct 27 09:52:49 shambhala kernel: [<c10356eb>] mmput+0x3b/0xe0
Oct 27 09:52:49 shambhala kernel: [<c103a2bb>] exit_mm+0xdb/0xf0
Oct 27 09:52:49 shambhala kernel: [<c103babd>] do_exit+0x5bd/0x710
Oct 27 09:52:49 shambhala kernel: [<c1048d64>] ? __dequeue_signal+0x14/0x120
Oct 27 09:52:49 shambhala kernel: [<c1048ea2>] ? dequeue_signal+0x32/0x190
Oct 27 09:52:49 shambhala kernel: [<c103bc43>] do_group_exit+0x33/0xa0
Oct 27 09:52:49 shambhala kernel: [<c104b038>] get_signal_to_deliver+0x298/0x4e0
Oct 27 09:52:49 shambhala kernel: [<c108438c>] ? __delayacct_blkio_end+0x4c/0x50
Oct 27 09:52:49 shambhala kernel: [<c1023790>] ? do_page_fault+0x0/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c10023e0>] do_signal+0x70/0xaa0
Oct 27 09:52:49 shambhala kernel: [<c10a7d94>] ? unlock_page+0x44/0x50
Oct 27 09:52:49 shambhala kernel: [<c10c3e76>] ? handle_mm_fault+0x4a6/0x870
Oct 27 09:52:49 shambhala kernel: [<c1023946>] ? do_page_fault+0x1b6/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c10e2f3e>] ? vfs_read+0x14e/0x160
Oct 27 09:52:49 shambhala kernel: [<c10e3f70>] ? fget_light+0x90/0xa0
Oct 27 09:52:49 shambhala kernel: [<c1023790>] ? do_page_fault+0x0/0x3f0
Oct 27 09:52:49 shambhala kernel: [<c1002e65>] do_notify_resume+0x55/0x70
Oct 27 09:52:49 shambhala kernel: [<c13ff754>] work_notifysig+0x13/0x1b
Oct 27 09:52:49 shambhala kernel: [<c13f0000>] ? packet_sendmsg+0x6f0/0x950
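
Comparing the register dumps of the two oopses suggests that both faults may trace back to the same bad pointer value. The Python sketch below is a hedged illustration, not something from the original report. The instruction decodings in the comments are my own reading of the "Code:" bytes, and the offset 0x28 is simply what those bytes use.

```python
# Register values copied from the two oopses above.
first = {"EAX": 0x9A7A7A52, "EBX": 0xC42CE1B8, "CR2": 0x9A7A7A86}
second = {"EAX": 0xC42CE1B8, "EBX": 0x9A7A7A7A, "CR2": 0x9A7A7A7E}

# Assumed decodings of the code bytes around the faulting instructions:
#   oops 1: 8b 40 28    mov 0x28(%eax),%eax
#           83 e8 28    sub $0x28,%eax
#           <8b> 48 34  mov 0x34(%eax),%ecx
#   oops 2: 8b 58 28    mov 0x28(%eax),%ebx
#           <89> 4b 04  mov %ecx,0x4(%ebx)
LIST_OFFSET = 0x28

# Pointer that oops 1 must have loaded before subtracting 0x28 from it.
loaded_in_first = first["EAX"] + LIST_OFFSET
# Pointer that oops 2 loaded from offset 0x28 and then wrote through.
loaded_in_second = second["EBX"]

print(f"pointer loaded in oops 1: {loaded_in_first:#010x}")
print(f"pointer loaded in oops 2: {loaded_in_second:#010x}")
print(f"oops 2 write target:      {loaded_in_second + 4:#010x}")
print(f"same structure address in both dumps: {first['EBX'] == second['EAX']}")
```

Under these assumptions, both paths read 0x9a7a7a7a from offset 0x28 of a structure, and 0xc42ce1b8 shows up in both dumps. That would be consistent with a single corrupted list pointer that kswapd hit first and the exiting akonadi_nepomuk hit again later, but it does not say what overwrote it.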

Following the hint that a reboot was needed, I gracefully killed the remaining user processes, including those of the other KDE session, and initiated a reboot. I couldn't leave the machine in that state any longer because I am currently teaching a Linux performance tuning training and need that laptop for work ;).

Some time before the reboot I had lots of processes in D state. I'll add a ps-aux.txt as an attachment.
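
For anyone who wants to capture the same information in a script, a minimal Python sketch for listing processes in D state follows. It is an illustration only. It assumes the usual /proc/<pid>/stat layout, where the state letter is the first field after the parenthesised command name; the function names are made up for this example.

```python
import os


def parse_stat(stat_text):
    """Return (pid, comm, state) from the contents of /proc/<pid>/stat."""
    # comm is wrapped in parentheses and may itself contain spaces or ')',
    # so split on the last ')' rather than on whitespace.
    open_paren = stat_text.index("(")
    close_paren = stat_text.rindex(")")
    pid = int(stat_text[:open_paren])
    comm = stat_text[open_paren + 1:close_paren]
    state = stat_text[close_paren + 1:].split()[0]
    return pid, comm, state


def uninterruptible_tasks(proc_root="/proc"):
    """Yield (pid, comm) for every process currently in state D."""
    for entry in os.listdir(proc_root):
        if not entry.isdigit():
            continue
        try:
            with open(os.path.join(proc_root, entry, "stat")) as handle:
                pid, comm, state = parse_stat(handle.read())
        except OSError:
            continue  # the process exited while we were scanning
        if state == "D":
            yield pid, comm


if __name__ == "__main__" and os.path.isdir("/proc"):
    for pid, comm in uninterruptible_tasks():
        print(pid, comm)
```

This only looks at thread group leaders; a per-thread view would also have to walk /proc/<pid>/task.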

Some things that come to mind:

1) This laptop uses in-kernel suspend and had gone through quite a few hibernation cycles prior to the bug.

2) I have the libcgroup / cgroup-bin 0.36.2-3 Debian packages installed, currently with the default configuration except that the memory controller is also activated.

shambhala:~> grep CGROUP_MEM /boot/config-2.6.36-tp42-gtt-vram-apart-v2-dirty  
CONFIG_CGROUP_MEM_RES_CTLR=y
CONFIG_CGROUP_MEM_RES_CTLR_SWAP=y

For demonstration purposes I did a mkdir in the cgroup filesystem and added the current shell to the new group to show the memory usage that the memory controller accounts to it. I did not set any limit.

I'll add the complete syslog and an archive with quite a lot of material from /proc that might turn out to be useful later today, during breaks in the training. The syslog already includes the reboot after the bug happened; the traces come before it. Before them there is also a trace related to eSATA PCMCIA: I am using an eSATA PCMCIA controller for my external hard disks.
Comment 1 Martin Steigerwald 2010-10-27 10:26:26 UTC
Created attachment 35212
ps -aux output prior to rebooting the machine

BTW, I have no idea how reproducible the bug is, since it happened all of a sudden and only once so far.
Comment 2 Martin Steigerwald 2010-10-27 10:37:34 UTC
The memory situation appeared to be fine shortly after the bug happened:

martin@shambhala:~/Linux/Kernel/Mainline/Bugs/unable-to-handle-paging-request> cat free-m.txt 
             total       used       free     shared    buffers     cached
Mem:          2014       1047        966          0        121        443
-/+ buffers/cache:        482       1531
Swap:         3906         41       3864
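
As a quick sanity check of the numbers above, the "-/+ buffers/cache" row can be recomputed from the "Mem:" row. The short Python sketch below is only an illustration; the variable names are mine, and the values are copied from the output.

```python
# Values (in MiB) copied from the "Mem:" row of the free -m output above.
total, used, free, buffers, cached = 2014, 1047, 966, 121, 443

# Memory actually held by applications, and memory available to them
# once buffers and page cache are counted as reclaimable.
used_by_apps = used - buffers - cached
available = free + buffers + cached

print(f"used minus buffers/cache: {used_by_apps} MiB (free reports 482)")
print(f"free plus buffers/cache:  {available} MiB (free reports 1531)")
```

The results differ from the reported row by 1 MiB each, which is presumably because free rounds every column to whole MiB separately. Either way, roughly 1.5 GiB of the 2 GiB was free or easily reclaimable.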

I am using the Radeon DRM KMS driver with OpenGL compositing on both desktops. Between the last reboot and the bug, I once had some graphics corruption in my first KDE session. I had already seen this quite a few times with 2.6.35, but only with two KDE sessions running, never with just one. I simply restarted the desktop to get rid of the corruption.

I also wondered whether the system was short on lowmem pages, but it doesn't appear so.

martin@shambhala:~/Linux/Kernel/Mainline/Bugs/unable-to-handle-paging-request> cat proc/zoneinfo 
Node 0, zone      DMA
  pages free     953
        min      16
        low      20
        high     24
        scanned  0
        spanned  4095
        present  3966
    nr_free_pages 953
    nr_inactive_anon 35
    nr_active_anon 0
    nr_inactive_file 1891
    nr_active_file 866
    nr_unevictable 0
    nr_mlock     0
    nr_anon_pages 31
    nr_mapped    42
    nr_file_pages 2762
    nr_dirty     0
    nr_writeback 0
    nr_slab_reclaimable 171
    nr_slab_unreclaimable 13
    nr_page_table_pages 0
    nr_kernel_stack 0
    nr_unstable  0
    nr_bounce    0
    nr_vmscan_write 20
    nr_writeback_temp 0
    nr_isolated_anon 0
    nr_isolated_file 0
    nr_shmem     0
        protection: (0, 865, 2015, 2015)
  pagesets
    cpu: 0
              count: 0
              high:  0
              batch: 1
  all_unreclaimable: 0
  start_pfn:         1
  inactive_ratio:    1
Node 0, zone   Normal
  pages free     80137
        min      932
        low      1165
        high     1398
        scanned  0
        spanned  223230
        present  221486
    nr_free_pages 80137
    nr_inactive_anon 82
    nr_active_anon 1059
    nr_inactive_file 22327
    nr_active_file 20378
    nr_unevictable 0
    nr_mlock     0
    nr_anon_pages 1079
    nr_mapped    113
    nr_file_pages 42770
    nr_dirty     13
    nr_writeback 0
    nr_slab_reclaimable 26184
    nr_slab_unreclaimable 5173
    nr_page_table_pages 740
    nr_kernel_stack 177
    nr_unstable  0
    nr_bounce    0
    nr_vmscan_write 59358
    nr_writeback_temp 0
    nr_isolated_anon 0
    nr_isolated_file 0
    nr_shmem     3
        protection: (0, 0, 9202, 9202)
  pagesets
    cpu: 0
              count: 109
              high:  186
              batch: 31
  all_unreclaimable: 0
  start_pfn:         4096
  inactive_ratio:    1
Node 0, zone  HighMem
  pages free     166527
        min      128
        low      437
        high     747
        scanned  0
        spanned  296786
        present  294467
    nr_free_pages 166527
    nr_inactive_anon 12693
    nr_active_anon 17015
    nr_inactive_file 55286
    nr_active_file 43250
    nr_unevictable 0
    nr_mlock     0
    nr_anon_pages 26796
    nr_mapped    7685
    nr_file_pages 102673
    nr_dirty     3
    nr_writeback 0
    nr_slab_reclaimable 0
    nr_slab_unreclaimable 0
    nr_page_table_pages 0
    nr_kernel_stack 0
    nr_unstable  0
    nr_bounce    0
    nr_vmscan_write 298518
    nr_writeback_temp 0
    nr_isolated_anon 0
    nr_isolated_file 32
    nr_shmem     439
        protection: (0, 0, 0, 0)
  pagesets
    cpu: 0
              count: 34
              high:  186
              batch: 31
  all_unreclaimable: 0
  start_pfn:         227326
  inactive_ratio:    3
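
To put the zoneinfo numbers into perspective, here is a small Python sketch that converts the free page counts to MiB and compares them with each zone's high watermark. It is an illustration only and assumes 4 KiB pages, which is the usual page size on 32-bit x86; the counts are copied from the dump above.

```python
PAGE_KIB = 4  # assumption: 4 KiB pages, the usual size on 32-bit x86

# (free pages, high watermark) per zone, copied from the zoneinfo dump above.
zones = {
    "DMA": (953, 24),
    "Normal": (80137, 1398),
    "HighMem": (166527, 747),
}


def mib(pages):
    """Convert a page count to MiB."""
    return pages * PAGE_KIB / 1024


for name, (free_pages, high) in zones.items():
    print(f"{name:8} free {mib(free_pages):7.1f} MiB, "
          f"{free_pages / high:6.1f}x the high watermark")

lowmem_free = zones["DMA"][0] + zones["Normal"][0]
total_free = lowmem_free + zones["HighMem"][0]
print(f"lowmem free: {mib(lowmem_free):.1f} MiB")
print(f"total free:  {mib(total_free):.1f} MiB")
```

With these numbers, lowmem (DMA plus Normal) has over 300 MiB free and every zone sits far above its high watermark, so a lowmem shortage at the time of the snapshot looks unlikely. The total comes out close to the 966 MiB that free -m reported.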
Comment 3 Martin Steigerwald 2010-10-27 12:49:28 UTC
Created attachment 35222
some other stuff from proc that might be useful

martin@shambhala:~/Linux/Kernel/Mainline/Bugs/unable-to-handle-paging-request> ls -lh proc
total 10M
-r--r--r-- 1 martin martin  300 27. Okt 10:01 buddyinfo
-r--r--r-- 1 martin martin 2,0M 27. Okt 10:02 kallsyms
-r-------- 1 martin martin 4,0M 27. Okt 10:00 kpagecount
-r-------- 1 martin martin 4,0M 27. Okt 10:00 kpageflags
-r--r--r-- 1 martin martin 1,1K 27. Okt 10:00 meminfo
-r--r--r-- 1 martin martin 2,3K 27. Okt 10:00 pagetypeinfo
-r--r--r-- 1 martin martin 3,5K 27. Okt 10:02 sched_debug
-r--r--r-- 1 martin martin  12K 27. Okt 10:02 slabinfo
-r--r--r-- 1 martin martin  104 27. Okt 10:02 swaps
-r--r--r-- 1 martin martin  148 27. Okt 10:06 version
-r-------- 1 martin martin  12K 27. Okt 10:00 vmallocinfo
-r--r--r-- 1 martin martin 1,5K 27. Okt 10:00 vmstat
-r--r--r-- 1 martin martin 2,7K 27. Okt 10:03 zoneinfo
Comment 4 Andrew Morton 2010-11-05 20:44:54 UTC
gee, hard. Could be a transient hardware failure, could be a sporadic memory scribble from some errant code (DRI would be a good suspect).

All I can suggest is that you enable as many debug options as possible (especially memory-related ones) then see if it happens again, and if some pattern can be determined.
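
One way to see which memory-related debug options a given kernel config is still missing is sketched below in Python. The candidate list is an assumption on my part and not a recommendation made in this thread. The names are mainline Kconfig symbols as far as I know, and whether the slab or SLUB variant applies depends on the allocator the kernel is built with.

```python
# Candidate options (assumption: my own selection, not from this thread).
CANDIDATES = [
    "CONFIG_DEBUG_LIST",
    "CONFIG_DEBUG_VM",
    "CONFIG_DEBUG_PAGEALLOC",
    "CONFIG_DEBUG_SLAB",
    "CONFIG_SLUB_DEBUG_ON",
]


def enabled_options(config_text):
    """Return the set of options set to y or m in a kernel .config."""
    enabled = set()
    for line in config_text.splitlines():
        line = line.strip()
        if line.startswith("#") or "=" not in line:
            continue  # skips comments such as "# CONFIG_FOO is not set"
        name, value = line.split("=", 1)
        if value in ("y", "m"):
            enabled.add(name)
    return enabled


def missing_debug_options(config_text, candidates=CANDIDATES):
    """List the candidate options that the config does not enable."""
    enabled = enabled_options(config_text)
    return [name for name in candidates if name not in enabled]


# Example usage with the config file named earlier in this report:
#   with open("/boot/config-2.6.36-tp42-gtt-vram-apart-v2-dirty") as f:
#       print(missing_debug_options(f.read()))
```

Since both oopses are in code that walks or unlinks list entries, list debugging seems like a reasonable first candidate, but that is a guess based on the traces alone.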
Comment 5 Martin Steigerwald 2010-11-06 08:50:25 UTC
Thanks. I haven't seen this again so far, so I am closing it as unreproducible. If it happens again, I will consider enabling debug options and waiting until it happens a third time. Which ones would you recommend?
