Bug 16616

Summary: kmemcheck: Fatal error since 2.6.36-rc1
Product: Memory Management Reporter: Christian Casteyde (casteyde.christian)
Component: OtherAssignee: Andrew Morton (akpm)
Status: RESOLVED DUPLICATE    
Severity: normal    
Priority: P1    
Hardware: All   
OS: Linux   
Kernel Version: 2.6.36-rc1 Subsystem:
Regression: Yes Bisected commit-id:

Description Christian Casteyde 2010-08-17 19:59:08 UTC
Kernel: 2.6.36-rc1
Athlon 64X2 4200 in 64bits mode
Slackware64 13.1
1GB RAM

While booting 2.6.36-rc1 with kmemcheck and several kernel hacking options, I get the following kmemcheck error:

Write protecting the kernel read-only data: 10240k
Freeing unused kernel memory: 392k freed
Freeing unused kernel memory: 1956k freed
udevd (1466): /proc/1466/oom_adj is deprecated, please use /proc/1466/oom_score_adj instead.
C-Media PCI 0000:02:07.0: PCI INT A -> GSI 19 (level, low) -> IRQ 19
ERROR: kmemcheck: Fatal error

Pid: 1591, comm: modprobe Not tainted 2.6.36-rc1 #2 K8 Combo-Z/K8 Combo-Z
RIP: 0010:[<ffffffff81013ff1>]  [<ffffffff81013ff1>] x86_pmu_handle_irq+0x81/0x180
RSP: 0018:ffff880002607d78  EFLAGS: 00010086
RAX: 0000000000000000 RBX: ffff88003f808400 RCX: 00000000ffffffff
RDX: ffff880002607ef8 RSI: 0000000000000400 RDI: ffff880002607f58
RBP: ffff880002607e18 R08: 0000000000000000 R09: 0000000000000030
R10: 0000000000000000 R11: 0000000000000001 R12: 0000000000000000
R13: ffff88000260b7c0 R14: ffff88000260b5c0 R15: 0000000000000040
FS:  00007f8d95a64700(0000) GS:ffff880002600000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: ffff88003f8367b8 CR3: 000000003d0dc000 CR4: 00000000000006f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff4ff0 DR7: 0000000000000400
 [<ffffffff8102b794>] kmemcheck_error_save_bug+0xb4/0xe0
 [<ffffffff8102c1b8>] kmemcheck_access+0xe8/0x4a0
 [<ffffffff8102c5e2>] kmemcheck_fault+0x72/0x80
 [<ffffffff81026b39>] do_page_fault+0x359/0x440
 [<ffffffff81596a1f>] page_fault+0x1f/0x30
 [<ffffffff81010bab>] perf_event_nmi_handler+0x4b/0x60
 [<ffffffff81065e40>] notifier_call_chain+0x50/0xa0
 [<ffffffff810663f1>] __atomic_notifier_call_chain+0x71/0xd0
 [<ffffffff81066461>] atomic_notifier_call_chain+0x11/0x20
 [<ffffffff8106649e>] notify_die+0x2e/0x30
 [<ffffffff81004343>] do_nmi+0x193/0x2b0
 [<ffffffff81596d0a>] nmi+0x1a/0x2c
 [<ffffffff8102bf47>] kmemcheck_read_strict+0x47/0xa0
 [<ffffffff8102bfe5>] kmemcheck_read+0x45/0x70
 [<ffffffff8102c15d>] kmemcheck_access+0x8d/0x4a0
 [<ffffffff8102c5e2>] kmemcheck_fault+0x72/0x80
 [<ffffffff81026b39>] do_page_fault+0x359/0x440
 [<ffffffff81596a1f>] page_fault+0x1f/0x30
 [<ffffffff810ea300>] mntput_no_expire+0x20/0xb0
 [<ffffffff810d047f>] fput+0x1af/0x290
 [<ffffffff810cce58>] filp_close+0x58/0x90
 [<ffffffff810ccf4c>] sys_close+0xbc/0x190
 [<ffffffff810023ab>] system_call_fastpath+0x16/0x1b
 [<ffffffffffffffff>] 0xffffffffffffffff
Intel ICH 0000:00:04.0: PCI INT A -> GSI 18 (level, low) -> IRQ 18
AC'97 1 does not respond - RESET
AC'97 1 access is not valid [0xffffffff], removing mixer.
Unable to initialize codec #1
...

Do not know what caused the nmi at start. The computer runs fine even after that.
Comment 1 Christian Casteyde 2010-08-29 19:51:35 UTC
Still present in 2.6.36-rc3.
Actually, there are many kmemcheck errors reported, very often on filesystem operation. Here are the different dmesg output I got at various time at boot:

dmesg n°1
EXT3-fs (sda1): mounted filesystem with ordered data mode
ADDRCONF(NETDEV_UP): eth0: link is not ready
uli526x: eth0 NIC Link is Up 100 Mbps Full duplex
ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
ERROR: kmemcheck: Fatal error

Pid: 1716, comm: fc-cache Not tainted 2.6.36-rc3 #6 K8 Combo-Z/K8 Combo-Z
RIP: 0010:[<ffffffff81010a50>]  [<ffffffff81010a50>] x86_perf_event_update+0x0/0x80
RSP: 0018:ffff880002607d70  EFLAGS: 00010086
RAX: 0000000000000000 RBX: 0000000000000000 RCX: 00000000ffffffff
RDX: ffff880002607ef8 RSI: 0000000000000400 RDI: ffff88003f808400
RBP: ffff880002607e18 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000004 R11: 0000000000000038 R12: ffff88000260b7c0
R13: ffff88003f808400 R14: ffff88000260b5c0 R15: ffff880002607f58
FS:  00007fa909fb5700(0000) GS:ffff880002600000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: ffff88003f836470 CR3: 000000003dbd8000 CR4: 00000000000006f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff4ff0 DR7: 0000000000000400
 [<ffffffff8102b8a4>] kmemcheck_error_save_bug+0xb4/0xe0
 [<ffffffff8102c2c8>] kmemcheck_access+0xe8/0x4a0
 [<ffffffff8102c6f2>] kmemcheck_fault+0x72/0x80
 [<ffffffff81026c49>] do_page_fault+0x359/0x440
 [<ffffffff81597edf>] page_fault+0x1f/0x30
 [<ffffffff81010d8b>] perf_event_nmi_handler+0x4b/0x60
 [<ffffffff810660d0>] notifier_call_chain+0x50/0xa0
 [<ffffffff81066681>] __atomic_notifier_call_chain+0x71/0xd0
 [<ffffffff810666f1>] atomic_notifier_call_chain+0x11/0x20
 [<ffffffff8106672e>] notify_die+0x2e/0x30
 [<ffffffff81004343>] do_nmi+0x193/0x2b0
 [<ffffffff815981ca>] nmi+0x1a/0x2c
 [<ffffffff8102ca1f>] kmemcheck_opcode_get_primary+0x1f/0x40
 [<ffffffff8102c237>] kmemcheck_access+0x57/0x4a0
 [<ffffffff8102c6cf>] kmemcheck_fault+0x4f/0x80
 [<ffffffff81026c49>] do_page_fault+0x359/0x440
 [<ffffffff81597edf>] page_fault+0x1f/0x30
 [<ffffffff81596e7e>] _raw_spin_lock+0x3e/0x50
 [<ffffffff810e4b44>] __d_lookup+0xe4/0x1c0
 [<ffffffff810d9c05>] do_lookup+0x55/0x170
 [<ffffffff810dbbc0>] link_path_walk+0x130/0xbd0
 [<ffffffff810dd8cc>] do_filp_open+0x16c/0x660
 [<ffffffff810cd7e0>] do_sys_open+0x60/0x120
 [<ffffffff810cd8cb>] sys_open+0x1b/0x20
 [<ffffffff810023ab>] system_call_fastpath+0x16/0x1b
 [<ffffffffffffffff>] 0xffffffffffffffff


reboot, dmesg n°2 :

VFS: Mounted root (ext4 filesystem) readonly on device 8:18.
Freeing unused kernel memory: 460k freed
Write protecting the kernel read-only data: 10240k
Freeing unused kernel memory: 388k freed
Freeing unused kernel memory: 1952k freed
udevd (1462): /proc/1462/oom_adj is deprecated, please use /proc/1462/oom_score_adj instead.
C-Media PCI 0000:02:07.0: PCI INT A -> GSI 19 (level, low) -> IRQ 19
Intel ICH 0000:00:04.0: PCI INT A -> GSI 18 (level, low) -> IRQ 18
ERROR: kmemcheck: Fatal error

Pid: 1614, comm: fstab_import Not tainted 2.6.36-rc3 #6 K8 Combo-Z/K8 Combo-Z
RIP: 0010:[<ffffffff8126cc98>]  [<ffffffff8126cc98>] rb_insert_color+0xa8/0x160
RSP: 0018:ffff88003ce63a78  EFLAGS: 00010246
RAX: ffff88003db025b8 RBX: ffff88003da99f58 RCX: ffff88003da99f58
RDX: ffff88003da99f60 RSI: ffff88003db7b308 RDI: ffff88003da99668
RBP: ffff88003ce63aa8 R08: ffff88003da99f58 R09: 0000000000000000
R10: ffff88003d9fc168 R11: 00000000000001d8 R12: ffff88003db025b8
R13: ffff88003da99f58 R14: ffff88003da99668 R15: ffff88003da99f60
FS:  00007f307a9a5700(0000) GS:ffff880002600000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: ffff88003f80827c CR3: 000000003cc85000 CR4: 00000000000006f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff4ff0 DR7: 0000000000000400
 [<ffffffff8102b8a4>] kmemcheck_error_save_bug+0xb4/0xe0
 [<ffffffff8102bffb>] kmemcheck_show+0x7b/0x90
 [<ffffffff8102c6d7>] kmemcheck_fault+0x57/0x80
 [<ffffffff81026c49>] do_page_fault+0x359/0x440
 [<ffffffff81597edf>] page_fault+0x1f/0x30
 [<ffffffff810b703b>] __vma_link_rb+0x2b/0x30
 [<ffffffff810b70d1>] vma_link+0x91/0xf0
 [<ffffffff810b8c88>] mmap_region+0x418/0x550
 [<ffffffff810b910c>] do_mmap_pgoff+0x34c/0x3a0
 [<ffffffff81116dc0>] elf_map+0x190/0x1c0
 [<ffffffff81118488>] load_elf_binary+0x1698/0x1c40
 [<ffffffff810d4bc9>] search_binary_handler+0xf9/0x2f0
 [<ffffffff810d66cf>] do_execve+0x20f/0x270
 [<ffffffff8100b869>] sys_execve+0x39/0x60
 [<ffffffff810027ca>] stub_execve+0x6a/0xc0
 [<ffffffffffffffff>] 0xffffffffffffffff
AC'97 1 does not respond - RESET
AC'97 1 access is not valid [0xffffffff], removing mixer.
Unable to initialize codec #1
intel8x0_measure_ac97_clock: measured 60345 usecs (2912 samples)
Adding 506012k swap on /dev/sdb1.  Priority:-1 extents:1 across:506012k 
EXT4-fs (sdb2): re-mounted. Opts: (null)
EXT4-fs (sdb2): re-mounted. Opts: (null)
EXT3-fs: barriers not enabled
kjournald starting.  Commit interval 5 seconds
EXT3-fs (sda1): using internal journal
EXT3-fs (sda1): mounted filesystem with ordered data mode
ADDRCONF(NETDEV_UP): eth0: link is not ready
uli526x: eth0 NIC Link is Up 100 Mbps Full duplex
ADDRCONF(NETDEV_CHANGE): eth0: link becomes ready
ERROR: kmemcheck: Fatal error

Pid: 1950, comm: udevadm Not tainted 2.6.36-rc3 #6 K8 Combo-Z/K8 Combo-Z
RIP: 0010:[<ffffffff81010a50>]  [<ffffffff81010a50>] x86_perf_event_update+0x0/0x80

RSP: 0018:ffff880002607d70  EFLAGS: 00010086
RAX: 0000000000000000 RBX: 0000000000000000 RCX: 00000000ffffffff
RDX: ffff880002607ef8 RSI: 0000000000000400 RDI: ffff88003f808400
RBP: ffff880002607e18 R08: 0000000000000000 R09: 0000000000000000
R10: 0000000000000004 R11: 2222222222222222 R12: ffff88000260b7c0
R13: ffff88003f808400 R14: ffff88000260b5c0 R15: ffff880002607f58
FS:  00007f5ab153b700(0000) GS:ffff880002600000(0000) knlGS:0000000000000000
CS:  0010 DS: 0000 ES: 0000 CR0: 0000000080050033
CR2: ffff88003f836420 CR3: 000000003dbda000 CR4: 00000000000006f0
DR0: 0000000000000000 DR1: 0000000000000000 DR2: 0000000000000000
DR3: 0000000000000000 DR6: 00000000ffff4ff0 DR7: 0000000000000400
 [<ffffffff8102b8a4>] kmemcheck_error_save_bug+0xb4/0xe0
 [<ffffffff8102c2c8>] kmemcheck_access+0xe8/0x4a0
 [<ffffffff8102c6f2>] kmemcheck_fault+0x72/0x80
 [<ffffffff81026c49>] do_page_fault+0x359/0x440
 [<ffffffff81597edf>] page_fault+0x1f/0x30
 [<ffffffff81010d8b>] perf_event_nmi_handler+0x4b/0x60
 [<ffffffff810660d0>] notifier_call_chain+0x50/0xa0
 [<ffffffff81066681>] __atomic_notifier_call_chain+0x71/0xd0
 [<ffffffff810666f1>] atomic_notifier_call_chain+0x11/0x20
 [<ffffffff8106672e>] notify_die+0x2e/0x30
 [<ffffffff81004343>] do_nmi+0x193/0x2b0
 [<ffffffff815981ca>] nmi+0x1a/0x2c
 [<ffffffff8102c0f5>] kmemcheck_read+0x45/0x70
 [<ffffffff8102c26d>] kmemcheck_access+0x8d/0x4a0
 [<ffffffff8102c6f2>] kmemcheck_fault+0x72/0x80
 [<ffffffff81026c49>] do_page_fault+0x359/0x440
 [<ffffffff81597edf>] page_fault+0x1f/0x30
 [<ffffffff8112cfba>] sysfs_permission+0x3a/0x70
 [<ffffffff810d95b0>] exec_permission+0x20/0x70
 [<ffffffff810dbb0b>] link_path_walk+0x7b/0xbd0
 [<ffffffff810dc7d0>] path_walk+0x60/0xe0
 [<ffffffff810dc8a3>] do_path_lookup+0x53/0x60
 [<ffffffff810dd4e3>] user_path_at+0x53/0xa0
 [<ffffffff810d3cca>] sys_readlinkat+0x3a/0xa0
 [<ffffffff810d3d46>] sys_readlink+0x16/0x20
 [<ffffffff810023ab>] system_call_fastpath+0x16/0x1b
 [<ffffffffffffffff>] 0xffffffffffffffff
Comment 2 Christian Casteyde 2010-09-22 18:13:08 UTC
Duplicate I think of 17361 where I have other dumps.

*** This bug has been marked as a duplicate of bug 17361 ***