Bug 6535 - Oops with 2.6.16.16 on reiserfs with block-size 512
Summary: Oops with 2.6.16.16 on reiserfs with block-size 512
Status: REJECTED DOCUMENTED
Alias: None
Product: File System
Classification: Unclassified
Component: ReiserFS
Hardware: i386 Linux
Importance: P2 high
Assignee: ReiserFS developers team
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2006-05-11 06:19 UTC by Fabio
Modified: 2008-03-23 23:36 UTC

See Also:
Kernel Version: 2.6.16.16 Vanilla (not Gentoo-sources)
Subsystem:
Regression: ---
Bisected commit-id:


Attachments

Description Fabio 2006-05-11 06:19:46 UTC
Most recent kernel where this bug did not occur: 2.6.15 I think 
Distribution: Gentoo 2006.0 
Hardware Environment: Notebook Compaq nx7010 
Software Environment: 
Problem Description: When I run "emerge --sync", my kernel oopses. The first 
time it happened I was using 2.6.16.14: 
 
Unable to handle kernel NULL pointer dereference at virtual address 00000004 
 printing eip: 
b01c3889 
*pde = 00000000 
Oops: 0000 [#1] 
PREEMPT 
Modules linked in: radeon drm agpgart snd_rtctimer stir4200 xt_limit xt_length 
ipt_LOG xt_state iptable_filter ipt_MASQUERADE iptable_nat ip_nat ipt_TOS 
xt_tcpudp iptable_mangle ip_tables x_tables snd_pcm_oss snd_mixer_oss 
snd_seq_dummy snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device 
snd_intel8x0m ohci_hcd psmouse eeprom parport_pc parport irtty_sir sir_dev 
irda wbsd mmc_core ipw2200 ieee80211 ieee80211_crypt ohci1394 ieee1394 
snd_intel8x0 snd_ac97_codec snd_ac97_bus snd_pcm snd_timer snd soundcore 
snd_page_alloc i2c_i801 ehci_hcd usbhid usbmouse uhci_hcd usbcore jfs dm_mod 
yenta_socket rsrc_nonstatic pcmcia pcmcia_core firmware_class cpufreq_ondemand 
rfcomm l2cap bluetooth 8139cp 
CPU:    0 
EIP:    0060:[<b01c3889>]    Not tainted VLI 
EFLAGS: 00210297   (2.6.16.14 #1) 
EIP is at internal_insert_childs+0x69/0x1c0 
eax: 00000000   ebx: d1d91a9c   ecx: ca3a2e44   edx: 00000000 
esi: 00000001   edi: dc968f58   ebp: 00000002   esp: d1d919b4 
ds: 007b   es: 007b   ss: 0068 
Process rsync (pid: 16860, threadinfo=d1d91000 task=ddf7e030) 
Stack: <0>dc968f68 dc968f58 00000018 0000000e dc968e00 dc80d7b4 001bccb8 
d00b0008 
       dc80d7e4 0000000c 00000002 00000002 00000000 0000000e b01c49ab d1d91a60 
       0000000b 00000002 d1d91aa4 d1d91a9c ce1689f4 d1d91aa4 00000000 00000000 
Call Trace: 
 [<b01c49ab>] balance_internal+0x33b/0xa80 
 [<b01aa8b3>] do_balance+0xb3/0x140 
 [<b01c94b4>] reiserfs_insert_item+0x314/0x380 
 [<b01b54e9>] reiserfs_allocate_blocks_for_region+0x1569/0x1730 
 [<b01c54b8>] pathrelse+0x28/0x40 
 [<b01b6256>] reiserfs_prepare_file_region_for_write+0x3a6/0x900 
 [<b01b6efc>] reiserfs_file_write+0x74c/0x7c0 
 [<b01465c2>] get_page_from_freelist+0xc2/0xf0 
 [<b0146647>] __alloc_pages+0x57/0x310 
 [<b0150593>] __handle_mm_fault+0x143/0x280 
 [<b016260a>] vfs_write+0xaa/0x1a0 
 [<b01627d1>] sys_write+0x51/0x80 
 [<b010307b>] sysenter_past_esp+0x54/0x75 
Code: d0 20 8b 44 24 0c 89 7c 24 04 29 d0 c1 e0 03 89 44 24 08 8d 04 ef 89 04 
24 e8 d4 5f 05 00 eb 21 89 f6 8b 0c b3 8b 51 18 8b 41 10 <0f> b7 52 04 29 d0 
83 e8 18 66 89 44 f4 1c 8b 41 14 89 44 f4 18 
 Badness in do_exit at kernel/exit.c:802 
 [<b0121064>] do_exit+0x3e4/0x430 
 [<b01044a4>] die+0x194/0x1a0 
 [<b0118adb>] do_page_fault+0x3bb/0x5b1 
 [<b01c3d81>] internal_copy_pointers_items+0x1d1/0x220 
 [<b0118720>] do_page_fault+0x0/0x5b1 
 [<b0103c07>] error_code+0x4f/0x54 
 [<b01c3889>] internal_insert_childs+0x69/0x1c0 
 [<b01c49ab>] balance_internal+0x33b/0xa80 
 [<b01aa8b3>] do_balance+0xb3/0x140 
 [<b01c94b4>] reiserfs_insert_item+0x314/0x380 
 [<b01b54e9>] reiserfs_allocate_blocks_for_region+0x1569/0x1730 
 [<b01c54b8>] pathrelse+0x28/0x40 
 [<b01b6256>] reiserfs_prepare_file_region_for_write+0x3a6/0x900 
 [<b01b6efc>] reiserfs_file_write+0x74c/0x7c0 
 [<b01465c2>] get_page_from_freelist+0xc2/0xf0 
 [<b0146647>] __alloc_pages+0x57/0x310 
 [<b0150593>] __handle_mm_fault+0x143/0x280 
 [<b016260a>] vfs_write+0xaa/0x1a0 
 [<b01627d1>] sys_write+0x51/0x80 
 [<b010307b>] sysenter_past_esp+0x54/0x75 
VFS: brelse: Trying to free free buffer 
Badness in __brelse at fs/buffer.c:1275 
 [<b0165065>] bh_lru_install+0xc5/0xf0 
 [<b016519a>] __find_get_block+0x6a/0x70 
 [<b01651cb>] __getblk+0x2b/0x60 
 [<b01c598d>] search_by_key+0xbd/0xd20 
 [<b01416b6>] find_get_page+0x26/0x60 
 [<b01c54b8>] pathrelse+0x28/0x40 
 [<b01b039c>] init_inode+0x33c/0x480 
 [<b01b0c6e>] reiserfs_read_locked_inode+0xfe/0x190 
 [<b017e91d>] find_inode+0x4d/0x80 
 [<b01aaafc>] search_by_entry_key+0x3c/0x240 
 [<b01ab11e>] reiserfs_find_entry+0xae/0x140 
 [<b01ab261>] reiserfs_lookup+0xb1/0x190 
 [<b01462a3>] buffered_rmqueue+0xd3/0x250 
 [<b017cba1>] d_alloc+0x21/0x1a0 
 [<b017d21e>] d_lookup+0x2e/0x60 
 [<b0171423>] real_lookup+0xc3/0x100 
 [<b017177d>] do_lookup+0x9d/0xb0 
 [<b0172004>] __link_path_walk+0x874/0xea0 
 [<b0172688>] link_path_walk+0x58/0xf0 
 [<b0121e4b>] do_wait+0x17b/0x420 
 [<b0172a30>] do_path_lookup+0x110/0x270 
 [<b0172ebc>] __user_walk_fd+0x3c/0x70 
 [<b016cae1>] vfs_lstat_fd+0x21/0x70 
 [<b0102eba>] do_signal+0xaa/0x160 
 [<b0121e4b>] do_wait+0x17b/0x420 
 [<b016cb4f>] vfs_lstat+0x1f/0x30 
 [<b016d278>] sys_lstat64+0x18/0x40 
 [<b011a580>] default_wake_function+0x0/0x20 
 [<b0102540>] sys_sigreturn+0xb0/0xd0 
 [<b010307b>] sysenter_past_esp+0x54/0x75 
 
I tried many times with the same result. 
I ran mkfs.reiserfs on my portage partition and "emerge --sync" ran OK (starting 
from an empty portage tree). 
I reran "emerge --sync" and the kernel oopsed again. 
Then I compiled 2.6.16.16 with debugging enabled and this is the output: 
 
ReiserFS: dm-3: warning: journal-1777: buffer 2097184 bad 
state !PREPARED !LOCKED !DIRTY !JDIRTY_WAIT 
Unable to handle kernel paging request at virtual address 00200038 
 printing eip: 
b01acdd3 
*pde = 00000000 
Oops: 0000 [#1] 
PREEMPT 
Modules linked in: radeon drm agpgart snd_rtctimer stir4200 ipt_TOS 
iptable_mangle ipt_MASQUERADE iptable_nat ip_nat xt_limit xt_length ipt_LOG 
xt_tcpudp xt_state iptable_filter ip_tables x_tables snd_pcm_oss snd_mixer_oss 
snd_seq_dummy snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device 
snd_intel8x0m eeprom ohci_hcd psmouse parport_pc parport irtty_sir sir_dev 
irda wbsd mmc_core ohci1394 ieee1394 snd_intel8x0 snd_ac97_codec snd_ac97_bus 
snd_pcm snd_timer snd soundcore snd_page_alloc i2c_i801 ehci_hcd usbhid 
usbmouse uhci_hcd usbcore jfs dm_mod yenta_socket rsrc_nonstatic pcmcia 
pcmcia_core firmware_class cpufreq_ondemand rfcomm l2cap bluetooth 8139cp 
CPU:    0 
EIP:    0060:[<b01acdd3>]    Not tainted VLI 
EFLAGS: 00210202   (2.6.16.16 #1) 
EIP is at balance_leaf+0xb83/0x2be0 
eax: c32b8ae4   ebx: c6fc7a24   ecx: c6fc7d24   edx: c6fc7a2c 
esi: 00200038   edi: c6fc7a80   ebp: c6fc7a58   esp: c6fc7954 
ds: 007b   es: 007b   ss: 0068 
Process rsync (pid: 2973, threadinfo=c6fc7000 task=d7ddd550) 
Stack: <0>c6fc7a3c 00000000 c6fc7d24 dc0ab5d0 00000000 00000001 00000001 
00000001 
       c6fc7a64 c6fc7a34 c6fc7a2c c6fc7a24 c6fc7a80 00000000 00000001 00000000 
       0001ffff ffffffff c6fc79bc 00000001 00000000 000001d0 00000001 c6fc7ae8 
Call Trace: 
 [<b010416a>] show_stack_log_lvl+0xaa/0xe0 
 [<b010438f>] show_registers+0x19f/0x220 
 [<b01045a4>] die+0xf4/0x1a0 
 [<b0119d2a>] do_page_fault+0x3da/0x5d1 
 [<b0103d87>] error_code+0x4f/0x54 
 [<b01af187>] do_balance+0x77/0x130 
 [<b01cd5b6>] reiserfs_insert_item+0x326/0x380 
 [<b01b997a>] reiserfs_allocate_blocks_for_region+0x14ba/0x1640 
 [<b01bb3b7>] reiserfs_file_write+0x7c7/0x840 
 [<b01665d1>] vfs_write+0xa1/0x190 
 [<b016678b>] sys_write+0x4b/0x80 
 [<b01031fb>] sysenter_past_esp+0x54/0x75 
Code: 89 44 24 0c 8b 45 08 c7 04 24 04 00 00 00 89 44 24 04 e8 81 99 01 00 8b 
9d 28 ff ff ff 8b bd 2c ff ff ff 8b 03 8b 70 18 83 c6 18 <a5> a5 a5 a5 8b 03 
83 eb 04 8b b5 30 ff ff ff 8b 55 1c 89 04 16 
 Badness in do_exit at kernel/exit.c:802 
 [<b01040b1>] show_trace+0x21/0x30 
 [<b01041ee>] dump_stack+0x1e/0x20 
 [<b0122b0d>] do_exit+0x44d/0x4a0 
 [<b0104644>] die+0x194/0x1a0 
 [<b0119d2a>] do_page_fault+0x3da/0x5d1 
 [<b0103d87>] error_code+0x4f/0x54 
 [<b01af187>] do_balance+0x77/0x130 
 [<b01cd5b6>] reiserfs_insert_item+0x326/0x380 
 [<b01b997a>] reiserfs_allocate_blocks_for_region+0x14ba/0x1640 
 [<b01bb3b7>] reiserfs_file_write+0x7c7/0x840 
 [<b01665d1>] vfs_write+0xa1/0x190 
 [<b016678b>] sys_write+0x4b/0x80 
 [<b01031fb>] sysenter_past_esp+0x54/0x75 
BUG: rsync/2973, lock held at task exit time! 
 [c6664e5c] {inode_init_once} 
.. held by:             rsync: 2973 [d7ddd550, 135] 
... acquired at:               reiserfs_file_write+0xe5/0x840 
Unable to handle kernel NULL pointer dereference at virtual address 000000c9 
 printing eip: 
b018de97 
*pde = 00000000 
Oops: 0000 [#2] 
PREEMPT 
Modules linked in: radeon drm agpgart snd_rtctimer stir4200 ipt_TOS 
iptable_mangle ipt_MASQUERADE iptable_nat ip_nat xt_limit xt_length ipt_LOG 
xt_tcpudp xt_state iptable_filter ip_tables x_tables snd_pcm_oss snd_mixer_oss 
snd_seq_dummy snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device 
snd_intel8x0m eeprom ohci_hcd psmouse parport_pc parport irtty_sir sir_dev 
irda wbsd mmc_core ohci1394 ieee1394 snd_intel8x0 snd_ac97_codec snd_ac97_bus 
snd_pcm snd_timer snd soundcore snd_page_alloc i2c_i801 ehci_hcd usbhid 
usbmouse uhci_hcd usbcore jfs dm_mod yenta_socket rsrc_nonstatic pcmcia 
pcmcia_core firmware_class cpufreq_ondemand rfcomm l2cap bluetooth 8139cp 
CPU:    0 
EIP:    0060:[<b018de97>]    Not tainted VLI 
EFLAGS: 00210283   (2.6.16.16 #1) 
EIP is at sync_sb_inodes+0x67/0x310 
eax: 00000091   ebx: c32ad900   ecx: c32ad908   edx: b1751000 
esi: df35e294   edi: df35e200   ebp: b1751ee4   esp: b1751eb8 
ds: 007b   es: 007b   ss: 0068 
Process pdflush (pid: 164, threadinfo=b1751000 task=dff49a70) 
Stack: <0>c4776ae4 b1751f18 b1751000 df35e294 b1751000 df35e28c 00000000 
000017f1 
       df35e200 b1751000 b03acfb0 b1751f00 b018e249 df35e200 b1751f18 b1751f18 
       000019e5 00010fa8 b1751f64 b014bbd3 b1751f18 b03fbfb8 00000c39 b0351d5f 
Call Trace: 
 [<b010416a>] show_stack_log_lvl+0xaa/0xe0 
 [<b010438f>] show_registers+0x19f/0x220 
 [<b01045a4>] die+0xf4/0x1a0 
 [<b0119d2a>] do_page_fault+0x3da/0x5d1 
 [<b0103d87>] error_code+0x4f/0x54 
 [<b018e249>] writeback_inodes+0x109/0x140 
 [<b014bbd3>] wb_kupdate+0x83/0x100 
 [<b014c678>] __pdflush+0xe8/0x210 
 [<b014c7d3>] pdflush+0x33/0x40 
 [<b0134e16>] kthread+0xa6/0xf0 
 [<b0101439>] kernel_thread_helper+0x5/0xc 
Code: 00 00 00 00 3b 45 e0 74 5c 8d 87 8c 00 00 00 ba 00 f0 ff ff 89 45 e8 21 
e2 89 55 dc 8b 8f 98 00 00 00 8d 59 f8 8b 83 b0 00 00 00 <8b> 70 38 f6 46 08 
02 0f 84 8c 00 00 00 8b 41 04 8b 11 89 10 89 
 <6>note: pdflush[164] exited with preempt_count 1 
Debug: sleeping function called from invalid context at fs/proc/base.c:1954 
in_atomic():1, irqs_disabled():0 
 [<b01040b1>] show_trace+0x21/0x30 
 [<b01041ee>] dump_stack+0x1e/0x20 
 [<b011cc6f>] __might_sleep+0x9f/0xb0 
 [<b01a0fcf>] proc_pid_flush+0x1f/0x40 
 [<b012127b>] release_task+0xcb/0x170 
 [<b01225b5>] exit_notify+0x285/0x390 
 [<b0122968>] do_exit+0x2a8/0x4a0 
 [<b0104644>] die+0x194/0x1a0 
 [<b0119d2a>] do_page_fault+0x3da/0x5d1 
 [<b0103d87>] error_code+0x4f/0x54 
 [<b018e249>] writeback_inodes+0x109/0x140 
 [<b014bbd3>] wb_kupdate+0x83/0x100 
 [<b014c678>] __pdflush+0xe8/0x210 
 [<b014c7d3>] pdflush+0x33/0x40 
 [<b0134e16>] kthread+0xa6/0xf0 
 [<b0101439>] kernel_thread_helper+0x5/0xc 
 
 
Steps to reproduce: 
 
Run "emerge --sync" on the affected partition (I can rerun it on request). 
 
Thanks and bye 
 
Fabio
Comment 1 Fabio 2006-05-11 15:06:11 UTC
Some other info: 
 
[root@mignolo:/usr/src/linux]# cat /proc/cpuinfo 
processor       : 0 
vendor_id       : GenuineIntel 
cpu family      : 6 
model           : 13 
model name      : Intel(R) Pentium(R) M processor 1.70GHz 
stepping        : 6 
cpu MHz         : 1700.000 
cache size      : 2048 KB 
fdiv_bug        : no 
hlt_bug         : no 
f00f_bug        : no 
coma_bug        : no 
fpu             : yes 
fpu_exception   : yes 
cpuid level     : 2 
wp              : yes 
flags           : fpu vme de pse tsc msr mce cx8 apic sep mtrr pge mca cmov 
pat clflush dts acpi mmx fxsr sse sse2 ss tm pbe est tm2 
bogomips        : 3393.15 
 
 
[root@mignolo:/usr/src/linux]# cat /proc/modules 
radeon 107872 0 - Live 0xe2409000 
drm 64916 1 radeon, Live 0xe23dc000 
agpgart 29040 1 drm, Live 0xe23be000 
snd_rtctimer 2380 1 - Live 0xe22ee000 
stir4200 11076 0 - Live 0xe239a000 
ipt_TOS 1984 18 - Live 0xe2393000 
iptable_mangle 2176 1 - Live 0xe231c000 
ipt_MASQUERADE 2560 4 - Live 0xe235e000 
iptable_nat 7172 1 - Live 0xe238e000 
ip_nat 14316 2 ipt_MASQUERADE,iptable_nat, Live 0xe237c000 
xt_limit 2048 5 - Live 0xe2374000 
xt_length 1600 1 - Live 0xe2360000 
ipt_LOG 6272 8 - Live 0xe2362000 
xt_tcpudp 3328 41 - Live 0xe2334000 
xt_state 1664 22 - Live 0xe233c000 
iptable_filter 2240 1 - Live 0xe227f000 
ip_tables 11800 3 iptable_mangle,iptable_nat,iptable_filter, Live 0xe235a000 
x_tables 9988 9 
ipt_TOS,ipt_MASQUERADE,iptable_nat,xt_limit,xt_length,ipt_LOG,xt_tcpudp,xt_state,ip_tables, 
Live 0xe2356000 
snd_pcm_oss 49632 0 - Live 0xe2366000 
snd_mixer_oss 17280 1 snd_pcm_oss, Live 0xe2336000 
snd_seq_dummy 2692 0 - Live 0xe2281000 
snd_seq_oss 32320 0 - Live 0xe234d000 
snd_seq_midi_event 6016 1 snd_seq_oss, Live 0xe22f6000 
snd_seq 49616 6 snd_seq_dummy,snd_seq_oss,snd_seq_midi_event, Live 0xe233f000 
snd_seq_device 6796 3 snd_seq_dummy,snd_seq_oss,snd_seq, Live 0xe22f3000 
snd_intel8x0m 14860 6 - Live 0xe2317000 
ohci_hcd 19780 0 - Live 0xe231e000 
eeprom 5520 0 - Live 0xe22f0000 
psmouse 38408 0 - Live 0xe2324000 
parport_pc 37380 0 - Live 0xe22d9000 
parport 33736 1 parport_pc, Live 0xe22e4000 
irtty_sir 5696 0 - Live 0xe2283000 
sir_dev 14060 1 irtty_sir, Live 0xe22d4000 
irda 118456 2 stir4200,sir_dev, Live 0xe22f9000 
wbsd 16520 0 - Live 0xe220a000 
mmc_core 19152 1 wbsd, Live 0xe2201000 
ipw2200 116036 0 - Live 0xe2286000 
ieee80211 31944 1 ipw2200, Live 0xe21ee000 
ieee80211_crypt 4928 1 ieee80211, Live 0xe2207000 
ohci1394 32112 0 - Live 0xe21db000 
ieee1394 290936 1 ohci1394, Live 0xe221a000 
snd_intel8x0 30044 1 - Live 0xe2138000 
snd_ac97_codec 94240 2 snd_intel8x0m,snd_intel8x0, Live 0xe2191000 
snd_ac97_bus 1792 1 snd_ac97_codec, Live 0xe2132000 
snd_pcm 83208 6 snd_pcm_oss,snd_intel8x0m,snd_intel8x0,snd_ac97_codec, Live 
0xe21aa000 
snd_timer 21956 3 snd_rtctimer,snd_seq,snd_pcm, Live 0xe2174000 
snd 47460 23 
snd_pcm_oss,snd_mixer_oss,snd_seq_oss,snd_seq,snd_seq_device,snd_intel8x0m,snd_intel8x0,snd_ac97_codec,snd_pcm,snd_timer, 
Live 0xe2184000 
soundcore 7776 1 snd, Live 0xe2124000 
snd_page_alloc 8520 3 snd_intel8x0m,snd_intel8x0,snd_pcm, Live 0xe2134000 
i2c_i801 7692 0 - Live 0xe211c000 
ehci_hcd 31944 0 - Live 0xe216b000 
usbhid 37216 0 - Live 0xe2160000 
usbmouse 4416 0 - Live 0xe211f000 
uhci_hcd 30928 0 - Live 0xe2127000 
usbcore 121092 7 stir4200,ohci_hcd,ehci_hcd,usbhid,usbmouse,uhci_hcd, Live 
0xe2141000 
jfs 183068 1 - Live 0xe1a92000 
dm_mod 50552 7 - Live 0xe19ad000 
yenta_socket 24652 1 - Live 0xe1995000 
rsrc_nonstatic 10304 1 yenta_socket, Live 0xe0834000 
pcmcia 31828 0 - Live 0xe198c000 
pcmcia_core 36560 3 yenta_socket,rsrc_nonstatic,pcmcia, Live 0xe0875000 
firmware_class 8256 2 ipw2200,pcmcia, Live 0xe0841000 
cpufreq_ondemand 4956 0 - Live 0xe083e000 
rfcomm 34904 0 - Live 0xe085f000 
l2cap 22532 5 rfcomm, Live 0xe0858000 
bluetooth 43236 4 rfcomm,l2cap, Live 0xe084c000 
8139cp 18816 0 - Live 0xe0838000 
 
 
[root@mignolo:/usr/src/linux]# lspci -vvv 
00:00.0 Host bridge: Intel Corporation 82855PM Processor to I/O Controller 
(rev 03) 
        Subsystem: Compaq Computer Corporation Unknown device 0860 
        Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- 
        Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- 
<TAbort- <MAbort+ >SERR- <PERR- 
        Latency: 0 
        Region 0: Memory at b0000000 (32-bit, prefetchable) [size=256M] 
        Capabilities: [e4] Vendor Specific Information 
        Capabilities: [a0] AGP version 2.0 
                Status: RQ=32 Iso- ArqSz=0 Cal=0 SBA+ ITACoh- GART64- HTrans- 
64bit- FW+ AGP3- Rate=x1,x2,x4 
                Command: RQ=1 ArqSz=0 Cal=0 SBA- AGP- GART64- 64bit- FW+ 
Rate=x1 
 
00:01.0 PCI bridge: Intel Corporation 82855PM Processor to AGP Controller (rev 
03) (prog-if 00 [Normal decode]) 
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- 
        Status: Cap- 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Latency: 128 
        Bus: primary=00, secondary=01, subordinate=01, sec-latency=0 
        I/O behind bridge: 00003000-00003fff 
        Memory behind bridge: 90400000-904fffff 
        Prefetchable memory behind bridge: 98000000-9fffffff 
        Secondary status: 66MHz+ FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- <SERR- <PERR- 
        BridgeCtl: Parity- SERR- NoISA+ VGA+ MAbort- >Reset- FastB2B- 
 
00:1d.0 USB Controller: Intel Corporation 82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) 
USB UHCI Controller #1 (rev 01) (prog-if 00 [UHCI]) 
        Subsystem: Compaq Computer Corporation Unknown device 0860 
        Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- 
        Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Latency: 0 
        Interrupt: pin A routed to IRQ 10 
        Region 4: I/O ports at 48c0 [size=32] 
 
00:1d.1 USB Controller: Intel Corporation 82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) 
USB UHCI Controller #2 (rev 01) (prog-if 00 [UHCI]) 
        Subsystem: Compaq Computer Corporation Unknown device 0860 
        Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- 
        Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Latency: 0 
        Interrupt: pin B routed to IRQ 5 
        Region 4: I/O ports at 48e0 [size=32] 
 
00:1d.2 USB Controller: Intel Corporation 82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) 
USB UHCI Controller #3 (rev 01) (prog-if 00 [UHCI]) 
        Subsystem: Compaq Computer Corporation Unknown device 0860 
        Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- 
        Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Latency: 0 
        Interrupt: pin C routed to IRQ 5 
        Region 4: I/O ports at 4c00 [size=32] 
 
00:1d.7 USB Controller: Intel Corporation 82801DB/DBM (ICH4/ICH4-M) USB2 EHCI 
Controller (rev 01) (prog-if 20 [EHCI]) 
        Subsystem: Compaq Computer Corporation Unknown device 0860 
        Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- 
        Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Latency: 0 
        Interrupt: pin D routed to IRQ 5 
        Region 0: Memory at a0000000 (32-bit, non-prefetchable) [size=1K] 
        Capabilities: [50] Power Management version 2 
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=375mA 
PME(D0+,D1-,D2-,D3hot+,D3cold+) 
                Status: D0 PME-Enable- DSel=0 DScale=0 PME- 
        Capabilities: [58] Debug port 
 
00:1e.0 PCI bridge: Intel Corporation 82801 Mobile PCI Bridge (rev 81) 
(prog-if 00 [Normal decode]) 
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR+ FastB2B- 
        Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=fast >TAbort- 
<TAbort- <MAbort- >SERR- <PERR+ 
        Latency: 0 
        Bus: primary=00, secondary=02, subordinate=03, sec-latency=32 
        I/O behind bridge: 00002000-00002fff 
        Memory behind bridge: 90000000-903fffff 
        Prefetchable memory behind bridge: 40000000-41ffffff 
        Secondary status: 66MHz- FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- <SERR- <PERR- 
        BridgeCtl: Parity- SERR+ NoISA+ VGA- MAbort- >Reset- FastB2B- 
 
00:1f.0 ISA bridge: Intel Corporation 82801DBM (ICH4-M) LPC Interface Bridge 
(rev 01) 
        Control: I/O+ Mem+ BusMaster+ SpecCycle+ MemWINV- VGASnoop- ParErr- 
Stepping- SERR+ FastB2B- 
        Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Latency: 0 
 
00:1f.1 IDE interface: Intel Corporation 82801DBM (ICH4-M) IDE Controller (rev 
01) (prog-if 8a [Master SecP PriP]) 
        Subsystem: Compaq Computer Corporation Unknown device 0860 
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- 
        Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Latency: 0 
        Interrupt: pin A routed to IRQ 5 
        Region 0: I/O ports at <unassigned> 
        Region 1: I/O ports at <unassigned> 
        Region 2: I/O ports at <unassigned> 
        Region 3: I/O ports at <unassigned> 
        Region 4: I/O ports at 4c40 [size=16] 
        Region 5: Memory at 42000000 (32-bit, non-prefetchable) [size=1K] 
 
00:1f.3 SMBus: Intel Corporation 82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) SMBus 
Controller (rev 01) 
        Subsystem: Compaq Computer Corporation Unknown device 0860 
        Control: I/O+ Mem- BusMaster- SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- 
        Status: Cap- 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Interrupt: pin B routed to IRQ 10 
        Region 4: I/O ports at 4c20 [size=32] 
 
00:1f.5 Multimedia audio controller: Intel Corporation 82801DB/DBL/DBM 
(ICH4/ICH4-L/ICH4-M) AC'97 Audio Controller (rev 01) 
        Subsystem: Compaq Computer Corporation Unknown device 0860 
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- 
        Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Latency: 0 
        Interrupt: pin B routed to IRQ 10 
        Region 0: I/O ports at 4000 [size=256] 
        Region 1: I/O ports at 4880 [size=64] 
        Region 2: Memory at a0200000 (32-bit, non-prefetchable) [size=512] 
        Region 3: Memory at a0300000 (32-bit, non-prefetchable) [size=256] 
        Capabilities: [50] Power Management version 2 
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=375mA 
PME(D0+,D1-,D2-,D3hot+,D3cold+) 
                Status: D0 PME-Enable- DSel=0 DScale=0 PME- 
 
00:1f.6 Modem: Intel Corporation 82801DB/DBL/DBM (ICH4/ICH4-L/ICH4-M) AC'97 
Modem Controller (rev 01) (prog-if 00 [Generic]) 
        Subsystem: Compaq Computer Corporation Unknown device 0860 
        Control: I/O+ Mem- BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- 
        Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Latency: 0 
        Interrupt: pin B routed to IRQ 10 
        Region 0: I/O ports at 4400 [size=256] 
        Region 1: I/O ports at 4800 [size=128] 
        Capabilities: [50] Power Management version 2 
                Flags: PMEClk- DSI- D1- D2- AuxCurrent=375mA 
PME(D0+,D1-,D2-,D3hot+,D3cold+) 
                Status: D0 PME-Enable- DSel=0 DScale=0 PME- 
 
01:00.0 VGA compatible controller: ATI Technologies Inc Radeon R250 Lf [FireGL 
9000] (rev 01) (prog-if 00 [VGA]) 
        Subsystem: Compaq Computer Corporation Unknown device 0860 
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping+ SERR- FastB2B- 
        Status: Cap+ 66MHz+ UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Latency: 128 (2000ns min), Cache Line Size 08 
        Interrupt: pin A routed to IRQ 10 
        Region 0: Memory at 98000000 (32-bit, prefetchable) [size=128M] 
        Region 1: I/O ports at 3000 [size=256] 
        Region 2: Memory at 90400000 (32-bit, non-prefetchable) [size=64K] 
        [virtual] Expansion ROM at 90420000 [disabled] [size=128K] 
        Capabilities: [58] AGP version 2.0 
                Status: RQ=48 Iso- ArqSz=0 Cal=0 SBA+ ITACoh- GART64- HTrans- 
64bit- FW+ AGP3- Rate=x1,x2,x4 
                Command: RQ=1 ArqSz=0 Cal=0 SBA+ AGP- GART64- 64bit- FW- 
Rate=<none> 
        Capabilities: [50] Power Management version 2 
                Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=0mA 
PME(D0-,D1-,D2-,D3hot-,D3cold-) 
                Status: D0 PME-Enable- DSel=0 DScale=0 PME- 
 
02:00.0 FireWire (IEEE 1394): VIA Technologies, Inc. IEEE 1394 Host Controller 
(rev 80) (prog-if 10 [OHCI]) 
        Subsystem: VIA Technologies, Inc. IEEE 1394 Host Controller 
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- 
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Latency: 128 (8000ns max), Cache Line Size 08 
        Interrupt: pin A routed to IRQ 10 
        Region 0: Memory at 90200000 (32-bit, non-prefetchable) [size=2K] 
        Region 1: I/O ports at 2400 [size=128] 
        Capabilities: [50] Power Management version 2 
                Flags: PMEClk- DSI- D1- D2+ AuxCurrent=0mA 
PME(D0-,D1-,D2+,D3hot+,D3cold+) 
                Status: D0 PME-Enable- DSel=0 DScale=0 PME- 
 
02:01.0 Ethernet controller: Realtek Semiconductor Co., Ltd. 
RTL-8139/8139C/8139C+ (rev 20) 
        Subsystem: Compaq Computer Corporation Unknown device 0860 
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV+ VGASnoop- ParErr- 
Stepping- SERR- FastB2B- 
        Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Latency: 128 (8000ns min, 16000ns max), Cache Line Size 08 
        Interrupt: pin A routed to IRQ 10 
        Region 0: I/O ports at 2000 [size=256] 
        Region 1: Memory at 90300000 (32-bit, non-prefetchable) [size=256] 
        Capabilities: [50] Power Management version 2 
                Flags: PMEClk- DSI- D1+ D2+ AuxCurrent=375mA 
PME(D0-,D1+,D2+,D3hot+,D3cold+) 
                Status: D0 PME-Enable- DSel=0 DScale=0 PME- 
 
02:02.0 Network controller: Intel Corporation PRO/Wireless 2200BG (rev 05) 
        Subsystem: Hewlett-Packard Company Unknown device 12f6 
        Control: I/O- Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- 
        Status: Cap+ 66MHz- UDF- FastB2B+ ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Latency: 128 (750ns min, 6000ns max), Cache Line Size 08 
        Interrupt: pin A routed to IRQ 5 
        Region 0: Memory at 90000000 (32-bit, non-prefetchable) [size=4K] 
        Capabilities: [dc] Power Management version 2 
                Flags: PMEClk- DSI+ D1- D2- AuxCurrent=0mA 
PME(D0+,D1-,D2-,D3hot+,D3cold+) 
                Status: D0 PME-Enable- DSel=0 DScale=1 PME- 
 
02:04.0 CardBus bridge: ENE Technology Inc CB1410 Cardbus Controller (rev 01) 
        Subsystem: Compaq Computer Corporation Unknown device 0860 
        Control: I/O+ Mem+ BusMaster+ SpecCycle- MemWINV- VGASnoop- ParErr- 
Stepping- SERR- FastB2B- 
        Status: Cap+ 66MHz- UDF- FastB2B- ParErr- DEVSEL=medium >TAbort- 
<TAbort- <MAbort- >SERR- <PERR- 
        Latency: 168, Cache Line Size 10 
        Interrupt: pin A routed to IRQ 5 
        Region 0: Memory at 90100000 (32-bit, non-prefetchable) [size=4K] 
        Bus: primary=02, secondary=03, subordinate=06, sec-latency=176 
        Memory window 0: 40000000-41fff000 (prefetchable) 
        Memory window 1: 44000000-45fff000 
        I/O window 0: 00002800-000028ff 
        I/O window 1: 00002c00-00002cff 
        BridgeCtl: Parity- SERR- ISA- VGA- MAbort- >Reset+ 16bInt+ PostWrite+ 
        16-bit legacy interface ports at 0001 
 
 
[root@mignolo:/usr/src/linux]# cat /proc/ioports 
0000-001f : dma1 
0020-0021 : pic1 
0040-0043 : timer0 
004e-004f : wbsd 
0050-0053 : timer1 
0060-006f : keyboard 
0070-0077 : rtc 
0080-008f : dma page reg 
00a0-00a1 : pic2 
00c0-00df : dma2 
00f0-00ff : fpu 
0170-0177 : ide1 
01f0-01f7 : ide0 
0248-024f : wbsd 
0376-0376 : ide1 
0378-037a : parport0 
03c0-03df : vga+ 
03e8-03ef : serial 
03f6-03f6 : ide0 
03f8-03ff : serial 
04d0-04d1 : pnp 00:0d 
0778-077a : parport0 
0cf8-0cff : PCI conf1 
1000-107f : 0000:00:1f.0 
  1000-107f : motherboard 
    1000-1003 : PM1a_EVT_BLK 
    1004-1005 : PM1a_CNT_BLK 
    1008-100b : PM_TMR 
    1010-1015 : ACPI CPU throttle 
    1020-1020 : PM2_CNT_BLK 
    1028-102f : GPE0_BLK 
1100-113f : 0000:00:1f.0 
  1100-113f : motherboard 
    1100-113f : pnp 00:0d 
1200-121f : motherboard 
  1200-121f : pnp 00:0d 
2000-2fff : PCI Bus #02 
  2000-20ff : 0000:02:01.0 
    2000-20ff : 8139cp 
  2400-247f : 0000:02:00.0 
  2800-28ff : PCI CardBus #03 
  2c00-2cff : PCI CardBus #03 
3000-3fff : PCI Bus #01 
  3000-30ff : 0000:01:00.0 
4000-40ff : 0000:00:1f.5 
  4000-40ff : Intel 82801DB-ICH4 
4400-44ff : 0000:00:1f.6 
  4400-44ff : Intel 82801DB-ICH4 Modem 
4800-487f : 0000:00:1f.6 
  4800-487f : Intel 82801DB-ICH4 Modem 
4880-48bf : 0000:00:1f.5 
  4880-48bf : Intel 82801DB-ICH4 
48c0-48df : 0000:00:1d.0 
  48c0-48df : uhci_hcd 
48e0-48ff : 0000:00:1d.1 
  48e0-48ff : uhci_hcd 
4c00-4c1f : 0000:00:1d.2 
  4c00-4c1f : uhci_hcd 
4c20-4c3f : 0000:00:1f.3 
  4c20-4c2f : i801_smbus 
4c40-4c4f : 0000:00:1f.1 
  4c40-4c47 : ide0 
  4c48-4c4f : ide1 
 
 
[root@mignolo:/usr/src/linux]# cat /proc/iomem 
00000000-0009fbff : System RAM 
0009fc00-0009ffff : reserved 
000a0000-000bffff : Video RAM area 
000c0000-000cffff : Video ROM 
000f0000-000fffff : System ROM 
00100000-2ffcffff : System RAM 
  00100000-0034f202 : Kernel code 
  0034f203-003f3f2b : Kernel data 
2ffd0000-2fff0bff : reserved 
2fff0c00-2fffbfff : ACPI Non-volatile Storage 
2fffc000-2fffffff : reserved 
40000000-41ffffff : PCI Bus #02 
  40000000-41ffffff : PCI CardBus #03 
42000000-420003ff : 0000:00:1f.1 
44000000-45ffffff : PCI CardBus #03 
90000000-903fffff : PCI Bus #02 
  90000000-90000fff : 0000:02:02.0 
    90000000-90000fff : ipw2200 
  90100000-90100fff : 0000:02:04.0 
    90100000-90100fff : yenta_socket 
  90200000-902007ff : 0000:02:00.0 
    90200000-902007ff : ohci1394 
  90300000-903000ff : 0000:02:01.0 
    90300000-903000ff : 8139cp 
90400000-904fffff : PCI Bus #01 
  90400000-9040ffff : 0000:01:00.0 
    90400000-9040ffff : radeonfb mmio 
  90420000-9043ffff : 0000:01:00.0 
98000000-9fffffff : PCI Bus #01 
  98000000-9fffffff : 0000:01:00.0 
    98000000-9fffffff : radeonfb framebuffer 
a0000000-a00003ff : 0000:00:1d.7 
  a0000000-a00003ff : ehci_hcd 
a0200000-a02001ff : 0000:00:1f.5 
  a0200000-a02001ff : Intel 82801DB-ICH4 
a0300000-a03000ff : 0000:00:1f.5 
  a0300000-a03000ff : Intel 82801DB-ICH4 
b0000000-bfffffff : 0000:00:00.0 
 
Comment 2 Fabio 2006-05-16 14:38:31 UTC
I reformatted this partition, remounted it clean, and ran "emerge --sync" 3-4 
times (about once per day)... and it happened again! 
What can corrupt my fs? 
What can I do to help you investigate this problem? 
 
Thanks 
Fabio 
 
REISERFS: panic (device dm-3): green-9006: Allocated but absent buffer for a 
page? 
------------[ cut here ]------------ 
kernel BUG at fs/reiserfs/prints.c:362! 
invalid opcode: 0000 [#1] 
PREEMPT 
Modules linked in: radeon drm agpgart snd_rtctimer stir4200 ipt_TOS 
iptable_mangle ipt_MASQUERADE iptable_nat ip_nat xt_limit xt_length ipt_LOG 
xt_tcpudp xt_state iptable_filter ip_tables x_tables snd_pcm_oss snd_mixer_oss 
snd_seq_dummy snd_seq_oss snd_seq_midi_event snd_seq snd_seq_device 
snd_intel8x0m eeprom ohci_hcd psmouse parport_pc parport irtty_sir sir_dev 
irda wbsd mmc_core ohci1394 ieee1394 snd_intel8x0 snd_ac97_codec snd_ac97_bus 
snd_pcm snd_timer snd soundcore snd_page_alloc i2c_i801 ehci_hcd usbhid 
usbmouse uhci_hcd usbcore jfs dm_mod yenta_socket rsrc_nonstatic pcmcia 
pcmcia_core firmware_class cpufreq_ondemand rfcomm l2cap bluetooth 8139cp 
CPU:    0 
EIP:    0060:[<b01c3f4f>]    Not tainted VLI 
EFLAGS: 00010282   (2.6.16.16 #2) 
EIP is at reiserfs_panic+0x4f/0x80 
eax: 00000056   ebx: b036f714   ecx: ddb95000   edx: 10000000 
esi: df240c00   edi: df240d48   ebp: b9a77ca4   esp: b9a77c8c 
ds: 007b   es: 007b   ss: 0068 
Process rsync (pid: 16015, threadinfo=b9a77000 task=df9e1550) 
Stack: <0>b0373830 df240d48 b044a8c0 00000200 00000000 df240c00 b9a77e20 
b01b9882 
       df240c00 b03725ac b9a77df4 b9a77d24 cceb95b8 bb209800 00000220 00000000 
       df240c00 00000009 00017da4 cceb956c cce30ba4 00000000 00000000 00000000 
Call Trace: 
 [<b010416a>] show_stack_log_lvl+0xaa/0xe0 
 [<b010438f>] show_registers+0x19f/0x220 
 [<b01045a4>] die+0xf4/0x1a0 
 [<b01046d9>] do_trap+0x89/0xd0 
 [<b0104a58>] do_invalid_op+0xb8/0xd0 
 [<b0103d87>] error_code+0x4f/0x54 
 [<b01b9882>] reiserfs_allocate_blocks_for_region+0x13c2/0x1640 
 [<b01bb3b7>] reiserfs_file_write+0x7c7/0x840 
 [<b01665d1>] vfs_write+0xa1/0x190 
 [<b016678b>] sys_write+0x4b/0x80 
 [<b01031fb>] sysenter_past_esp+0x54/0x75 
Code: 01 00 00 89 04 24 e8 01 fd ff ff c7 04 24 30 38 37 b0 85 f6 89 d8 0f 45 
c7 ba c0 a8 44 b0 89 54 24 08 89 44 24 04 e8 31 c4 f5 ff <0f> 0b 6a 01 4d fc 
36 b0 c7 04 24 54 38 37 b0 85 f6 be c0 a8 44 
 Badness in do_exit at kernel/exit.c:802 
 [<b01040b1>] show_trace+0x21/0x30 
 [<b01041ee>] dump_stack+0x1e/0x20 
 [<b0122b0d>] do_exit+0x44d/0x4a0 
 [<b0104644>] die+0x194/0x1a0 
 [<b01046d9>] do_trap+0x89/0xd0 
 [<b0104a58>] do_invalid_op+0xb8/0xd0 
 [<b0103d87>] error_code+0x4f/0x54 
 [<b01b9882>] reiserfs_allocate_blocks_for_region+0x13c2/0x1640 
 [<b01bb3b7>] reiserfs_file_write+0x7c7/0x840 
 [<b01665d1>] vfs_write+0xa1/0x190 
 [<b016678b>] sys_write+0x4b/0x80 
 [<b01031fb>] sysenter_past_esp+0x54/0x75 
BUG: rsync/16015, lock held at task exit time! 
 [cceb9628] {inode_init_once} 
.. held by:             rsync:16015 [df9e1550, 134] 
... acquired at:               reiserfs_file_write+0xe5/0x840 
 
Comment 3 Piet Delaney 2006-05-16 15:04:32 UTC
Why not patch your kernel with kgdb, reproduce it, and look at it with gdb?
You could likely use the ethernet interface, which seems to be working best
these days.

-piet
Comment 4 Hans Reiser 2006-05-16 23:17:23 UTC
Try a block size of 4k. Why would you want to use 512? It is a bug
though. Chris, do you want to look at it? Who is dm?

Hans

bugme-daemon@bugzilla.kernel.org wrote:

>http://bugzilla.kernel.org/show_bug.cgi?id=6535
>
>qnxuserit@yahoo.it changed:
>
>           What    |Removed                     |Added
>----------------------------------------------------------------------------
>            Summary|Ops with 2.6.16.16 while    |Ops with 2.6.16.16 on
>                   |"emerge --sync" (gentoo)    |reiserfs with block-size 512
>
>
>

Comment 5 Fabio 2006-05-17 11:52:31 UTC
I'm using a block size of 512 because the files in Gentoo's portage tree 
are normally very small. This decreases the size of the portage tree to 
about two-thirds of the original size. I will try with a block size of 2k... 
 
I could patch my kernel with kgdb but I'm not familiar with gdb... if anyone 
can explain to me step by step what I should do... no problem! 
 
This is from my fstab:   
 
/dev/vg00/lvol4   /var/portage  reiserfs   defaults,notail,noatime 1 2   
 
Bye 
Fabio 
 
Comment 6 Hans Reiser 2006-05-17 12:01:28 UTC
bugme-daemon@bugzilla.kernel.org wrote:

>http://bugzilla.kernel.org/show_bug.cgi?id=6535
>
>
>
>
>
>------- Additional Comments From qnxuserit@yahoo.it  2006-05-17 11:52 -------
>I'm using a block size of 512 because the files in gentoo's portage tree  
>normally are very smaller. This decreases the size of the portage-tree to  
>about two-third of the original size. I will try with a block-size of 2k...  
>  
>
Do you know about tails (see mount options for reiserfs), or even
better, reiser4.?


Comment 7 Piet Delaney 2006-05-17 14:53:12 UTC
Hi Fabio:

I'd recommend you use kgdb over ethernet for starters; no
wires to mess around with. Since you're using a recent
2.6.16 kernel you may want to 'just' use one of Andy
Morton's 'mm' patches, which I believe have the kgdb 
patch included. Andy used to use George Anzinger's
kgdb patches; Amit has assimilated most of these and
everyone is using the kgdb patches on kgdb.sourceforge.net.

Visit:
    http://sourceforge.net/projects/kgdb

You can pull the patch that matches your kernel with
the date option with cvs. Unfortunately there is a
conflict of interest, so the information at the site
isn't great. For example, the documentation says you can
just apply the patches for your architecture, but in fact
that causes a lot of patch rejects and it's best to apply the
complete series.

Once patched, you can add the ethernet info while booting or via
the /proc/sys/kgdboe interface:

    [piet@gateway /proc/sys]$ pwd
    /proc/sys
    [piet@gateway /proc/sys]$ cat kgdboe
    6443@192.168.110.61/mgmt,33534@10.10.111.70/00:0F:90:D5:36:C4
    [piet@gateway /proc/sys]$

You're just telling the kgdb stub what IP address to target the UDP
packets for and the ARP info for the 1st gateway.

I cd to the kernel directory and use the following .gdbinit with ddd;
a graphical debugger for gdb:
----------------------------------------------------------------------
set editing on
set radix 0x0a
set print pretty
set print array
set print union
set pagination off
# set debug remote 10
# set backtrace limit 50

# target remote udp:10.10.111.51:6443
target remote udp:192.168.110.61:6443

# set debug remote 1
# set remotebaud 115200
# set remotebaud 19200
# target remote /dev/ttyS0

break panic
# break sa_dproxy_tcp_v4_rcv
# break alloc_context
# break dealloc_context
break bp
# set preempt_audit_enabled = 1
# break sa_tcp_transmit
# break sa_tcp_transmit_skb
# break sa_pass_packet_to_server (spinlock held?)
# break sa_tcp_write_xmit
# break sa_tcp_v4_compute_complete_checksum
# break sa_ip_queue_xmit
# break ip_send_check
# break dev_queue_xmit
break sa_new_tcp_synack_timer
break sa_client_tcp_check_req
# disable
-------------------------------------------------------------

I've always found:

           George Anzinger <george@wildturkeyranch.net> 

a great source of help on kgdb; he doesn't have a kgdb consulting service. 
The mailing list at kgdb-bugreport@lists.sourceforge.net isn't working
very well, at least not for me posting articles.

If you have any questions I'll be glad to help. There are about a dozen
guys on kgdb-bugreport@lists.sourceforge.net that should be able to answer
your questions. 

The serial interface is broken for 2.6.12 and 2.6.13; it gives a stub error
'E22'. I looked through the cvs files and found the bug fix for this but then
ran into yet another bug. Questions to Amit haven't yielded the insight I 
would have liked to receive; maybe I expect too much from folks. Perhaps 
I should cough up some krugerrands for his kgdb-pro consulting....

Anyway, I'm pretty sure George can dig you out of any problem and I can
help you on easier problems. I've been using kgdb for over 10 years on
UNIX and Linux. I just wish programmers would write code with gdb in mind.

Gotta go; back to kgdb on some heavy-duty tcp/ip hacking. 

Hey, I like the portage paradigm you guys are supporting, and so does
our team leader. I wonder if we shouldn't be using Gentoo as the
platform for our product. Installing stuff like Ethereal is painful.
With portage it should be very easy (like FreeBSD ports).

Good Luck; you can call me for quick questions here at work:

-piet

Work: (408) 200-5256
Comment 8 Fabio 2006-05-19 00:20:33 UTC
I compiled 2.6.17-rc4-mm1. kgdb is included. 
This evening I will try... 
 
But after the oops my system is still usable; what do I have to do with the 
ethernet? I can't kill rsync, I can't umount my fs, but it's usable... 
 
 
Bye 
Fabio 
 
Comment 9 Piet Delaney 2006-05-19 01:50:23 UTC
Make sure kgdb is configured for ethernet, ie: .config has:

CONFIG_KGDB=y
CONFIG_KGDBOE_NOMODULE=y
CONFIG_KGDBOE=y

then on boot you should see:
-----------------------------------------------------------------------------
[   74.613783]         kgdboe: configuration incorrect - kgdboe not loaded.
[   74.657729]           Usage: kgdboe=[src-port]@[src-ip]/[dev],[tgt-port]@<tgt-ip>/[tgt-macaddr]
[   74.714800]        kgdb: Could not setup core I/O for KGDB.
[   74.751342]        kgdb: Defering I/O setup to kernel module.
-----------------------------------------------------------------------------
if you didn't set the kgdboe info at boot. 


The GRUB boot entry should look like:
...........................................................................
title 9: Slot 2 - www.kernel.org vmlinuz-2.6.12-mm10 kgdboe
root (hd0,2)
kernel /boot/vmlinuz-2.6.12-mm10  root=/dev/hda3 console=ttyS0,19200 panic=10 kgdboe=6443@192.168.110.61/eth0,33534@10.10.111.70/00:0F:90:D5:36:C4
............................................................................
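
Since a malformed kgdboe string only shows up as "configuration incorrect" at boot, it can help to sanity-check the value before rebooting. The following is a minimal sketch, not part of kgdb: `parse_kgdboe` is a hypothetical helper, and the only thing it assumes is the layout printed in the usage line above.

```python
import re

# Layout assumed from the usage line printed by the stub:
#   [src-port]@[src-ip]/[dev],[tgt-port]@<tgt-ip>/[tgt-macaddr]
# Bracketed fields are optional there, so they may be empty here.
KGDBOE_RE = re.compile(
    r"^(?P<src_port>\d*)@(?P<src_ip>[^/]*)/(?P<dev>[^,]*),"
    r"(?P<tgt_port>\d*)@(?P<tgt_ip>[^/]+)/(?P<tgt_mac>.*)$"
)

def parse_kgdboe(value):
    """Split a kgdboe= value into its six fields (hypothetical helper)."""
    match = KGDBOE_RE.match(value.strip())
    if match is None:
        raise ValueError("does not look like a kgdboe string: %r" % value)
    return match.groupdict()

cfg = parse_kgdboe(
    "6443@192.168.110.61/eth0,33534@10.10.111.70/00:0F:90:D5:36:C4"
)
for name, field in cfg.items():
    print("%-8s %s" % (name, field))
```

In the examples in this thread, the first half (src-port and src-ip) is the address that gdb is later pointed at with `target remote udp:...`.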

I'm using a modified kgdboe that allows me to set the kgdboe info via /proc;
that may be in your code since it's very new; I'm still on 2.6.13.

With the new kgdboe code you can have something like this in your
/etc/rc.local file:
.............................................................................
#
#
# Enable kgdb if compiled in the kernel
#
if [[ -e /proc/sys/kgdboe ]]; then
        echo "rc.local: Initialize kgdb";
        echo "rc.local: 6443@192.168.110.61/mgmt,33534@10.10.111.70/00:0F:90:D5:36:C4   > /proc/sys/kgdboe";
        echo           "6443@192.168.110.61/mgmt,33534@10.10.111.70/00:0F:90:D5:36:C4"  > /proc/sys/kgdboe;
fi
.............................................................................

I'd first try making sure gdb works with the stub. Then I'd try reproducing
the bug that's causing the NULL pointer dereference. Since it doesn't
cause a panic you likely need to put a breakpoint in/at die_if_kernel()
or in the path of the problem, likely do_page_fault().

Make sure gdb and the stub is working before you try to work on the problem.
I like to put a breakpoint at a common place, tcp_sendmsg() for example and
then ssh to the target to make sure gdb is working.

Once kgdb is working, put a breakpoint where the printf is occurring and
then reproduce the bug.

I find it a lot easier to keep things simple and compile all modules into
the kernel. Using kgdb on modules does work but it's not as easy.

I'm getting tired and will be leaving soon; you may want to call if this
isn't clear. It's 1:50 now, I'll hang around for another 10 mins.

Don't forget you need to target your host with gdb. I do this in 
my .gdbinit file, which I keep in the top linux dir where you do the
make config. In my .gdbinit I have:

     target remote udp:192.168.110.61:6443

This tells gdb where on the internet to send its ptrace packets,
which travel inside udp packets.


-piet
Comment 10 Piet Delaney 2006-05-19 02:14:05 UTC
Good luck.

-piet
Comment 11 Fabio 2006-05-21 02:53:55 UTC
With 2.6.17-rc4-mm1 I had some difficulty reproducing the oops (all fine or 
system hangs....) 
So I recompiled 2.6.16.16 with 
 
CONFIG_REISERFS_CHECK=y 
CONFIG_DEBUG_KERNEL=y 
CONFIG_DEBUG_BUGVERBOSE=y 
CONFIG_DEBUG_INFO=y 
CONFIG_DEBUG_FS=y 
 
I think I found something useful... 
Now my system oopses early on boot: 
 
ReiserFS: dm-3: found reiserfs format "3.6" with standard journal 
ReiserFS: dm-3: warning: CONFIG_REISERFS_CHECK is set ON 
ReiserFS: dm-3: warning: - it is slow mode for debugging. 
ReiserFS: dm-3: using ordered data mode 
ReiserFS: dm-3: journal params: device dm-3, size 3965, journal first block 
130, max trans len 128, max batch 112, max commit age 30, max trans age 30 
ReiserFS: dm-3: checking transaction log (dm-3) 
ReiserFS: dm-3: journal-1153: found in header: first_unflushed_offset 3037, 
last_flushed_trans_id 5588 
ReiserFS: dm-3: journal-1206: Starting replay from offset 24004572220381, 
trans_id 0 
ReiserFS: dm-3: journal-1299: Setting newest_mount_id to 24 
REISERFS: panic (device Null superblock): reiserfs[5345]: assertion !
( !p_s_key || p_s_chk_path->path_length < 2 || p_s_chk_path->path_length > 5 ) 
failed at fs/reiserfs/stree.c:350:key_in_buffer: PAP-5050: pointer to the 
key(dea16d58) is NULL or invalid path length(6) 
 
------------[ cut here ]------------ 
kernel BUG at fs/reiserfs/prints.c:362! 
invalid opcode: 0000 [#1] 
PREEMPT 
Modules linked in: jfs dm_mod yenta_socket rsrc_nonstatic pcmcia pcmcia_core 
firmware_class cpufreq_ondemand rfcomm l2cap bluetooth 8139cp 
CPU:    0 
EIP:    0060:[<b01bf3ef>]    Not tainted VLI 
EFLAGS: 00010282   (2.6.16.16-dbg #3) 
EIP is at reiserfs_panic+0x4f/0x80 
eax: 00000115   ebx: b03685ef   ecx: dff2e000   edx: 00000000 
esi: 00000000   edi: 00000130   ebp: dea16bd4   esp: dea16bbc 
ds: 007b   es: 007b   ss: 0068 
Process mount (pid: 5345, threadinfo=dea16000 task=dea1d550) 
Stack: <0>b037256c b03685ef b044f8a0 ded97800 b03774d8 0000015e dea16d40 
b01c8e06 
       00000000 b03774d8 000014e1 0000015e b03546b7 dea16d58 00000006 debe1574 
       dea1d550 dea16000 dea16c7c 00000000 dea16c84 0bf88b80 00000000 dea16c28 
Call Trace: 
 [<b0103ffa>] show_stack_log_lvl+0xaa/0xe0 
 [<b010420e>] show_registers+0x18e/0x210 
 [<b0104421>] die+0xf1/0x190 
 [<b0104549>] do_trap+0x89/0xd0 
 [<b01048c8>] do_invalid_op+0xb8/0xd0 
 [<b0103c17>] error_code+0x4f/0x54 
 [<b01c8e06>] search_by_key+0x1466/0x1cb0 
 [<b01aec9c>] reiserfs_read_locked_inode+0x7c/0x140 
 [<b01bda12>] reiserfs_fill_super+0x612/0x850 
 [<b0168729>] get_sb_bdev+0xf9/0x170 
 [<b01be64f>] get_super_block+0x2f/0x40 
 [<b01689b6>] do_kern_mount+0x56/0xd0 
 [<b0180558>] do_new_mount+0x58/0xb0 
 [<b0180c21>] do_mount+0x1a1/0x1d0 
 [<b0180fb7>] sys_mount+0x97/0xe0 
 [<b010308b>] sysenter_past_esp+0x54/0x75 
Code: 01 00 00 89 04 24 e8 b1 fc ff ff c7 04 24 6c 25 37 b0 85 f6 89 d8 0f 45 
c7 ba a0 f8 44 b0 89 54 24 08 89 44 24 04 e8 c1 f6 f5 ff <0f> 0b 6a 01 b0 8b 
36 b0 c7 04 24 90 25 37 b0 85 f6 b9 a0 f8 44 
 
I think this: 
 
!( !p_s_key || p_s_chk_path->path_length < 2 || p_s_chk_path->path_length > 5 ) 
 
is a piece of source code. Can this line be the one containing the bug? 
 
Without CONFIG_REISERFS_CHECK=y my system does not oops during boot, but 
later during the "emerge --sync"... there is an inconsistency on disk. But I 
have reformatted the fs many times since I have had this problem, so it's my 
kernel that corrupts the fs and then, as a consequence, oopses... I think. 
 
Now I will retry with 2.6.17-rc4-mm1 with kgdb... 
 
Bye 
Fabio 
 
Comment 12 Fabio 2006-06-09 08:41:23 UTC
During this time I ran a lot of tests.
I have been able to reproduce the problem only on my notebook, not on other PCs 
with an identical software configuration. I thought about gcc... (Gentoo compiles 
everything). Every machine has a different update history: on some machines I 
updated gcc last, on other PCs I updated gcc first, and on some 
other machines in pseudo-random order (it depends on how much time I had on 
hand); on my notebook I don't remember.

I updated gcc from 3.4.5 to 3.4.6, then I recompiled the kernel and 
reiserfsprogs, then rebooted.
I re-initialized my /var/portage with blksize 512 (therefore with mkreiserfs 
compiled with gcc-3.4.6) and the problem disappeared.
I'm running "emerge --sync" every time I can and everything works fine.

Perhaps it was a GCC problem with my CFLAGS and the reiserfsprogs sources...
I don't know, but with gcc-3.4.6 I have no problems.

I'll stay tuned...

Tnx
Fabio
Comment 13 Piet Delaney 2006-06-09 13:39:31 UTC
Might be reasonable to let Andy Morton <akpm@osdl.org> know about this apparent
Kernel/C-Compiler problem. 

What happened with you trying out kgdb in the mm release?

-piet
Comment 14 Fabio 2006-06-13 16:30:30 UTC
Sorry, but I haven't been able to use kgdb.
I used netcat on another PC on the LAN but I never saw a packet...
I think hacking the kernel is not for me :-(.

But I have news: for the last 2 days the problem has been back.

After running the famous "emerge --sync" I noticed it hangs.
dmesg showed me these messages:

ReiserFS: warning: is_tree_node: node level 33515 does not match to the 
expected one 1
ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block 
1801657. Fsck?
ReiserFS: dm-3: warning: vs-13050: reiserfs_update_sd: i/o failure occurred 
trying to update [123610 143253 0x0 SD] stat data
ReiserFS: warning: is_tree_node: node level 33515 does not match to the 
expected one 1
ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block 
1801657. Fsck?
ReiserFS: dm-3: warning: vs-13050: reiserfs_update_sd: i/o failure occurred 
trying to update [123610 143253 0x0 SD] stat data
ReiserFS: warning: is_tree_node: node level 33515 does not match to the 
expected one 1
ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block 
1801657. Fsck?
ReiserFS: dm-3: warning: vs-13050: reiserfs_update_sd: i/o failure occurred 
trying to update [123610 143253 0x0 SD] stat data
ReiserFS: warning: is_tree_node: node level 33515 does not match to the 
expected one 1
ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block 
1801657. Fsck?
ReiserFS: dm-3: warning: vs-13050: reiserfs_update_sd: i/o failure occurred 
trying to update [123610 143253 0x0 SD] stat data
ReiserFS: warning: is_tree_node: node level 33515 does not match to the 
expected one 1
ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block 
1801657. Fsck?
ReiserFS: dm-3: warning: vs-13050: reiserfs_update_sd: i/o failure occurred 
trying to update [123610 128043 0x0 SD] stat data
ReiserFS: warning: is_tree_node: node level 33515 does not match to the 
expected one 1
ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block 
1801657. Fsck?
ReiserFS: dm-3: warning: vs-5657: reiserfs_do_truncate: i/o failure occurred 
trying to truncate [123610 128043 0xfffffffffffffff DIRECT]
ReiserFS: warning: is_tree_node: node level 33515 does not match to the 
expected one 1
ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block 
1801657. Fsck?
ReiserFS: dm-3: warning: vs-13050: reiserfs_update_sd: i/o failure occurred 
trying to update [123610 128044 0x0 SD] stat data
ReiserFS: warning: is_tree_node: node level 33515 does not match to the 
expected one 1
ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block 
1801657. Fsck?
ReiserFS: dm-3: warning: vs-5657: reiserfs_do_truncate: i/o failure occurred 
trying to truncate [123610 128044 0xfffffffffffffff DIRECT]
ReiserFS: warning: is_tree_node: node level 29487 does not match to the 
expected one 1
ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block 
416529. Fsck?
ReiserFS: warning: is_tree_node: node level 29487 does not match to the 
expected one 1
ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block 
416529. Fsck?
ReiserFS: warning: is_tree_node: node level 29487 does not match to the 
expected one 1
ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block 
416529. Fsck?
ReiserFS: warning: is_tree_node: node level 29487 does not match to the 
expected one 1
ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block 
416529. Fsck?
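
The repeated warnings above name only a couple of distinct blocks. A minimal sketch, assuming the dmesg text has been pasted into a string (the helper name `bad_blocks` is made up for this example), that tallies the block numbers from the vs-5150 lines even when the paste is hard-wrapped:

```python
import re
from collections import Counter

VS_5150 = re.compile(
    r"vs-5150: search_by_key: invalid format found in block (\d+)"
)

def bad_blocks(dmesg_text):
    """Count how often each block number appears in vs-5150 warnings."""
    # Pasted logs are often wrapped mid-message, so collapse whitespace first.
    flat = " ".join(dmesg_text.split())
    return Counter(int(number) for number in VS_5150.findall(flat))

sample = """ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block
1801657. Fsck?
ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block
416529. Fsck?
ReiserFS: dm-3: warning: vs-5150: search_by_key: invalid format found in block
1801657. Fsck?"""

for block, count in bad_blocks(sample).most_common():
    print(block, count)
```

The resulting block numbers can then be compared with what fsck reports for the same filesystem.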


I unmounted the fs and ran fsck:

Replaying journal..
Reiserfs journal '/dev/vg00/lvol4' in blocks [130..4096]: 0 transactions
replayed
Checking internal tree../  8 (of   9)/ 11 (of  16)/  9 (of  12)bad_internal:
vpf-10320: block 614475, items 3 and 4: The wrong order of items: [123610
143253 0xd601 IND (1)], [123610 143253 0xca01 IND (1)]
the problem in the internal node occured (614475), whole subtree is skipped
finished
Comparing bitmaps..vpf-10640: The on-disk and the correct bitmaps differs.
Bad nodes were found, Semantic pass skipped
1 found corruptions can be fixed only when running with --rebuild-tree


Then with --rebuild-tree:

Pass 0:
####### Pass 0 #######
Loading on-disk bitmap .. ok, 385806 blocks marked used
Skipping 4607 blocks (super block, journal, bitmaps) 381199 blocks will be
read
0%....20%....40%....60%....80%....100%                       left 0, 21177
/sec
143233 directory entries were hashed with "r5" hash.
        "r5" hash is selected
Flushing..finished
        Read blocks (but not data blocks) 381199
                Leaves among those 47347
                Objectids found 143235

Pass 1 (will try to insert 47347 leaves):
####### Pass 1 #######
Looking for allocable blocks .. finished
0%....20%....40%....60%....80%....100%                        left 0, 4304
/sec
Flushing..finished
        47347 leaves read
                47054 inserted
                293 not inserted
####### Pass 2 #######

Pass 2:
0%....20%....40%....60%....80%....100%                           left 0, 0
/sec
Flushing..finished
        Leaves inserted item by item 293
Pass 3 (semantic):
####### Pass 3 #########
/sys-devel/gccrebuild_semantic_pass: The entry [123610 143253] ("Manifest") in
directory [122875 123610] points to nowhere - is removed
/sys-devel/gccrebuild_semantic_pass: The entry [123610 128042]
("gcc-4.0.3.ebuild") in directory [122875 123610] points to nowhere - is
removed
rebuild_semantic_pass: The entry [123610 128046] ("metadata.xml") in directory
[122875 123610] points to nowhere - is removed
vpf-10650: The directory [122875 123610] has the wrong size in the StatData
(928) - corrected to (840)
Flushing..finished
        Files found: 120136
        Directories found: 23096
        Names pointing to nowhere (removed): 3
Pass 3a (looking for lost dir/files):
####### Pass 3a (lost+found pass) #########
Looking for lost directories:
Flushing..finished9, 418 /sec
Pass 4 - finished       done 0, 0 /sec
        Deleted unreachable items 1
Flushing..finished
Syncing..finished

After remounting the fs I ran "emerge --sync" again and the system rebooted!
During the boot: oops and panic... I lost the messages.
Another reboot: another oops, without panic.

I compiled 2.6.16.20 with 

CONFIG_REISERFS_CHECK=y
CONFIG_DEBUG_KERNEL=y
CONFIG_DEBUG_SLAB=y
CONFIG_DEBUG_PREEMPT=y
CONFIG_DEBUG_BUGVERBOSE=y
CONFIG_DEBUG_INFO=y
CONFIG_DEBUG_FS=y
CONFIG_DEBUG_STACKOVERFLOW=y
CONFIG_DEBUG_STACK_USAGE=y

I rebooted and the system oopses during boot...
These are the messages I captured:

ReiserFS: dm-3: found reiserfs format "3.6" with standard journal
ReiserFS: dm-3: warning: CONFIG_REISERFS_CHECK is set ON
ReiserFS: dm-3: warning: - it is slow mode for debugging.
ReiserFS: dm-3: using ordered data mode
ReiserFS: dm-3: journal params: device dm-3, size 3965, journal first block 
130, max trans len 128, max batch 112, max commit age 30, max trans age 30
ReiserFS: dm-3: checking transaction log (dm-3)
ReiserFS: dm-3: journal-1153: found in header: first_unflushed_offset 2332, 
last_flushed_trans_id 1182
ReiserFS: dm-3: journal-1206: Starting replay from offset 5080946313500, 
trans_id 0
ReiserFS: dm-3: journal-1299: Setting newest_mount_id to 13
REISERFS: panic (device Null superblock): reiserfs[5344]: assertion !
( !p_s_key || p_s_chk_path->path_length < 2 || p_s_chk_path->path_length > 5 ) 
failed at fs/reiserfs/stree.c:350:key_in_buffer: PAP-5050: pointer to the 
key(ded92d58) is NULL or invalid path length(6)

------------[ cut here ]------------
kernel BUG at fs/reiserfs/prints.c:362!
invalid opcode: 0000 [#1]
PREEMPT
Modules linked in: jfs dm_mod yenta_socket rsrc_nonstatic pcmcia pcmcia_core 
firmware_class cpufreq_ondemand rfcomm l2cap bluetooth 8139cp
CPU:    0
EIP:    0060:[<b01c496f>]    Not tainted VLI
EFLAGS: 00010282   (2.6.16.20-trace #4)
EIP is at reiserfs_panic+0x4f/0x80
eax: 00000115   ebx: b0373750   ecx: dff36000   edx: 10000000
esi: 00000000   edi: 00000130   ebp: ded92bd4   esp: ded92bbc
ds: 007b   es: 007b   ss: 0068
Process mount (pid: 5344, threadinfo=ded92000 task=dede7550)
Stack: <0>b037d6d0 b0373750 b045a8a0 df0275f8 b038263c 0000015e ded92d40 
b01ce386
       00000000 b038263c 000014e0 0000015e b035f3d7 ded92d58 00000006 ddd374dc
       dede7550 ded92000 ded92c7c 00000000 ded92c84 0bf88b80 00000000 ded92c28
Call Trace:
 [<b010410a>] show_stack_log_lvl+0xaa/0xe0
 [<b010432f>] show_registers+0x19f/0x220
 [<b0104544>] die+0xf4/0x1a0
 [<b0104679>] do_trap+0x89/0xd0
 [<b01049f8>] do_invalid_op+0xb8/0xd0
 [<b0103d27>] error_code+0x4f/0x54
 [<b01ce386>] search_by_key+0x1466/0x1cb0
 [<b01b418c>] reiserfs_read_locked_inode+0x7c/0x140
 [<b01c2f7d>] reiserfs_fill_super+0x63d/0x860
 [<b016c1a9>] get_sb_bdev+0xf9/0x170
 [<b01c3bcf>] get_super_block+0x2f/0x40
 [<b016c436>] do_kern_mount+0x56/0xd0
 [<b0184ca8>] do_new_mount+0x58/0xb0
 [<b01853c1>] do_mount+0x1a1/0x1d0
 [<b0185767>] sys_mount+0x97/0xe0
 [<b010319b>] sysenter_past_esp+0x54/0x75
Code: 01 00 00 89 04 24 e8 b1 fc ff ff c7 04 24 d0 d6 37 b0 85 f6 89 d8 0f 45 
c7 ba a0 a8 45 b0 89 54 24 08 89 44 24 04 e8 01 a9 f5 ff <0f> 0b 6a 01 11 3d 
37 b0 c7 04 24 f4 d6 37 b0 85 f6 b9 a0 a8 45



In stree.c there is a test:

        RFALSE(!p_s_key || p_s_chk_path->path_length < FIRST_PATH_ELEMENT_OFFSET
               || p_s_chk_path->path_length > MAX_HEIGHT,
               "PAP-5050: pointer to the key(%p) is NULL or invalid path length(%d)",
               p_s_key, p_s_chk_path->path_length);


In my case it becomes 

( !p_s_key || p_s_chk_path->path_length < 2 || p_s_chk_path->path_length > 5 )

with p_s_chk_path->path_length=6     ?!?!?!?
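
To make the arithmetic explicit, here is a small sketch of the same condition, using only the two constants that appear in the expanded assertion (2 and 5). It is an illustration of the check, not kernel code; `pap_5050_trips` is a made-up name.

```python
FIRST_PATH_ELEMENT_OFFSET = 2  # lower bound shown in the expanded assertion
MAX_HEIGHT = 5                 # upper bound shown in the expanded assertion

def pap_5050_trips(key_is_null, path_length):
    """True when the RFALSE condition holds, i.e. when the kernel would panic."""
    return (key_is_null
            or path_length < FIRST_PATH_ELEMENT_OFFSET
            or path_length > MAX_HEIGHT)

# The key pointer in the panic message is non-NULL, so only the length matters.
print(pap_5050_trips(False, 6))  # True
print(pap_5050_trips(False, 5))  # False
```

So a valid key with a path length of 6 is enough on its own to trigger PAP-5050.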

I reinitialized the fs many times with the same result.
If I use blksize = 1024 there is no problem (as at this moment); if I go 
back to 512 the kernel oopses...

Could it be that 

   p_s_chk_path->path_length > MAX_HEIGHT

is not valid/correct for blksize=512...?

Thank you very much.
It is late here, good night.

Fabio
Comment 15 Vladimir V. Saveliev 2006-06-14 03:03:16 UTC
With 512-byte blocks the reiserfs tree grows too high, and 
MAX_HEIGHT, which is 5 by default, is not enough. You may want to try increasing
it to 7, for example.
PS: I did not try it and cannot guarantee that it is enough, but it is a necessary
change anyway.
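
A back-of-the-envelope sketch of why small blocks make the tree taller. Everything numeric here is an assumption for illustration: a 24-byte block header, 16-byte keys and 8-byte child pointers in internal nodes, a fill factor picked by hand, and the leaf count (47347) taken from the --rebuild-tree output in comment 14 and reused for every block size, which overstates the larger ones. The level count is a rough estimate and is not the same thing as the kernel's path_length.

```python
# Assumed on-disk sizes in bytes for internal nodes (illustrative values).
BLOCK_HEAD_SIZE = 24
KEY_SIZE = 16
CHILD_POINTER_SIZE = 8

def max_children(block_size):
    """Most child pointers one internal node can hold (n keys, n + 1 pointers)."""
    usable = block_size - BLOCK_HEAD_SIZE - CHILD_POINTER_SIZE
    return usable // (KEY_SIZE + CHILD_POINTER_SIZE) + 1

def estimated_levels(leaves, block_size, fill=1.0):
    """Rough number of tree levels, leaf level included."""
    fanout = max(2, int(max_children(block_size) * fill))
    levels, nodes = 1, leaves
    while nodes > 1:
        nodes = -(-nodes // fanout)  # ceiling division
        levels += 1
    return levels

LEAVES = 47347  # leaf count from the fsck run in comment 14

for block_size in (512, 1024, 4096):
    print(block_size,
          max_children(block_size),
          estimated_levels(LEAVES, block_size),
          estimated_levels(LEAVES, block_size, fill=0.5))
```

Under these assumptions a 512-byte internal node holds at most 21 children against 170 for a 4k node, so the same number of leaves needs noticeably more levels.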
Comment 16 Fabio 2006-06-14 04:19:44 UTC
Thank you for your explanation.
I think I will use blksize = 1024.

But shouldn't this condition be caught and reported as an I/O error rather 
than causing an oops and/or panic?

Thank you very much.
Bye
Fabio
Comment 17 Natalie Protasevich 2007-07-06 22:22:37 UTC
Any updates on this, does the problem still exist?
Thanks.
Comment 18 Fabio 2007-07-07 03:50:53 UTC
Since I started using blksize = 1024 there have been no problems...

I think Vladimir is right ("with 512 bytes blocks reiserfs tree grows too high").
But in my opinion the error should be caught and reported to user space.

Bye
Fabio
Comment 19 Andrew Morton 2007-07-25 15:00:47 UTC
Vladimir, do you think that any kernel changes are needed to address this?  If not, we should close the bug.

Thanks.
Comment 20 Natalie Protasevich 2008-03-23 23:36:13 UTC
Since there has been no more activity for a while (and the issue appears to be resolved), closing this bug.
