Bug 9109

Summary: Assertion failure in journal_stop()
Product: File System Reporter: Tefnet developers (developers)
Component: ext3Assignee: Andrew Morton (akpm)
Status: REJECTED INSUFFICIENT_DATA    
Severity: high CC: patrick.j.rice
Priority: P1    
Hardware: All   
OS: Linux   
Kernel Version: 2.6.23-rc3 Subsystem:
Regression: --- Bisected commit-id:
Attachments: Kernel configuration
Dmesg after reset

Description Tefnet developers 2007-10-02 00:48:29 UTC
Most recent kernel where this bug did not occur: not known, new production server
Distribution: gentoo
Hardware Environment:
model name      : Intel(R) Core(TM)2 Duo CPU     E6750  @ 2.66GHz
wiga services # free -m
             total       used       free     shared    buffers     cached
Mem:          3040       1025       2015          0        243        383
-/+ buffers/cache:        397       2642
Swap:         6149          0       6149
wiga services # 

sd 0:0:0:0: [sda] 488395055 512-byte hardware sectors (250058 MB)
sd 1:0:0:0: [sdb] 488395055 512-byte hardware sectors (250058 MB)

/dev/md5 on / type ext3 (rw,noatime,data=ordered)
/dev/md6 on /home type ext3 (rw,noexec,noatime,data=ordered)
/dev/md3 on /var type ext3 (rw,noexec,noatime,data=ordered)
/dev/sda2 on /var/spool/squid/1 type ext3 (rw,noatime,data=ordered)
/dev/sdb2 on /var/spool/squid/2 type ext3 (rw,noatime,data=ordered)

wiga services # lspci
00:00.0 Host bridge: Intel Corporation Unknown device 29c0 (rev 02)
00:02.0 VGA compatible controller: Intel Corporation Unknown device 29c2 (rev 02)
00:1a.0 USB Controller: Intel Corporation Unknown device 2937 (rev 02)
00:1a.1 USB Controller: Intel Corporation Unknown device 2938 (rev 02)
00:1a.2 USB Controller: Intel Corporation Unknown device 2939 (rev 02)
00:1a.7 USB Controller: Intel Corporation Unknown device 293c (rev 02)
00:1c.0 PCI bridge: Intel Corporation Unknown device 2940 (rev 02)
00:1c.4 PCI bridge: Intel Corporation Unknown device 2948 (rev 02)
00:1c.5 PCI bridge: Intel Corporation Unknown device 294a (rev 02)
00:1d.0 USB Controller: Intel Corporation Unknown device 2934 (rev 02)
00:1d.1 USB Controller: Intel Corporation Unknown device 2935 (rev 02)
00:1d.2 USB Controller: Intel Corporation Unknown device 2936 (rev 02)
00:1d.7 USB Controller: Intel Corporation Unknown device 293a (rev 02)
00:1e.0 PCI bridge: Intel Corporation 82801 PCI Bridge (rev 92)
00:1f.0 ISA bridge: Intel Corporation Unknown device 2916 (rev 02)
00:1f.2 IDE interface: Intel Corporation Unknown device 2920 (rev 02)
00:1f.3 SMBus: Intel Corporation Unknown device 2930 (rev 02)
00:1f.5 IDE interface: Intel Corporation Unknown device 2926 (rev 02)
02:00.0 IDE interface: JMicron Technologies, Inc. Unknown device 2368
03:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. Unknown device 8168 (rev 01)
04:00.0 Ethernet controller: Realtek Semiconductor Co., Ltd. RTL-8139/8139C/8139C+ (rev 10)
04:07.0 FireWire (IEEE 1394): Texas Instruments TSB43AB23 IEEE-1394a-2000 Controller (PHY/Link)
wiga services # 

Software Environment:

Linux wiga 2.6.23-rc3-tefserver #1 SMP Wed Aug 29 00:22:25 CEST 2007 i686 Intel(R) Core(TM)2 Duo CPU E6750 @ 2.66GHz GenuineIntel GNU/Linux

gcc (GCC) 4.1.2 (Gentoo 4.1.2 p1.0.1)

Problem Description:
Oct  1 17:19:09 wiga kernel: Assertion failure in journal_stop() at fs/jbd/transaction.c:1335: "transaction->t_updates > 0"
Oct  1 17:19:09 wiga kernel: ------------[ cut here ]------------
Oct  1 17:19:09 wiga kernel: kernel BUG at fs/jbd/transaction.c:1335!
Oct  1 17:19:09 wiga kernel: invalid opcode: 0000 [#1]
Oct  1 17:19:09 wiga kernel: SMP 
Oct  1 17:19:09 wiga kernel: Modules linked in: nls_utf8 nls_iso8859_2 tun ipt_REJECT xt_tcpudp ipt_owner ipt_MASQUERADE iptable_nat nf_nat nf_conntrack_ipv4
 nf_conntrack iptable_filter ip_tables x_tables ide_disk jmicron generic ide_core
Oct  1 17:19:09 wiga kernel: CPU:    0
Oct  1 17:19:09 wiga kernel: EIP:    0060:[<c0198893>]    Not tainted VLI
Oct  1 17:19:09 wiga kernel: EFLAGS: 00010292   (2.6.23-rc3-tefserver #1)
Oct  1 17:19:09 wiga kernel: EIP is at journal_stop+0x91/0x1b8
Oct  1 17:19:09 wiga kernel: eax: 00000062   ebx: 00000000   ecx: 00000400   edx: 00000082
Oct  1 17:19:09 wiga kernel: esi: eb0b4140   edi: f7f47200   ebp: f7a175a4   esp: f02d5dfc
Oct  1 17:19:09 wiga kernel: ds: 007b   es: 007b   fs: 00d8  gs: 0000  ss: 0068
Oct  1 17:19:09 wiga kernel: Process pdflush (pid: 16554, ti=f02d4000 task=d4076560 task.ti=f02d4000)
Oct  1 17:19:09 wiga kernel: Stack: c0341c00 c02fb69b c0344074 00000537 c0344178 00000000 00000000 f7f47000 
Oct  1 17:19:09 wiga kernel: c02faf41 f7a175a4 c0194a41 00000000 cc624990 00000000 c01903f3 00001000 
Oct  1 17:19:09 wiga kernel: 00000000 c018d9e9 f02d5ed4 00000286 f02d5f7c d63d8324 00000000 00000000 
Oct  1 17:19:09 wiga kernel: Call Trace:
Oct  1 17:19:09 wiga kernel: [<c0194a41>] __ext3_journal_stop+0x19/0x34
Oct  1 17:19:09 wiga kernel: [<c01903f3>] ext3_ordered_writepage+0x134/0x189
Oct  1 17:19:09 wiga kernel: [<c018d9e9>] bput_one+0x0/0x7
Oct  1 17:19:09 wiga kernel: [<c0144c4c>] __writepage+0x8/0x21
Oct  1 17:19:09 wiga kernel: [<c014501e>] write_cache_pages+0x166/0x279
Oct  1 17:19:09 wiga kernel: [<c0144c44>] __writepage+0x0/0x21
Oct  1 17:19:09 wiga kernel: [<c0118c66>] enqueue_entity+0x1f3/0x217
Oct  1 17:19:09 wiga kernel: [<c0145150>] generic_writepages+0x1f/0x26
Oct  1 17:19:09 wiga kernel: [<c0145180>] do_writepages+0x29/0x30
Oct  1 17:19:09 wiga kernel: [<c0170b61>] __writeback_single_inode+0x198/0x308
Oct  1 17:19:09 wiga kernel: [<c0170fac>] sync_sb_inodes+0x168/0x211
Oct  1 17:19:09 wiga kernel: [<c01711fb>] writeback_inodes+0x63/0xa2
Oct  1 17:19:09 wiga kernel: [<c01459be>] pdflush+0x0/0x1aa
Oct  1 17:19:09 wiga kernel: [<c01456fc>] wb_kupdate+0x7b/0xdf
Oct  1 17:19:09 wiga kernel: [<c0145ad4>] pdflush+0x116/0x1aa
Oct  1 17:19:09 wiga kernel: [<c0145681>] wb_kupdate+0x0/0xdf
Oct  1 17:19:09 wiga kernel: [<c012e1b9>] kthread+0x38/0x5f
Oct  1 17:19:09 wiga kernel: [<c012e181>] kthread+0x0/0x5f
Oct  1 17:19:09 wiga kernel: [<c0104937>] kernel_thread_helper+0x7/0x10
Oct  1 17:19:09 wiga kernel: =======================
Oct  1 17:19:09 wiga kernel: Code: 44 24 10 78 41 34 c0 c7 44 24 0c 37 05 00 00 c7 44 24 08 74 40 34 c0 c7 44 24 04 9b b6 2f c0 c7 04 24 00 1c 34 c0 e8 78 67
 f8 ff <0f> 0b eb fe c7 44 24 14 fb ff ff ff 8b 45 08 48 85 c0 89 45 08 
Oct  1 17:19:09 wiga kernel: EIP: [<c0198893>] journal_stop+0x91/0x1b8 SS:ESP 0068:f02d5dfc

Steps to reproduce:
Unknown, server just stopped running some processes (namely slapd, preventing logins). Had to have it reset.
Comment 1 Tefnet developers 2007-10-02 00:50:05 UTC
Created attachment 13015 [details]
Kernel configuration

Config file of kernel affected by this bug.
Comment 2 Tefnet developers 2007-10-02 01:02:21 UTC
Created attachment 13016 [details]
Dmesg after reset

Shows device initialization etc.
Comment 3 Pat Rice 2008-06-23 05:18:46 UTC
hi all
any updates on this ???
I'm presuming that we have an issue handling disks ?

Thanks
Pat