Bug 76581 - Host freeze under heavy FS RW (Bacula backup)
Status: RESOLVED OBSOLETE
Alias: None
Product: File System
Classification: Unclassified
Component: XFS
Hardware: x86-64 Linux
Importance: P1 normal
Assignee: XFS Guru
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2014-05-21 09:49 UTC by Vitaly L. Fadeev
Modified: 2016-09-22 16:24 UTC

See Also:
Kernel Version: 3.14.4-gentoo
Tree: Mainline
Regression: No


Attachments
some output from console (8.90 KB, text/plain)
2014-05-21 09:49 UTC, Vitaly L. Fadeev

Description Vitaly L. Fadeev 2014-05-21 09:49:38 UTC
Created attachment 136901
some output from console

I get random host freezes while Bacula is running jobs, or while rsyncing.
Please tell me what I need to do to help you debug this problem.
I ran kexec -p /boot/kernel-genkernel-x86_64-3.14.4-gentoo --initrd=/boot/initramfs-genkernel-x86_64-3.14.4-gentoo --append="root=/dev/sda4 dolvm single irqpoll maxcpus=1 reset_devices"
and am now waiting for the next freeze so I can capture a kernel dump.
I do not really know which subsystem has the problem.

Panic that I found on the serial line:
backup login: [ 7435.046194] Kernel panic - not syncing: Watchdog detected hard LOCKUP on cpu 1
[ 7435.053535] CPU: 1 PID: 0 Comm: swapper/1 Not tainted 3.14.4-gentoo #1
[ 7435.060156] Hardware name: Supermicro X8DTN/X8DTN, BIOS 2.1c       10/28/2011
[ 7435.067396]  0000000000000000 ffff880332806c38 ffffffff8165ff15 ffffffff8185ef60
[ 7435.074978]  ffff880332806cb0 ffffffff8165986a ffffffff00000010 ffff880332806cc0
[ 7435.082567]  ffff880332806c60 ffff88033280bda0 0000000000000001 000000000000073e
[ 7435.090148] Call Trace:
[ 7435.092640]  <NMI>  [<ffffffff8165ff15>] dump_stack+0x4d/0x66
[ 7435.098511]  [<ffffffff8165986a>] panic+0xc7/0x1d5
[ 7435.103380]  [<ffffffff811176a0>] ? restart_watchdog_hrtimer+0x50/0x50
[ 7435.110007]  [<ffffffff81117762>] watchdog_overflow_callback+0xc2/0xd0
[ 7435.116633]  [<ffffffff811566ee>] __perf_event_overflow+0x8e/0x330
[ 7435.122904]  [<ffffffff81154ce8>] ? perf_event_update_userpage+0x148/0x2a0
[ 7435.129888]  [<ffffffff81154ba0>] ? perf_event_task_disable+0x90/0x90
[ 7435.136423]  [<ffffffff81157734>] perf_event_overflow+0x14/0x20
[ 7435.142443]  [<ffffffff8101ef16>] intel_pmu_handle_irq+0x1c6/0x3b0
[ 7435.148720]  [<ffffffff8113d171>] ? function_test_events_call+0xd1/0xe0
[ 7435.155440]  [<ffffffff810d1090>] ? rcu_nmi_enter+0x60/0x60
[ 7435.161096]  [<ffffffff8166aa2b>] perf_event_nmi_handler+0x2b/0x50
[ 7435.167368]  [<ffffffff81669f05>] nmi_handle.isra.3+0xf5/0x3b0
[ 7435.173287]  [<ffffffff81669e15>] ? nmi_handle.isra.3+0x5/0x3b0
[ 7435.179299]  [<ffffffff8166a349>] do_nmi+0x189/0x340
[ 7435.184339]  [<ffffffff81669507>] end_repeat_nmi+0x1e/0x2e
[ 7435.189910]  [<ffffffff814119f0>] ? __ndelay+0x30/0x30
[ 7435.195127]  [<ffffffff814119f0>] ? __ndelay+0x30/0x30
[ 7435.200348]  [<ffffffff814119f0>] ? __ndelay+0x30/0x30
[ 7435.205565]  <<EOE>>  <IRQ>  [<ffffffff8141194f>] ? __delay+0xf/0x20
[ 7435.212070]  [<ffffffff810b2ed1>] do_raw_spin_lock+0xe1/0x140
[ 7435.217904]  [<ffffffff81667fd9>] _raw_spin_lock_irqsave+0x69/0x90
[ 7435.224182]  [<ffffffff8156b540>] ? add_unmap+0x20/0xd0
[ 7435.229482]  [<ffffffff8156b540>] add_unmap+0x20/0xd0
[ 7435.234605]  [<ffffffff8156c816>] intel_unmap_page.part.44+0x96/0x110
[ 7435.241140]  [<ffffffff8156d526>] intel_unmap_page+0x26/0x30
[ 7435.246895]  [<ffffffffa04a5c30>] igb_clean_rx_irq+0x520/0x900 [igb]
[ 7435.253345]  [<ffffffffa04a63d7>] igb_poll+0x3c7/0x7d0 [igb]
[ 7435.259095]  [<ffffffff8158f51f>] ? __napi_schedule+0x5f/0x70
[ 7435.264922]  [<ffffffff81595935>] net_rx_action+0xb5/0x290
[ 7435.270490]  [<ffffffff8105436e>] __do_softirq+0x12e/0x430
[ 7435.276054]  [<ffffffff81054916>] irq_exit+0x96/0xc0
[ 7435.281097]  [<ffffffff816743f8>] do_IRQ+0x58/0xf0
[ 7435.285960]  [<ffffffff81668e6f>] common_interrupt+0x6f/0x6f
[ 7435.291703]  <EOI>  [<ffffffff810aa17d>] ? trace_hardirqs_on+0xd/0x10
[ 7435.298274]  [<ffffffff8100c1e6>] ? default_idle+0x26/0x210
[ 7435.303925]  [<ffffffff8100c1e4>] ? default_idle+0x24/0x210
[ 7435.312459]  [<ffffffff8100cc46>] arch_cpu_idle+0x26/0x30
[ 7435.320865]  [<ffffffff810c01d5>] cpu_startup_entry+0x185/0x3e0
[ 7435.329765]  [<ffffffff8102f788>] start_secondary+0x1c8/0x2f0
[ 7436.461065] Shutting down cpus with NMI
[ 7436.468065] Kernel Offset: 0x0 from 0xffffffff81000000 (relocation range: 0xffffffff80000000-0xffffffff9fffffff)
[ 7436.481187] drm_kms_helper: panic occurred, switching back to text console
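
As a reading aid for the trace above, here is a minimal Python sketch that lists only the frames not marked with "?". In kernel stack dumps a "?" marks an address found on the stack that the unwinder could not confirm as part of the call chain. The helper name reliable_frames and the regular expression are illustrative assumptions, not part of any kernel tooling, and the sample is a short excerpt of the log above.

```python
import re

# Matches frames such as "[<ffffffff8156b540>] ? add_unmap+0x20/0xd0 [igb]".
# Group 1: the optional "?" marker, group 2: function name, group 3: module.
FRAME_RE = re.compile(
    r"\[<[0-9a-f]+>\]\s+(\?\s+)?([A-Za-z0-9_.]+)"
    r"\+0x[0-9a-f]+/0x[0-9a-f]+(?:\s+\[(\w+)\])?"
)

def reliable_frames(log_text):
    """Return (function, module) for each frame that is not marked '?'.

    Hypothetical helper for illustration; module is None for built-in code.
    """
    frames = []
    for line in log_text.splitlines():
        m = FRAME_RE.search(line)
        if m and m.group(1) is None:
            frames.append((m.group(2), m.group(3)))
    return frames

# Excerpt of the <IRQ> part of the trace in this report.
sample = """\
[ 7435.212070]  [<ffffffff810b2ed1>] do_raw_spin_lock+0xe1/0x140
[ 7435.217904]  [<ffffffff81667fd9>] _raw_spin_lock_irqsave+0x69/0x90
[ 7435.224182]  [<ffffffff8156b540>] ? add_unmap+0x20/0xd0
[ 7435.229482]  [<ffffffff8156b540>] add_unmap+0x20/0xd0
[ 7435.234605]  [<ffffffff8156c816>] intel_unmap_page.part.44+0x96/0x110
[ 7435.246895]  [<ffffffffa04a5c30>] igb_clean_rx_irq+0x520/0x900 [igb]
"""

for func, module in reliable_frames(sample):
    print(func, f"[{module}]" if module else "")
```

On this excerpt the confirmed frames run from igb_clean_rx_irq in the igb module through intel_unmap_page and add_unmap into do_raw_spin_lock, which is where CPU 1 was spinning when the watchdog fired.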
Comment 1 Eric Sandeen 2016-09-22 16:24:40 UTC
I'm batch-closing all xfs bugs which are more than 1 year old, sorry about that.

If you still have this issue on a current kernel, please retest and re-open with this information.

Thanks,
-Eric
