Bug 76411 - Kernel panic: Watchdog detected hard LOCKUP on cpu 0 - freezes the whole machine
Summary: Kernel panic: Watchdog detected hard LOCKUP on cpu 0 - freezes the whole machine
Status: RESOLVED OBSOLETE
Alias: None
Product: File System
Classification: Unclassified
Component: btrfs (show other bugs)
Hardware: All Linux
: P1 normal
Assignee: Josef Bacik
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2014-05-18 15:50 UTC by Tormen
Modified: 2022-10-03 15:01 UTC (History)
2 users (show)

See Also:
Kernel Version: 3.14.4
Subsystem:
Regression: No
Bisected commit-id:


Attachments
Screenshot of the kernel panic (775.89 KB, image/jpeg)
2014-05-18 16:43 UTC, Tormen
Details

Description Tormen 2014-05-18 15:50:44 UTC
Hi, 

I tried with debian linux-image 3.13, then with latest debian 3.14.4, same problem:

I always get a kernel panic " Watchdog detected hard LOCKUP on cpu 0 " after some while when using btrfs and rsyncing (rsync -ax) 2TB (from an xfs partition onto a 3TB btrfs parition (created with kernel 3.14.4, mkfs.btrfs v0.19) mounted with compress=zlib.

When using xfs, everything works as it should (same machine, same source, same target, same kernel, same rsync command.

The partition contains backup data so MANY smaller files.

The kernel panic locks up the machine completely (even NumLock disabled), so the machine is really DEAD ;)
Traces of the kernel panic are ONLY printed to screen, they did not even get through /dev/kmsg via nc into a logfile on another machine of /dev/kmsg
I took a photo if your interested.

Not sure how to further debug this.


Tormen
Comment 1 Tormen 2014-05-18 15:52:02 UTC
The xfs partition is plain vanilla (mkfs.xfs /dev/xxx, no special mount options what so ever).
Comment 2 Tormen 2014-05-18 16:43:28 UTC
Created attachment 136631 [details]
Screenshot of the kernel panic
Comment 3 Tormen 2014-05-18 16:47:39 UTC
I know that in the screenshot it does not say anything about btrfs.
But I tried it 3 times: 2x with 3.13 1x with 3.14.4 on an otherwise IDLE server and xfs -> xfs worked right away (and is still running ;), so this  +  the fact that I NEVER saw this kernel panic before the moment I tried btrfs clearly points to btrfs as "guilty" variable in the equation.

My guess: A combination of the amount of data + SOOO many little files + compress=zlib.

But you tell me ;)

Thanks again in advance.

Let me know if you'll need anything. As it's only my backup data, I can still do tests for you on a spare drive.
Comment 4 Tormen 2014-05-18 16:48:18 UTC
(and the 3 times it always ended in the screenshot I attached)
Comment 5 David Sterba 2022-10-03 15:01:18 UTC
This is a semi-automated bugzilla cleanup, report is against an old kernel version. If the problem still happens, please open a new bug. Thanks.

Note You need to log in before you can comment on or make changes to this bug.