Bug 105121

Summary: lseek(SEEK_DATA) hangs for a long time for sparse files in the page cache
Product: File System Reporter: ValdikSS (iam)
Component: ext4Assignee: fs_ext4 (fs_ext4)
Status: RESOLVED CODE_FIX    
Severity: normal CC: grossws, szg00000, timur.safin, tytso
Priority: P1    
Hardware: All   
OS: Linux   
Kernel Version: 4.2.1 Subsystem:
Regression: No Bisected commit-id:
Attachments: lseek test program

Description ValdikSS 2015-09-28 12:27:05 UTC
Created attachment 188751 [details]
lseek test program

lseek() call hangs for a long time on fallocate allocated files on ext4.

Steps to reproduce:
1. gcc -o prog lseek-fallocate.c
2. fallocate -l 1G test
3. cat test > /dev/null
4. ./prog

Actual result:
prog hangs for 2 minutes on my system while doing lseek().

Expected result:
lseek() instantly return ENXIO.

Usually lseek works much faster if file wasn't cat'ed before lseek.
Works properly on btrfs.
Comment 1 Theodore Tso 2015-09-28 14:47:15 UTC
Thanks for reporting this!

What's going on is that the "cat test > /dev/null" is instantiating 1GB of zero pages in the page cache.

Currently, we are using the extent status to determine if a logical block is mapped to a physical block.  However, the SEEK_DATA code dates back from a time when we were not storing the status of delayed allocation blocks in the extent status cache.  So if the extent status cache indicates that the blocks are unwritten, the code which is handling fseek(SEEK_DATA) is scanning all of the pages to determine if any of the unwritten blocks happen to be modified in memory, but which haven't been pushed out to disk yet.

We should be able to optimize this (and simplify the code as a bonus) by using the EXTENT_STATUS_DELAYED flag instead of trying to scan through all of the page structs (with all of the locking requirements this entails).

Out of curiosity, what was the use case that caused you to notice this?
Comment 2 ValdikSS 2015-09-28 14:52:14 UTC
>Out of curiosity, what was the use case that caused you to notice this?

A person was wondering why cat|grep works faster than grep file on usual files and his test case involved creating file with fallocate and cat it before grepping. He was apparently used btrfs and had 2 seconds difference while it hang for me as grep does literally what's in the attached source.
Comment 3 Theodore Tso 2015-09-28 15:15:29 UTC
This implies that grep is using lseek(SEEK_DATA) as an optimization when users use grep on sparse files.   So I'm guessing this is a Thing, but I'm at a loss why people are interested in running grep on a sparse file (with or without blocks preallocated using fallocate).   Can you enlighten me as to why people (or at least you and your colleague) find it useful to run grep on such files?

Not that it matters since this is a pretty clear optimization we should add to ext4; I'm just curious what the use case is.

Thanks!!
Comment 4 ValdikSS 2015-09-28 15:23:46 UTC
Nobody really uses grep on sparse files. It was just a discussion on why many people use cat|grep instead of just using 'grep file', as pipes are usually slower. Somebody said that he uses cat|grep because it's actually faster than 'grep file' and made this test case. fallocate was used just to make big enough file to tell a difference in seconds. His result was 1 second with cat|grep and 3 seconds with grep file, but he was running btrfs. When I tried his test case, it hang grep so hard I couldn't kill it with SIGKILL even if it was in 'running' state. I took a look in strace what's going on and why it is stalled and filled this bug.

Anyway, thanks for confirmation and acknowledgment this problem!
Comment 5 ValdikSS 2019-01-18 07:44:16 UTC
Seems fixed as for 4.19.13.