Bug 34932 - Freezing of tasks failed. fuse_request_send
Summary: Freezing of tasks failed. fuse_request_send
Status: RESOLVED OBSOLETE
Alias: None
Product: File System
Classification: Unclassified
Component: VFS (show other bugs)
Hardware: All Linux
: P1 normal
Assignee: fs_vfs
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2011-05-11 16:49 UTC by Witold Baryluk
Modified: 2023-04-03 16:50 UTC (History)
5 users (show)

See Also:
Kernel Version: 2.6.39-rc7-00708-g9f381a6-dirty
Subsystem:
Regression: No
Bisected commit-id:


Attachments
kernel config (85.24 KB, application/octet-stream)
2011-05-11 16:49 UTC, Witold Baryluk
Details

Description Witold Baryluk 2011-05-11 16:49:25 UTC
I hit problem when trying to suspend my laptop on 2.6.39-rc7-00708-g9f381a6-dirty



[32702.942975] PM: Syncing filesystems ... 
[32703.381941] EXT4-fs (dm-2): re-mounted. Opts: barrier=1,user_xattr,acl,commit=0,commit=600
[32704.209273] EXT4-fs (dm-3): re-mounted. Opts: barrier=1,user_xattr,acl,commit=0,commit=600
[32704.247851] EXT4-fs (dm-6): re-mounted. Opts: barrier=1,user_xattr,acl,commit=0,commit=600
[32704.403624] done.
[32704.522217] Freezing user space processes ... 
[32724.538322] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0):
[32724.540833] as              D 000005cb  6720 15772  15770 0x00800084
[32724.542991]  c61abcf0 00000096 00000000 000005cb 00000000 c38561a0 c73da000 c1b346c0
[32724.545228]  c1b346c0 c1b346c0 c1b346c0 00000000 00000000 c1b346c0 f4a45300 00001dbe
[32724.547470]  c38561a0 c1097e23 c38561a0 c1787985 c2048060 00000001 00000286 00000002
[32724.549768] Call Trace:
[32724.551989]  [<c1097e23>] ? mark_held_locks+0x53/0x70
[32724.554251]  [<c1787985>] ? _raw_spin_unlock_irqrestore+0x65/0x80
[32724.556521]  [<c178aed5>] ? sub_preempt_count+0x85/0xc0
[32724.558761]  [<c1787964>] ? _raw_spin_unlock_irqrestore+0x44/0x80
[32724.561045]  [<c107dc48>] ? prepare_to_wait+0x48/0x70
[32724.563338]  [<c128c245>] request_wait_answer+0xa5/0x1f0
[32724.565671]  [<c107d9e0>] ? wake_up_bit+0x30/0x30
[32724.568020]  [<c128c420>] fuse_request_send+0x90/0xd0
[32724.570382]  [<c128fde1>] fuse_create_open+0x1f1/0x430
[32724.572782]  [<c1290a28>] ? fuse_permission+0x38/0x220
[32724.575186]  [<c129007a>] fuse_create+0x5a/0x70
[32724.577620]  [<c115d618>] vfs_create+0x98/0xf0
[32724.580074]  [<c178aed5>] ? sub_preempt_count+0x85/0xc0
[32724.582505]  [<c115dc75>] do_last+0x605/0x760
[32724.584929]  [<c115df17>] path_openat+0x97/0x370
[32724.587362]  [<c115e225>] do_filp_open+0x35/0x80
[32724.589796]  [<c17874fc>] ? _raw_spin_unlock+0x2c/0x50
[32724.592220]  [<c116bb6f>] ? alloc_fd+0x15f/0x1b0
[32724.594612]  [<c114e617>] do_sys_open+0xf7/0x1c0
[32724.597033]  [<c114e70e>] sys_open+0x2e/0x40
[32724.599433]  [<c178e858>] sysenter_do_call+0x12/0x38
[32724.602014] 
[32724.604367] Restarting tasks ... done.
...




trying again few seconds later

[32841.332711] PM: Syncing filesystems ... done.
[32841.872070] Freezing user space processes ... 
[32849.760179] thinkpad_acpi: fan watchdog: enabling fan
[32861.876252] 
[32861.878583] Freezing of tasks failed after 20.00 seconds (1 tasks refusing to freeze, wq_busy=0):
[32861.881399] git             D f4125c00  6608 17509  17508 0x00800084
[32861.883825]  c73d7dac 00000086 c178aed5 f4125c00 c73d7d34 f431cddc f4006000 c1b346c0
[32861.886312]  c1b346c0 c1b346c0 c1b346c0 00000000 00000000 c1b346c0 e93e77c0 00001dde
[32861.888848]  f431c960 00000000 00000002 00000046 00000000 00000001 d13521a4 c73d7da4
[32861.891411] Call Trace:
[32861.893948]  [<c178aed5>] ? sub_preempt_count+0x85/0xc0
[32861.896512]  [<c108aa16>] ? ktime_get_ts+0xf6/0x120
[32861.899044]  [<c1784cd6>] io_schedule+0x76/0xc0
[32861.901576]  [<c10feb3d>] sleep_on_page+0xd/0x20
[32861.904086]  [<c178568d>] __wait_on_bit+0x4d/0x80
[32861.906550]  [<c10feb30>] ? __lock_page+0xc0/0xc0
[32861.909001]  [<c10fed1e>] wait_on_page_bit+0xbe/0xd0
[32861.911436]  [<c107da30>] ? autoremove_wake_function+0x50/0x50
[32861.913889]  [<c1101077>] __lock_page_or_retry+0x47/0x70
[32861.916331]  [<c11013b1>] filemap_fault+0x311/0x420
[32861.918745]  [<c111e6e1>] ? __do_fault+0x1f1/0x4d0
[32861.921166]  [<c111e53c>] __do_fault+0x4c/0x4d0
[32861.923567]  [<c1096b65>] ? __lock_acquire+0x375/0x7f0
[32861.925992]  [<c1121b17>] handle_pte_fault+0x87/0x240
[32861.928423]  [<c112296c>] handle_mm_fault+0x13c/0x260
[32861.930808]  [<c178aa4a>] do_page_fault+0x11a/0x520
[32861.933225]  [<c10830a4>] ? hrtimer_interrupt+0x154/0x280
[32861.935619]  [<c17881df>] ? error_code+0x5b/0x64
[32861.937997]  [<c178a930>] ? vmalloc_fault+0x1b0/0x1b0
[32861.940369]  [<c178a930>] ? vmalloc_fault+0x1b0/0x1b0
[32861.942725]  [<c17881e3>] error_code+0x5f/0x64
[32861.945075]  [<c178a930>] ? vmalloc_fault+0x1b0/0x1b0
[32861.947408] 
[32861.949710] Restarting tasks ... done.


and again one more time

[32910.258453] PM: Syncing filesystems ... done.
[32910.469287] Freezing user space processes ... 
[32917.088177] thinkpad_acpi: fan watchdog: enabling fan
[32930.484250] 
[32930.486472] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0):
[32930.489180] git             D 00000000  6412 18636  17504 0x00800084
[32930.491484]  c3977d14 00000096 00000001 00000000 00000000 c3824120 c895c000 c1b346c0
[32930.493843]  c1b346c0 c1b346c0 c1b346c0 00000000 00000000 c1b346c0 e93e7a40 00001dee
[32930.496209]  c3824120 00000286 c3824120 c1787985 c2048060 00000001 00000286 00000002
[32930.498584] Call Trace:
[32930.500910]  [<c1787985>] ? _raw_spin_unlock_irqrestore+0x65/0x80
[32930.503258]  [<c178aed5>] ? sub_preempt_count+0x85/0xc0
[32930.505606]  [<c1787964>] ? _raw_spin_unlock_irqrestore+0x44/0x80
[32930.507957]  [<c107dc48>] ? prepare_to_wait+0x48/0x70
[32930.510315]  [<c128c245>] request_wait_answer+0xa5/0x1f0
[32930.512676]  [<c107d9e0>] ? wake_up_bit+0x30/0x30
[32930.515023]  [<c128c420>] fuse_request_send+0x90/0xd0
[32930.517389]  [<c128e747>] fuse_dentry_revalidate+0x1a7/0x320
[32930.519752]  [<c115b4a8>] do_lookup+0x148/0x250
[32930.522120]  [<c115c62f>] path_lookupat+0x12f/0x6a0
[32930.524484]  [<c111dfc1>] ? might_fault+0x91/0xa0
[32930.526830]  [<c111df7b>] ? might_fault+0x4b/0xa0
[32930.529183]  [<c115cbcc>] do_path_lookup+0x2c/0xb0
[32930.531525]  [<c115ceab>] user_path_at+0x3b/0x70
[32930.533879]  [<c11542a5>] vfs_fstatat+0x55/0xa0
[32930.536221]  [<c1154310>] vfs_lstat+0x20/0x30
[32930.538547]  [<c11545d9>] sys_lstat64+0x19/0x30
[32930.540886]  [<c10c012f>] ? audit_syscall_entry+0x1ef/0x210
[32930.543220]  [<c1097f6c>] ? trace_hardirqs_on_caller+0x12c/0x170
[32930.545576]  [<c178e858>] sysenter_do_call+0x12/0x38
[32930.547932] 
[32930.550254] Restarting tasks ... done.



All this processes was working on the zfs-fuse filesystem compiling kernel. I canceled make or git respectivly. And i could easly suspend and resume later. So quite easly reproductible.

kernel config attached.

compiled on i386, pentium-m, Debian GNU/Linux unstable. gcc 4.6.0-7. -dirty only because of changed Makefile to use gcc-4.6.

Thanks.
Comment 1 Witold Baryluk 2011-05-11 16:49:50 UTC
Created attachment 57372 [details]
kernel config
Comment 2 Witold Baryluk 2011-05-11 16:51:03 UTC
This build also have revalidate https://bugzilla.kernel.org/show_bug.cgi?id=34732 (fuse_dentry_revalidate  NULL dereference) patch applied.
Comment 3 Witold Baryluk 2011-05-16 22:27:59 UTC
One more on  2.6.39-rc7-thinkpad-t43-00829-geed631e-dirty

[ 9383.111642] Freezing user space processes ... 
[ 9400.320741] thinkpad_acpi: fan watchdog: enabling fan
[ 9403.120228] 
[ 9403.122551] Freezing of tasks failed after 20.00 seconds (1 tasks refusing to freeze, wq_busy=0):
[ 9403.125748] conf            D f1baaecc  6432 24943  24886 0x00800084
[ 9403.128181]  f235fd7c 00000086 8179d659 f1baaecc f235fd08 8179d704 ab628371 00000888
[ 9403.130739]  f235fd28 81c94900 81c94900 00000000 00000888 f1baaa40 b77a3f60 00000000
[ 9403.133290]  00000000 f1baaa40 00000001 8107bd68 001aa17c f235fd74 81088dca 0500bd84
[ 9403.135874] Call Trace:
[ 9403.138391]  [<8179d659>] ? sub_preempt_count.part.170+0x9/0x90
[ 9403.140964]  [<8179d704>] ? sub_preempt_count+0x24/0x50
[ 9403.143478]  [<8107bd68>] ? prepare_to_wait_exclusive+0x48/0x80
[ 9403.146026]  [<81088dca>] ? ktime_get_ts+0xea/0x120
[ 9403.148604]  [<81797438>] io_schedule+0x78/0xb0
[ 9403.151225]  [<810fdaad>] sleep_on_page_killable+0xd/0x40
[ 9403.153853]  [<81797ca7>] __wait_on_bit_lock+0x47/0xa0
[ 9403.156507]  [<810fdaa0>] ? sleep_on_page+0x20/0x20
[ 9403.159186]  [<810fd9c7>] __lock_page_killable+0xa7/0xb0
[ 9403.161932]  [<8107bfa0>] ? autoremove_wake_function+0x50/0x50
[ 9403.164684]  [<810ff2ef>] do_generic_file_read.constprop.33+0x2ef/0x4f0
[ 9403.167477]  [<81083955>] ? local_clock+0x65/0x70
[ 9403.170267]  [<811001a4>] generic_file_aio_read+0xd4/0x290
[ 9403.173035]  [<810114c8>] ? sched_clock+0x8/0x10
[ 9403.175816]  [<8108360f>] ? sched_clock_local+0xcf/0x1d0
[ 9403.178601]  [<81291fd4>] fuse_file_aio_read+0xc4/0xf0
[ 9403.181384]  [<8114f00c>] do_sync_read+0xac/0xe0
[ 9403.184126]  [<812b474c>] ? security_file_permission+0x8c/0xa0
[ 9403.186912]  [<8114f34c>] ? rw_verify_area+0x6c/0x120
[ 9403.189688]  [<810942a2>] ? check_flags+0x12/0x20
[ 9403.192450]  [<8114f82c>] vfs_read+0x8c/0x160
[ 9403.195156]  [<8114ef60>] ? do_sync_write+0xe0/0xe0
[ 9403.197844]  [<8114f93d>] sys_read+0x3d/0x70
[ 9403.200524]  [<817a1098>] sysenter_do_call+0x12/0x38
[ 9403.203293] 
[ 9403.205884] Restarting tasks ... done.
[ 9403.299179] video LNXVIDEO:00: Restoring backlight state

It happens only when i have fuse file system busy (i.e. some process is doing something heavy on fuse filesystem and I'm trying to suspend computer).
Comment 4 Witold Baryluk 2011-05-16 22:49:31 UTC
On, more.

[ 9907.601105] Freezing user space processes ... 
[ 9909.280188] thinkpad_acpi: fan watchdog: enabling fan
[ 9927.620460] 
[ 9927.624458] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0):
[ 9927.630280] git             D 8179d659  5720 30852  30851 0x00800084
[ 9927.634513]  ef75ddb4 00000082 ef75dd38 8179d659 b770aecc ef75dd44 ca2f065f 00000902
[ 9927.638879]  8109173d 81c94900 81c94900 00000000 00000902 b770aa40 b7589520 ef75dd7c
[ 9927.643379]  00000000 00000000 b770aa40 00000001 001c3a40 ef75ddac 81088dca 22f7d887
[ 9927.647841] Call Trace:
[ 9927.652236]  [<8179d659>] ? sub_preempt_count.part.170+0x9/0x90
[ 9927.656673]  [<8109173d>] ? put_lock_stats.isra.20+0xd/0x30
[ 9927.661120]  [<81088dca>] ? ktime_get_ts+0xea/0x120
[ 9927.665564]  [<81797438>] io_schedule+0x78/0xb0
[ 9927.670088]  [<810fda8d>] sleep_on_page+0xd/0x20
[ 9927.674617]  [<81797def>] __wait_on_bit+0x4f/0x80
[ 9927.679177]  [<810fda80>] ? __lock_page+0xb0/0xb0
[ 9927.683825]  [<810fdc63>] wait_on_page_bit+0xb3/0xc0
[ 9927.688477]  [<8107bfa0>] ? autoremove_wake_function+0x50/0x50
[ 9927.693244]  [<811004a7>] __lock_page_or_retry+0x47/0x70
[ 9927.698017]  [<81100816>] filemap_fault+0x346/0x3b0
[ 9927.702872]  [<811004d0>] ? __lock_page_or_retry+0x70/0x70
[ 9927.707675]  [<8111d0ef>] __do_fault+0x4f/0x4d0
[ 9927.712476]  [<811004d0>] ? __lock_page_or_retry+0x70/0x70
[ 9927.717308]  [<81120497>] handle_pte_fault+0x87/0x240
[ 9927.722171]  [<81120818>] handle_mm_fault+0x138/0x260
[ 9927.726934]  [<8179d247>] do_page_fault+0x117/0x520
[ 9927.731709]  [<812940f0>] ? fuse_flush+0x40/0x140
[ 9927.736491]  [<8179aa2f>] ? error_code+0x5b/0x64
[ 9927.741285]  [<8179d130>] ? vmalloc_fault+0x190/0x190
[ 9927.746005]  [<8179d130>] ? vmalloc_fault+0x190/0x190
[ 9927.750679]  [<8179aa33>] error_code+0x5f/0x64
[ 9927.755260]  [<8179d130>] ? vmalloc_fault+0x190/0x190
[ 9927.759879] 
[ 9927.762581] Restarting tasks ... done.

and one more (same?)

[10109.434374] Freezing user space processes ... 
[10111.840098] thinkpad_acpi: fan watchdog: enabling fan
[10129.450245] 
[10129.452542] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0):
[10129.455775] git             D 8179d659  6252 32330  32326 0x00800084
[10129.458196]  f1befdb4 00000082 f1befd38 8179d659 f0a219ac f1befd44 c9103491 00000931
[10129.460737]  8109173d 81c94900 81c94900 00000000 00000931 f0a21520 ee078000 f1befd7c
[10129.463286]  00000000 00000000 f0a21520 00000001 001cd784 f1befdac 81088dca 19b59094
[10129.465860] Call Trace:
[10129.468353]  [<8179d659>] ? sub_preempt_count.part.170+0x9/0x90
[10129.470899]  [<8109173d>] ? put_lock_stats.isra.20+0xd/0x30
[10129.473445]  [<81088dca>] ? ktime_get_ts+0xea/0x120
[10129.476013]  [<81797438>] io_schedule+0x78/0xb0
[10129.478597]  [<810fda8d>] sleep_on_page+0xd/0x20
[10129.481234]  [<81797def>] __wait_on_bit+0x4f/0x80
[10129.483864]  [<810fda80>] ? __lock_page+0xb0/0xb0
[10129.486513]  [<810fdc63>] wait_on_page_bit+0xb3/0xc0
[10129.489201]  [<8107bfa0>] ? autoremove_wake_function+0x50/0x50
[10129.491949]  [<811004a7>] __lock_page_or_retry+0x47/0x70
[10129.494698]  [<81100816>] filemap_fault+0x346/0x3b0
[10129.497425]  [<811004d0>] ? __lock_page_or_retry+0x70/0x70
[10129.500167]  [<8111d0ef>] __do_fault+0x4f/0x4d0
[10129.502908]  [<811004d0>] ? __lock_page_or_retry+0x70/0x70
[10129.505656]  [<81120497>] handle_pte_fault+0x87/0x240
[10129.508376]  [<81120818>] handle_mm_fault+0x138/0x260
[10129.511096]  [<8179d247>] do_page_fault+0x117/0x520
[10129.513814]  [<810813a1>] ? hrtimer_interrupt+0x151/0x280
[10129.516531]  [<8179d69d>] ? sub_preempt_count.part.170+0x4d/0x90
[10129.519244]  [<8179aa2f>] ? error_code+0x5b/0x64
[10129.521952]  [<8179d130>] ? vmalloc_fault+0x190/0x190
[10129.524607]  [<8179d130>] ? vmalloc_fault+0x190/0x190
[10129.527213]  [<8179aa33>] error_code+0x5f/0x64
[10129.529794]  [<8179d130>] ? vmalloc_fault+0x190/0x190
[10129.532430] 
[10129.535002] Restarting tasks ... done.

And one more (different again):

[10216.592699] Freezing user space processes ... 
[10234.880096] thinkpad_acpi: fan watchdog: enabling fan
[10236.610246] 
[10236.612850] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0):
[10236.616459] git             D 8179d659  6140  1900  32304 0x00800084
[10236.619227]  f0afbdb4 00000082 f0afbd38 8179d659 8c0eaecc f0afbd44 bc523cf1 0000094a
[10236.622089]  8109173d 81c94900 81c94900 00000000 0000094a 8c0eaa40 eca9aa40 f0afbd7c
[10236.624929]  00000000 00000000 8c0eaa40 00000001 001d2b34 f0afbdac 81088dca 23452c4c
[10236.627765] Call Trace:
[10236.630540]  [<8179d659>] ? sub_preempt_count.part.170+0x9/0x90
[10236.633302]  [<8109173d>] ? put_lock_stats.isra.20+0xd/0x30
[10236.636066]  [<81088dca>] ? ktime_get_ts+0xea/0x120
[10236.638805]  [<81797438>] io_schedule+0x78/0xb0
[10236.641580]  [<810fda8d>] sleep_on_page+0xd/0x20
[10236.644305]  [<81797def>] __wait_on_bit+0x4f/0x80
[10236.647015]  [<810fda80>] ? __lock_page+0xb0/0xb0
[10236.649709]  [<810fdc63>] wait_on_page_bit+0xb3/0xc0
[10236.652441]  [<8107bfa0>] ? autoremove_wake_function+0x50/0x50
[10236.655162]  [<811004a7>] __lock_page_or_retry+0x47/0x70
[10236.657892]  [<81100816>] filemap_fault+0x346/0x3b0
[10236.660633]  [<811004d0>] ? __lock_page_or_retry+0x70/0x70
[10236.663331]  [<8111d0ef>] __do_fault+0x4f/0x4d0
[10236.666019]  [<811004d0>] ? __lock_page_or_retry+0x70/0x70
[10236.668740]  [<81120497>] handle_pte_fault+0x87/0x240
[10236.671470]  [<81120818>] handle_mm_fault+0x138/0x260
[10236.674150]  [<8179d247>] do_page_fault+0x117/0x520
[10236.676819]  [<81154bdb>] ? sys_lstat64+0x2b/0x30
[10236.679470]  [<8179aa2f>] ? error_code+0x5b/0x64
[10236.682140]  [<8179d130>] ? vmalloc_fault+0x190/0x190
[10236.684760]  [<8179d130>] ? vmalloc_fault+0x190/0x190
[10236.687349]  [<8179aa33>] error_code+0x5f/0x64
[10236.689926]  [<8179d130>] ? vmalloc_fault+0x190/0x190
[10236.692501] 
[10236.694967] Restarting tasks ... done.


Interesingly, I have this problem only when STARTING compilation of kernel. Most problematic for some reason is git command (git describe --exact-match) and conf. When this commands are in progress, then i cannot suspend. But when they end, and normal compilation took place (still with pretty big disk usage), I have no problem suspending machine (10 success in 10 tries).

But when i put really big load on machine (make -j6), then also can reproduce this when compiling: 

[10345.074878] PM: Syncing filesystems ... done.
[10350.738837] Freezing user space processes ... 
[10370.754726] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0):
[10370.760811] ld              D 8179d659  6424  4532   4528 0x00800084
[10370.765250]  941cfdb4 00000082 941cfd38 8179d659 f38f6e2c 941cfd44 f6d76dfa 00000969
[10370.769809]  8109173d 81c94900 81c94900 00000000 00000969 f38f69a0 b7589520 941cfd7c
[10370.774510]  00000000 00000000 f38f69a0 00000001 001d93fa 941cfdac 81088dca 2ac4aabc
[10370.779148] Call Trace:
[10370.783733]  [<8179d659>] ? sub_preempt_count.part.170+0x9/0x90
[10370.788360]  [<8109173d>] ? put_lock_stats.isra.20+0xd/0x30
[10370.793045]  [<81088dca>] ? ktime_get_ts+0xea/0x120
[10370.797682]  [<81797438>] io_schedule+0x78/0xb0
[10370.802341]  [<810fda8d>] sleep_on_page+0xd/0x20
[10370.806943]  [<81797def>] __wait_on_bit+0x4f/0x80
[10370.811579]  [<810fda80>] ? __lock_page+0xb0/0xb0
[10370.816158]  [<810fdc63>] wait_on_page_bit+0xb3/0xc0
[10370.820774]  [<8107bfa0>] ? autoremove_wake_function+0x50/0x50
[10370.825366]  [<811004a7>] __lock_page_or_retry+0x47/0x70
[10370.829953]  [<81100816>] filemap_fault+0x346/0x3b0
[10370.834603]  [<811004d0>] ? __lock_page_or_retry+0x70/0x70
[10370.839221]  [<8111d0ef>] __do_fault+0x4f/0x4d0
[10370.843868]  [<811004d0>] ? __lock_page_or_retry+0x70/0x70
[10370.848482]  [<81120497>] handle_pte_fault+0x87/0x240
[10370.853124]  [<81120818>] handle_mm_fault+0x138/0x260
[10370.857713]  [<8179d247>] do_page_fault+0x117/0x520
[10370.862384]  [<8179aa2f>] ? error_code+0x5b/0x64
[10370.867012]  [<8179d130>] ? vmalloc_fault+0x190/0x190
[10370.871710]  [<8179d130>] ? vmalloc_fault+0x190/0x190
[10370.876288]  [<8179aa33>] error_code+0x5f/0x64
[10370.880902]  [<8179d130>] ? vmalloc_fault+0x190/0x190
[10370.885571] 
[10370.888072] Restarting tasks ... done.


In fact I could reproduce it also using cat *somebig_files* > /dev/null

[10561.423121] PM: Syncing filesystems ... done.
[10561.874139] Freezing user space processes ... 
[10571.360097] thinkpad_acpi: fan watchdog: enabling fan
[10581.890234] 
[10581.892674] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0):
[10581.895973] cat             D 8c0eaecc  6188  5057  14924 0x00800084
[10581.898497]  f1dd1d7c 00000086 8179d659 8c0eaecc f1dd1d08 8179d704 20ac0580 0000099b
[10581.901104]  f1dd1d28 81c94900 81c94900 00000000 0000099b 8c0eaa40 b758d480 00000000
[10581.903681]  00000000 8c0eaa40 00000001 8107bd68 001e38f0 f1dd1d74 81088dca 3404bc3c
[10581.906282] Call Trace:
[10581.908817]  [<8179d659>] ? sub_preempt_count.part.170+0x9/0x90
[10581.911419]  [<8179d704>] ? sub_preempt_count+0x24/0x50
[10581.913983]  [<8107bd68>] ? prepare_to_wait_exclusive+0x48/0x80
[10581.916562]  [<81088dca>] ? ktime_get_ts+0xea/0x120
[10581.919145]  [<81797438>] io_schedule+0x78/0xb0
[10581.921744]  [<810fdaad>] sleep_on_page_killable+0xd/0x40
[10581.924318]  [<81797ca7>] __wait_on_bit_lock+0x47/0xa0
[10581.926888]  [<810fdaa0>] ? sleep_on_page+0x20/0x20
[10581.929458]  [<810fd9c7>] __lock_page_killable+0xa7/0xb0
[10581.932062]  [<8107bfa0>] ? autoremove_wake_function+0x50/0x50
[10581.934645]  [<810ff2ef>] do_generic_file_read.constprop.33+0x2ef/0x4f0
[10581.937241]  [<811001a4>] generic_file_aio_read+0xd4/0x290
[10581.939836]  [<81291fd4>] fuse_file_aio_read+0xc4/0xf0
[10581.942475]  [<8114f00c>] do_sync_read+0xac/0xe0
[10581.945092]  [<812b474c>] ? security_file_permission+0x8c/0xa0
[10581.947727]  [<8114f34c>] ? rw_verify_area+0x6c/0x120
[10581.950377]  [<810942a2>] ? check_flags+0x12/0x20
[10581.953003]  [<8114f82c>] vfs_read+0x8c/0x160
[10581.955648]  [<8114ef60>] ? do_sync_write+0xe0/0xe0
[10581.958298]  [<8114f93d>] sys_read+0x3d/0x70
[10581.960960]  [<817a1098>] sysenter_do_call+0x12/0x38
[10581.963645] 
[10581.966249] Restarting tasks ... done.


I know some time ago, there was similar problem with sshfs. One could not suspend when sshfs was mounted. But i had no such problem on zfs-fuse previously. Also on sshfs it was that no file needed to be opened or operated on, it was sufficient to have sshfs mounted to prevent suspending. Here, one need to put considerable load to trigger some kind of race-condition.

AFAIK suspending logic is not sufficient. First kernel freezes all processes, but then it tries to perform something with filesystem, probably caches, but cannot do anything as process (zfs-fuse), is already freezed.
Comment 5 Witold Baryluk 2011-06-09 05:43:04 UTC
Still happens on 3.0-rc2, hovewer kernel message is different

[43431.608377] PM: Syncing filesystems ... done.
[43431.921563] PM: Preparing system for mem sleep
[43432.449170] Freezing user space processes ... 
[43452.466230] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0):
[43452.468573] cat             D 00002780     0 22561  16299 0x00800084
[43452.470838]  edec6da0 00000086 65af8605 00002780 00000001 f6fd9c80 00002780 c148a1c0
[43452.473179]  00000c57 00000000 c148a1c0 edec6da0 c148a1c0 c148a1c0 f8a170f3 f4303400
[43452.475539]  ef2245c0 00000000 f4303400 f8a1882d f6e9c52c d32bbde4 f8a1c4e6 d32bbddc
[43452.477932] Call Trace:
[43452.480311]  [<f8a170f3>] ? flush_bg_queue+0x76/0x8a [fuse]
[43452.482700]  [<f8a1882d>] ? fuse_request_send_background+0x25/0x3a [fuse]
[43452.485094]  [<f8a1c4e6>] ? fuse_readpages+0x7c/0x91 [fuse]
[43452.487472]  [<c105307c>] ? timekeeping_get_ns+0x11/0x4f
[43452.489886]  [<c12b083c>] ? io_schedule+0x6c/0xa4
[43452.492321]  [<c109659c>] ? sleep_on_page_killable+0x5/0x27
[43452.494782]  [<c12b0ca6>] ? __wait_on_bit_lock+0x2f/0x68
[43452.497280]  [<c1096597>] ? sleep_on_page+0x8/0x8
[43452.499784]  [<c1096517>] ? __lock_page_killable+0x6f/0x75
[43452.502334]  [<c104c867>] ? autoremove_wake_function+0x29/0x29
[43452.504913]  [<c1097721>] ? generic_file_aio_read+0x44c/0x60a
[43452.507530]  [<c10cc403>] ? do_sync_read+0x9e/0xd3
[43452.510125]  [<c11204c1>] ? fsnotify_perm+0x4f/0x5a
[43452.512691]  [<c10cc4ff>] ? rw_verify_area+0xc7/0xe8
[43452.515279]  [<c10cc365>] ? do_sync_write+0xd3/0xd3
[43452.517867]  [<c10ccdf3>] ? vfs_read+0x7c/0xd6
[43452.520434]  [<c10ccee3>] ? sys_read+0x3c/0x63
[43452.522968]  [<c12b6fdf>] ? sysenter_do_call+0x12/0x28
[43452.525541] 
[43452.528079] Restarting tasks ... done.
[43452.589475] video LNXVIDEO:00: Restoring backlight state
[43452.776042] tg3 0000:02:00.0: BAR 0: set to [mem 0xb0200000-0xb020ffff 64bit] (PCI address [0xb0200000-0xb020ffff])
[43452.932807] ADDRCONF(NETDEV_UP): eth0: link is not ready
[43452.937703] ADDRCONF(NETDEV_UP): eth1: link is not ready
[43453.448848] EXT4-fs (dm-1): re-mounted. Opts: errors=remount-ro,barrier=1,user_xattr,acl,commit=120,commit=0
[43453.822418] EXT4-fs (dm-5): re-mounted. Opts: barrier=1,user_xattr,acl,commit=300,commit=0
[43453.843030] EXT4-fs (dm-2): re-mounted. Opts: barrier=1,user_xattr,acl,commit=120,commit=0
[43457.970949] EXT4-fs (dm-3): re-mounted. Opts: barrier=1,user_xattr,acl,commit=120,commit=0
[43458.722154] EXT4-fs (dm-6): re-mounted. Opts: barrier=1,user_xattr,acl,commit=120,commit=0
[43461.984466] thinkpad_acpi: fan watchdog: enabling fan
Comment 6 Witold Baryluk 2011-07-19 15:48:14 UTC
Still present in 3.0.0-rc7, when performing simple cat /some-fuse-file-system/bigfile.avi  > /dev/null.

I really do not know if this is new issues, but I had no issues for few years, and just noticed it in 2.6.39 - 3.0.0

[ 6926.941930] PM: Syncing filesystems ... done.
[ 6928.372670] Freezing user space processes ... 
[ 6948.394575] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0):
[ 6948.399365] cat             D ebaf1e8c     0 12428  12418 0x00800004
[ 6948.404144]  eeeb5dd4 00200086 ebaf1e2c ebaf1e8c 00000145 eeeb5e00 c119acdb 00000000
[ 6948.408991]  c0c563f4 00000004 eee6d540 eeeb5d84 eeeb5d80 eeeb5d88 eeeb5d88 eeeb5db4
[ 6948.413865]  c0c563f4 00000480 eeeb5e00 c11e55da 00000000 c1073985 00000020 f1e3ce80
[ 6948.418742] Call Trace:
[ 6948.423574]  [<c119acdb>] ? flush_bg_queue+0x7b/0x90
[ 6948.428432]  [<c11e55da>] ? blk_finish_plug+0xa/0x30
[ 6948.433309]  [<c1073985>] ? read_pages+0x45/0xd0
[ 6948.438140]  [<c156fbd2>] ? io_schedule+0x32/0x50
[ 6948.442951]  [<c106b715>] ? sleep_on_page_killable+0x5/0x30
[ 6948.447812]  [<c1570236>] ? __wait_on_bit_lock+0x46/0x90
[ 6948.452752]  [<c106b710>] ? sleep_on_page+0x10/0x10
[ 6948.457712]  [<c106b63e>] ? __lock_page_killable+0x9e/0xb0
[ 6948.462758]  [<c103e620>] ? autoremove_wake_function+0x40/0x40
[ 6948.467853]  [<c106c542>] ? do_generic_file_read.constprop.47+0x292/0x470
[ 6948.473061]  [<c106d59d>] ? generic_file_aio_read+0xed/0x270
[ 6948.478308]  [<c10a14a0>] ? do_sync_write+0xe0/0xe0
[ 6948.483613]  [<c10a154b>] ? do_sync_read+0xab/0xe0
[ 6948.488925]  [<c1019955>] ? do_page_fault+0x185/0x440
[ 6948.494225]  [<c10a1cdb>] ? vfs_read+0x8b/0x130
[ 6948.499456]  [<c10a1dbc>] ? sys_read+0x3c/0x70
[ 6948.504727]  [<c15713d0>] ? sysenter_do_call+0x12/0x26
[ 6948.509957] 
[ 6948.515095] Restarting tasks ... done.
[ 6948.557211] video LNXVIDEO:00: Restoring backlight state
Comment 7 Witold Baryluk 2011-07-27 16:00:36 UTC
Next kind of. Again slightly different. on 3.0.0-t43-prod-03124-g81d6743-dirty

[188056.511692] PM: Syncing filesystems ... done.
[188057.344873] Freezing user space processes ... 
[188077.360091] Freezing of tasks failed after 20.01 seconds (2 tasks refusing to freeze, wq_busy=0):
[188077.360819] collectd        D f524f900     0  3453   3443 0x00800004
[188077.361356]  f37c1e88 00000082 c1027ef6 f524f900 00000092 00000000 f37c1e38 00000092
[188077.362089]  00000000 f56ccfa0 f37c1e38 f37c1e60 c1026e67 00000000 00000001 00000003
[188077.362821]  f56ccfa0 f37c1e78 c10380be f56ccfa0 c1038a6f 00000000 00000000 f56ccfa0
[188077.363553] Call Trace:
[188077.363763]  [<c1027ef6>] ? check_preempt_curr+0x76/0x90
[188077.364188]  [<c1026e67>] ? __wake_up_common+0x47/0x70
[188077.364601]  [<c10380be>] ? recalc_sigpending+0xe/0x30
[188077.365013]  [<c1038a6f>] ? __set_task_blocked+0x2f/0x80
[188077.365438]  [<c103a5ad>] ? set_current_blocked+0xd/0x10
[188077.365863]  [<c103a60b>] ? sigprocmask+0x5b/0xd0
[188077.366243]  [<c11a7a7d>] ? fuse_request_send+0xbd/0x250
[188077.366669]  [<c1043b80>] ? finish_wait+0x50/0x50
[188077.367047]  [<c11b0a7c>] ? fuse_statfs+0xdc/0x160
[188077.367434]  [<c10cc6d5>] ? statfs_by_dentry+0xb5/0x110
[188077.367853]  [<c10cc74a>] ? vfs_statfs+0x1a/0xa0
[188077.368223]  [<c10cc808>] ? user_statfs+0x38/0x60
[188077.368601]  [<c10cc8ca>] ? sys_statfs64+0x1a/0x40
[188077.368987]  [<c15838d0>] ? sysenter_do_call+0x12/0x26

and on more call trace (as two tasks was freezed), probably releated to bug #34572 (see comment #15 there).
Comment 8 Miklos Szeredi 2011-08-04 16:58:08 UTC
Yes, this is a known issue with fuse vs. suspend.
Comment 9 Alan 2012-08-23 14:07:50 UTC
Please re-open if seen with a modern kernel
Comment 10 瀚海书香 2012-11-30 01:44:33 UTC
Has this bug been fixed? In which version?
Comment 11 Laszlo Monda 2013-02-21 12:22:51 UTC
I got the following yesterday with the stock 3.5.0-24-lowlatency kernel of Linux Mint and also got it recently with 3.5.0-19-generic.

I bumped into this issue lately and encounter with it from time to time so this definitely seems a standing issue.

Feb 21 02:22:23 nitehawk kernel: [ 9359.366095] PM: Syncing filesystems ... done.
Feb 21 02:22:23 nitehawk kernel: [ 9359.372465] PM: Preparing system for mem sleep
Feb 21 02:22:23 nitehawk kernel: [ 9359.372484] Freezing user space processes ... 
Feb 21 02:22:23 nitehawk kernel: [ 9379.373235] Freezing of tasks failed after 20.00 seconds (1 tasks refusing to freeze, wq_busy=0):
Feb 21 02:22:23 nitehawk kernel: [ 9379.373336] mc              D ffff88013fd13b00     0  8583      1 0x00000004
Feb 21 02:22:23 nitehawk kernel: [ 9379.373339]  ffff880048bd7be8 0000000000000082 ffff880048b91720 ffff880048bd7fd8
Feb 21 02:22:23 nitehawk kernel: [ 9379.373343]  ffff880048bd7fd8 ffff880048bd7fd8 ffff880139b40000 ffff880048b91720
Feb 21 02:22:23 nitehawk kernel: [ 9379.373346]  ffff880048bd7bc8 ffff880048b91720 ffff88013fd14398 0000000000000082
Feb 21 02:22:23 nitehawk kernel: [ 9379.373349] Call Trace:
Feb 21 02:22:23 nitehawk kernel: [ 9379.373357]  [<ffffffff81127260>] ? sleep_on_page+0x20/0x20
Feb 21 02:22:23 nitehawk kernel: [ 9379.373361]  [<ffffffff81690a29>] schedule+0x29/0x70
Feb 21 02:22:23 nitehawk kernel: [ 9379.373364]  [<ffffffff81690aff>] io_schedule+0x8f/0xd0
Feb 21 02:22:23 nitehawk kernel: [ 9379.373366]  [<ffffffff8112726e>] sleep_on_page_killable+0xe/0x40
Feb 21 02:22:23 nitehawk kernel: [ 9379.373369]  [<ffffffff8168ef7b>] __wait_on_bit_lock+0x5b/0xc0
Feb 21 02:22:23 nitehawk kernel: [ 9379.373371]  [<ffffffff811271c7>] __lock_page_killable+0x67/0x70
Feb 21 02:22:23 nitehawk kernel: [ 9379.373375]  [<ffffffff81078e90>] ? autoremove_wake_function+0x40/0x40
Feb 21 02:22:23 nitehawk kernel: [ 9379.373378]  [<ffffffff81129625>] generic_file_aio_read+0x475/0x710
Feb 21 02:22:23 nitehawk kernel: [ 9379.373382]  [<ffffffff8129fc25>] fuse_file_aio_read+0x75/0x90
Feb 21 02:22:23 nitehawk kernel: [ 9379.373385]  [<ffffffff811889c6>] do_sync_read+0xe6/0x120
Feb 21 02:22:23 nitehawk kernel: [ 9379.373388]  [<ffffffff812bee52>] ? security_file_permission+0x92/0xb0
Feb 21 02:22:23 nitehawk kernel: [ 9379.373391]  [<ffffffff81188e81>] ? rw_verify_area+0x61/0xf0
Feb 21 02:22:23 nitehawk kernel: [ 9379.373393]  [<ffffffff81189319>] vfs_read+0xa9/0x180
Feb 21 02:22:23 nitehawk kernel: [ 9379.373395]  [<ffffffff8118943a>] sys_read+0x4a/0x90
Feb 21 02:22:23 nitehawk kernel: [ 9379.373398]  [<ffffffff816990a9>] system_call_fastpath+0x16/0x1b
Comment 12 Askar Safin 2023-04-03 16:50:33 UTC
Still reproducible on Linux 5.10 with debian stretch. Steps to reproduce:
- Mount something using fuse sshfs (sshfs 2.8, libfuse 2)
- Run "cat /path-to-sshfs-mount/some-big-file > /dev/null"
- Try to suspend while above command runs

Actual behavior: suspend doesn't work
Expected behavior: suspend should work
Reproducibility: always

Please, reopen the bug

Note You need to log in before you can comment on or make changes to this bug.