Bug 34932
Summary: | Freezing of tasks failed. fuse_request_send | ||
---|---|---|---|
Product: | File System | Reporter: | Witold Baryluk (witold.baryluk+kernel) |
Component: | VFS | Assignee: | fs_vfs |
Status: | RESOLVED OBSOLETE | ||
Severity: | normal | CC: | alan, laci, miklos, safinaskar, tingw.liu |
Priority: | P1 | ||
Hardware: | All | ||
OS: | Linux | ||
Kernel Version: | 2.6.39-rc7-00708-g9f381a6-dirty | Subsystem: | |
Regression: | No | Bisected commit-id: | |
Attachments: | kernel config |
Description
Witold Baryluk
2011-05-11 16:49:25 UTC
Created attachment 57372 [details]
kernel config
This build also have revalidate https://bugzilla.kernel.org/show_bug.cgi?id=34732 (fuse_dentry_revalidate NULL dereference) patch applied. One more on 2.6.39-rc7-thinkpad-t43-00829-geed631e-dirty [ 9383.111642] Freezing user space processes ... [ 9400.320741] thinkpad_acpi: fan watchdog: enabling fan [ 9403.120228] [ 9403.122551] Freezing of tasks failed after 20.00 seconds (1 tasks refusing to freeze, wq_busy=0): [ 9403.125748] conf D f1baaecc 6432 24943 24886 0x00800084 [ 9403.128181] f235fd7c 00000086 8179d659 f1baaecc f235fd08 8179d704 ab628371 00000888 [ 9403.130739] f235fd28 81c94900 81c94900 00000000 00000888 f1baaa40 b77a3f60 00000000 [ 9403.133290] 00000000 f1baaa40 00000001 8107bd68 001aa17c f235fd74 81088dca 0500bd84 [ 9403.135874] Call Trace: [ 9403.138391] [<8179d659>] ? sub_preempt_count.part.170+0x9/0x90 [ 9403.140964] [<8179d704>] ? sub_preempt_count+0x24/0x50 [ 9403.143478] [<8107bd68>] ? prepare_to_wait_exclusive+0x48/0x80 [ 9403.146026] [<81088dca>] ? ktime_get_ts+0xea/0x120 [ 9403.148604] [<81797438>] io_schedule+0x78/0xb0 [ 9403.151225] [<810fdaad>] sleep_on_page_killable+0xd/0x40 [ 9403.153853] [<81797ca7>] __wait_on_bit_lock+0x47/0xa0 [ 9403.156507] [<810fdaa0>] ? sleep_on_page+0x20/0x20 [ 9403.159186] [<810fd9c7>] __lock_page_killable+0xa7/0xb0 [ 9403.161932] [<8107bfa0>] ? autoremove_wake_function+0x50/0x50 [ 9403.164684] [<810ff2ef>] do_generic_file_read.constprop.33+0x2ef/0x4f0 [ 9403.167477] [<81083955>] ? local_clock+0x65/0x70 [ 9403.170267] [<811001a4>] generic_file_aio_read+0xd4/0x290 [ 9403.173035] [<810114c8>] ? sched_clock+0x8/0x10 [ 9403.175816] [<8108360f>] ? sched_clock_local+0xcf/0x1d0 [ 9403.178601] [<81291fd4>] fuse_file_aio_read+0xc4/0xf0 [ 9403.181384] [<8114f00c>] do_sync_read+0xac/0xe0 [ 9403.184126] [<812b474c>] ? security_file_permission+0x8c/0xa0 [ 9403.186912] [<8114f34c>] ? rw_verify_area+0x6c/0x120 [ 9403.189688] [<810942a2>] ? check_flags+0x12/0x20 [ 9403.192450] [<8114f82c>] vfs_read+0x8c/0x160 [ 9403.195156] [<8114ef60>] ? do_sync_write+0xe0/0xe0 [ 9403.197844] [<8114f93d>] sys_read+0x3d/0x70 [ 9403.200524] [<817a1098>] sysenter_do_call+0x12/0x38 [ 9403.203293] [ 9403.205884] Restarting tasks ... done. [ 9403.299179] video LNXVIDEO:00: Restoring backlight state It happens only when i have fuse file system busy (i.e. some process is doing something heavy on fuse filesystem and I'm trying to suspend computer). On, more. [ 9907.601105] Freezing user space processes ... [ 9909.280188] thinkpad_acpi: fan watchdog: enabling fan [ 9927.620460] [ 9927.624458] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0): [ 9927.630280] git D 8179d659 5720 30852 30851 0x00800084 [ 9927.634513] ef75ddb4 00000082 ef75dd38 8179d659 b770aecc ef75dd44 ca2f065f 00000902 [ 9927.638879] 8109173d 81c94900 81c94900 00000000 00000902 b770aa40 b7589520 ef75dd7c [ 9927.643379] 00000000 00000000 b770aa40 00000001 001c3a40 ef75ddac 81088dca 22f7d887 [ 9927.647841] Call Trace: [ 9927.652236] [<8179d659>] ? sub_preempt_count.part.170+0x9/0x90 [ 9927.656673] [<8109173d>] ? put_lock_stats.isra.20+0xd/0x30 [ 9927.661120] [<81088dca>] ? ktime_get_ts+0xea/0x120 [ 9927.665564] [<81797438>] io_schedule+0x78/0xb0 [ 9927.670088] [<810fda8d>] sleep_on_page+0xd/0x20 [ 9927.674617] [<81797def>] __wait_on_bit+0x4f/0x80 [ 9927.679177] [<810fda80>] ? __lock_page+0xb0/0xb0 [ 9927.683825] [<810fdc63>] wait_on_page_bit+0xb3/0xc0 [ 9927.688477] [<8107bfa0>] ? autoremove_wake_function+0x50/0x50 [ 9927.693244] [<811004a7>] __lock_page_or_retry+0x47/0x70 [ 9927.698017] [<81100816>] filemap_fault+0x346/0x3b0 [ 9927.702872] [<811004d0>] ? __lock_page_or_retry+0x70/0x70 [ 9927.707675] [<8111d0ef>] __do_fault+0x4f/0x4d0 [ 9927.712476] [<811004d0>] ? __lock_page_or_retry+0x70/0x70 [ 9927.717308] [<81120497>] handle_pte_fault+0x87/0x240 [ 9927.722171] [<81120818>] handle_mm_fault+0x138/0x260 [ 9927.726934] [<8179d247>] do_page_fault+0x117/0x520 [ 9927.731709] [<812940f0>] ? fuse_flush+0x40/0x140 [ 9927.736491] [<8179aa2f>] ? error_code+0x5b/0x64 [ 9927.741285] [<8179d130>] ? vmalloc_fault+0x190/0x190 [ 9927.746005] [<8179d130>] ? vmalloc_fault+0x190/0x190 [ 9927.750679] [<8179aa33>] error_code+0x5f/0x64 [ 9927.755260] [<8179d130>] ? vmalloc_fault+0x190/0x190 [ 9927.759879] [ 9927.762581] Restarting tasks ... done. and one more (same?) [10109.434374] Freezing user space processes ... [10111.840098] thinkpad_acpi: fan watchdog: enabling fan [10129.450245] [10129.452542] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0): [10129.455775] git D 8179d659 6252 32330 32326 0x00800084 [10129.458196] f1befdb4 00000082 f1befd38 8179d659 f0a219ac f1befd44 c9103491 00000931 [10129.460737] 8109173d 81c94900 81c94900 00000000 00000931 f0a21520 ee078000 f1befd7c [10129.463286] 00000000 00000000 f0a21520 00000001 001cd784 f1befdac 81088dca 19b59094 [10129.465860] Call Trace: [10129.468353] [<8179d659>] ? sub_preempt_count.part.170+0x9/0x90 [10129.470899] [<8109173d>] ? put_lock_stats.isra.20+0xd/0x30 [10129.473445] [<81088dca>] ? ktime_get_ts+0xea/0x120 [10129.476013] [<81797438>] io_schedule+0x78/0xb0 [10129.478597] [<810fda8d>] sleep_on_page+0xd/0x20 [10129.481234] [<81797def>] __wait_on_bit+0x4f/0x80 [10129.483864] [<810fda80>] ? __lock_page+0xb0/0xb0 [10129.486513] [<810fdc63>] wait_on_page_bit+0xb3/0xc0 [10129.489201] [<8107bfa0>] ? autoremove_wake_function+0x50/0x50 [10129.491949] [<811004a7>] __lock_page_or_retry+0x47/0x70 [10129.494698] [<81100816>] filemap_fault+0x346/0x3b0 [10129.497425] [<811004d0>] ? __lock_page_or_retry+0x70/0x70 [10129.500167] [<8111d0ef>] __do_fault+0x4f/0x4d0 [10129.502908] [<811004d0>] ? __lock_page_or_retry+0x70/0x70 [10129.505656] [<81120497>] handle_pte_fault+0x87/0x240 [10129.508376] [<81120818>] handle_mm_fault+0x138/0x260 [10129.511096] [<8179d247>] do_page_fault+0x117/0x520 [10129.513814] [<810813a1>] ? hrtimer_interrupt+0x151/0x280 [10129.516531] [<8179d69d>] ? sub_preempt_count.part.170+0x4d/0x90 [10129.519244] [<8179aa2f>] ? error_code+0x5b/0x64 [10129.521952] [<8179d130>] ? vmalloc_fault+0x190/0x190 [10129.524607] [<8179d130>] ? vmalloc_fault+0x190/0x190 [10129.527213] [<8179aa33>] error_code+0x5f/0x64 [10129.529794] [<8179d130>] ? vmalloc_fault+0x190/0x190 [10129.532430] [10129.535002] Restarting tasks ... done. And one more (different again): [10216.592699] Freezing user space processes ... [10234.880096] thinkpad_acpi: fan watchdog: enabling fan [10236.610246] [10236.612850] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0): [10236.616459] git D 8179d659 6140 1900 32304 0x00800084 [10236.619227] f0afbdb4 00000082 f0afbd38 8179d659 8c0eaecc f0afbd44 bc523cf1 0000094a [10236.622089] 8109173d 81c94900 81c94900 00000000 0000094a 8c0eaa40 eca9aa40 f0afbd7c [10236.624929] 00000000 00000000 8c0eaa40 00000001 001d2b34 f0afbdac 81088dca 23452c4c [10236.627765] Call Trace: [10236.630540] [<8179d659>] ? sub_preempt_count.part.170+0x9/0x90 [10236.633302] [<8109173d>] ? put_lock_stats.isra.20+0xd/0x30 [10236.636066] [<81088dca>] ? ktime_get_ts+0xea/0x120 [10236.638805] [<81797438>] io_schedule+0x78/0xb0 [10236.641580] [<810fda8d>] sleep_on_page+0xd/0x20 [10236.644305] [<81797def>] __wait_on_bit+0x4f/0x80 [10236.647015] [<810fda80>] ? __lock_page+0xb0/0xb0 [10236.649709] [<810fdc63>] wait_on_page_bit+0xb3/0xc0 [10236.652441] [<8107bfa0>] ? autoremove_wake_function+0x50/0x50 [10236.655162] [<811004a7>] __lock_page_or_retry+0x47/0x70 [10236.657892] [<81100816>] filemap_fault+0x346/0x3b0 [10236.660633] [<811004d0>] ? __lock_page_or_retry+0x70/0x70 [10236.663331] [<8111d0ef>] __do_fault+0x4f/0x4d0 [10236.666019] [<811004d0>] ? __lock_page_or_retry+0x70/0x70 [10236.668740] [<81120497>] handle_pte_fault+0x87/0x240 [10236.671470] [<81120818>] handle_mm_fault+0x138/0x260 [10236.674150] [<8179d247>] do_page_fault+0x117/0x520 [10236.676819] [<81154bdb>] ? sys_lstat64+0x2b/0x30 [10236.679470] [<8179aa2f>] ? error_code+0x5b/0x64 [10236.682140] [<8179d130>] ? vmalloc_fault+0x190/0x190 [10236.684760] [<8179d130>] ? vmalloc_fault+0x190/0x190 [10236.687349] [<8179aa33>] error_code+0x5f/0x64 [10236.689926] [<8179d130>] ? vmalloc_fault+0x190/0x190 [10236.692501] [10236.694967] Restarting tasks ... done. Interesingly, I have this problem only when STARTING compilation of kernel. Most problematic for some reason is git command (git describe --exact-match) and conf. When this commands are in progress, then i cannot suspend. But when they end, and normal compilation took place (still with pretty big disk usage), I have no problem suspending machine (10 success in 10 tries). But when i put really big load on machine (make -j6), then also can reproduce this when compiling: [10345.074878] PM: Syncing filesystems ... done. [10350.738837] Freezing user space processes ... [10370.754726] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0): [10370.760811] ld D 8179d659 6424 4532 4528 0x00800084 [10370.765250] 941cfdb4 00000082 941cfd38 8179d659 f38f6e2c 941cfd44 f6d76dfa 00000969 [10370.769809] 8109173d 81c94900 81c94900 00000000 00000969 f38f69a0 b7589520 941cfd7c [10370.774510] 00000000 00000000 f38f69a0 00000001 001d93fa 941cfdac 81088dca 2ac4aabc [10370.779148] Call Trace: [10370.783733] [<8179d659>] ? sub_preempt_count.part.170+0x9/0x90 [10370.788360] [<8109173d>] ? put_lock_stats.isra.20+0xd/0x30 [10370.793045] [<81088dca>] ? ktime_get_ts+0xea/0x120 [10370.797682] [<81797438>] io_schedule+0x78/0xb0 [10370.802341] [<810fda8d>] sleep_on_page+0xd/0x20 [10370.806943] [<81797def>] __wait_on_bit+0x4f/0x80 [10370.811579] [<810fda80>] ? __lock_page+0xb0/0xb0 [10370.816158] [<810fdc63>] wait_on_page_bit+0xb3/0xc0 [10370.820774] [<8107bfa0>] ? autoremove_wake_function+0x50/0x50 [10370.825366] [<811004a7>] __lock_page_or_retry+0x47/0x70 [10370.829953] [<81100816>] filemap_fault+0x346/0x3b0 [10370.834603] [<811004d0>] ? __lock_page_or_retry+0x70/0x70 [10370.839221] [<8111d0ef>] __do_fault+0x4f/0x4d0 [10370.843868] [<811004d0>] ? __lock_page_or_retry+0x70/0x70 [10370.848482] [<81120497>] handle_pte_fault+0x87/0x240 [10370.853124] [<81120818>] handle_mm_fault+0x138/0x260 [10370.857713] [<8179d247>] do_page_fault+0x117/0x520 [10370.862384] [<8179aa2f>] ? error_code+0x5b/0x64 [10370.867012] [<8179d130>] ? vmalloc_fault+0x190/0x190 [10370.871710] [<8179d130>] ? vmalloc_fault+0x190/0x190 [10370.876288] [<8179aa33>] error_code+0x5f/0x64 [10370.880902] [<8179d130>] ? vmalloc_fault+0x190/0x190 [10370.885571] [10370.888072] Restarting tasks ... done. In fact I could reproduce it also using cat *somebig_files* > /dev/null [10561.423121] PM: Syncing filesystems ... done. [10561.874139] Freezing user space processes ... [10571.360097] thinkpad_acpi: fan watchdog: enabling fan [10581.890234] [10581.892674] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0): [10581.895973] cat D 8c0eaecc 6188 5057 14924 0x00800084 [10581.898497] f1dd1d7c 00000086 8179d659 8c0eaecc f1dd1d08 8179d704 20ac0580 0000099b [10581.901104] f1dd1d28 81c94900 81c94900 00000000 0000099b 8c0eaa40 b758d480 00000000 [10581.903681] 00000000 8c0eaa40 00000001 8107bd68 001e38f0 f1dd1d74 81088dca 3404bc3c [10581.906282] Call Trace: [10581.908817] [<8179d659>] ? sub_preempt_count.part.170+0x9/0x90 [10581.911419] [<8179d704>] ? sub_preempt_count+0x24/0x50 [10581.913983] [<8107bd68>] ? prepare_to_wait_exclusive+0x48/0x80 [10581.916562] [<81088dca>] ? ktime_get_ts+0xea/0x120 [10581.919145] [<81797438>] io_schedule+0x78/0xb0 [10581.921744] [<810fdaad>] sleep_on_page_killable+0xd/0x40 [10581.924318] [<81797ca7>] __wait_on_bit_lock+0x47/0xa0 [10581.926888] [<810fdaa0>] ? sleep_on_page+0x20/0x20 [10581.929458] [<810fd9c7>] __lock_page_killable+0xa7/0xb0 [10581.932062] [<8107bfa0>] ? autoremove_wake_function+0x50/0x50 [10581.934645] [<810ff2ef>] do_generic_file_read.constprop.33+0x2ef/0x4f0 [10581.937241] [<811001a4>] generic_file_aio_read+0xd4/0x290 [10581.939836] [<81291fd4>] fuse_file_aio_read+0xc4/0xf0 [10581.942475] [<8114f00c>] do_sync_read+0xac/0xe0 [10581.945092] [<812b474c>] ? security_file_permission+0x8c/0xa0 [10581.947727] [<8114f34c>] ? rw_verify_area+0x6c/0x120 [10581.950377] [<810942a2>] ? check_flags+0x12/0x20 [10581.953003] [<8114f82c>] vfs_read+0x8c/0x160 [10581.955648] [<8114ef60>] ? do_sync_write+0xe0/0xe0 [10581.958298] [<8114f93d>] sys_read+0x3d/0x70 [10581.960960] [<817a1098>] sysenter_do_call+0x12/0x38 [10581.963645] [10581.966249] Restarting tasks ... done. I know some time ago, there was similar problem with sshfs. One could not suspend when sshfs was mounted. But i had no such problem on zfs-fuse previously. Also on sshfs it was that no file needed to be opened or operated on, it was sufficient to have sshfs mounted to prevent suspending. Here, one need to put considerable load to trigger some kind of race-condition. AFAIK suspending logic is not sufficient. First kernel freezes all processes, but then it tries to perform something with filesystem, probably caches, but cannot do anything as process (zfs-fuse), is already freezed. Still happens on 3.0-rc2, hovewer kernel message is different [43431.608377] PM: Syncing filesystems ... done. [43431.921563] PM: Preparing system for mem sleep [43432.449170] Freezing user space processes ... [43452.466230] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0): [43452.468573] cat D 00002780 0 22561 16299 0x00800084 [43452.470838] edec6da0 00000086 65af8605 00002780 00000001 f6fd9c80 00002780 c148a1c0 [43452.473179] 00000c57 00000000 c148a1c0 edec6da0 c148a1c0 c148a1c0 f8a170f3 f4303400 [43452.475539] ef2245c0 00000000 f4303400 f8a1882d f6e9c52c d32bbde4 f8a1c4e6 d32bbddc [43452.477932] Call Trace: [43452.480311] [<f8a170f3>] ? flush_bg_queue+0x76/0x8a [fuse] [43452.482700] [<f8a1882d>] ? fuse_request_send_background+0x25/0x3a [fuse] [43452.485094] [<f8a1c4e6>] ? fuse_readpages+0x7c/0x91 [fuse] [43452.487472] [<c105307c>] ? timekeeping_get_ns+0x11/0x4f [43452.489886] [<c12b083c>] ? io_schedule+0x6c/0xa4 [43452.492321] [<c109659c>] ? sleep_on_page_killable+0x5/0x27 [43452.494782] [<c12b0ca6>] ? __wait_on_bit_lock+0x2f/0x68 [43452.497280] [<c1096597>] ? sleep_on_page+0x8/0x8 [43452.499784] [<c1096517>] ? __lock_page_killable+0x6f/0x75 [43452.502334] [<c104c867>] ? autoremove_wake_function+0x29/0x29 [43452.504913] [<c1097721>] ? generic_file_aio_read+0x44c/0x60a [43452.507530] [<c10cc403>] ? do_sync_read+0x9e/0xd3 [43452.510125] [<c11204c1>] ? fsnotify_perm+0x4f/0x5a [43452.512691] [<c10cc4ff>] ? rw_verify_area+0xc7/0xe8 [43452.515279] [<c10cc365>] ? do_sync_write+0xd3/0xd3 [43452.517867] [<c10ccdf3>] ? vfs_read+0x7c/0xd6 [43452.520434] [<c10ccee3>] ? sys_read+0x3c/0x63 [43452.522968] [<c12b6fdf>] ? sysenter_do_call+0x12/0x28 [43452.525541] [43452.528079] Restarting tasks ... done. [43452.589475] video LNXVIDEO:00: Restoring backlight state [43452.776042] tg3 0000:02:00.0: BAR 0: set to [mem 0xb0200000-0xb020ffff 64bit] (PCI address [0xb0200000-0xb020ffff]) [43452.932807] ADDRCONF(NETDEV_UP): eth0: link is not ready [43452.937703] ADDRCONF(NETDEV_UP): eth1: link is not ready [43453.448848] EXT4-fs (dm-1): re-mounted. Opts: errors=remount-ro,barrier=1,user_xattr,acl,commit=120,commit=0 [43453.822418] EXT4-fs (dm-5): re-mounted. Opts: barrier=1,user_xattr,acl,commit=300,commit=0 [43453.843030] EXT4-fs (dm-2): re-mounted. Opts: barrier=1,user_xattr,acl,commit=120,commit=0 [43457.970949] EXT4-fs (dm-3): re-mounted. Opts: barrier=1,user_xattr,acl,commit=120,commit=0 [43458.722154] EXT4-fs (dm-6): re-mounted. Opts: barrier=1,user_xattr,acl,commit=120,commit=0 [43461.984466] thinkpad_acpi: fan watchdog: enabling fan Still present in 3.0.0-rc7, when performing simple cat /some-fuse-file-system/bigfile.avi > /dev/null. I really do not know if this is new issues, but I had no issues for few years, and just noticed it in 2.6.39 - 3.0.0 [ 6926.941930] PM: Syncing filesystems ... done. [ 6928.372670] Freezing user space processes ... [ 6948.394575] Freezing of tasks failed after 20.01 seconds (1 tasks refusing to freeze, wq_busy=0): [ 6948.399365] cat D ebaf1e8c 0 12428 12418 0x00800004 [ 6948.404144] eeeb5dd4 00200086 ebaf1e2c ebaf1e8c 00000145 eeeb5e00 c119acdb 00000000 [ 6948.408991] c0c563f4 00000004 eee6d540 eeeb5d84 eeeb5d80 eeeb5d88 eeeb5d88 eeeb5db4 [ 6948.413865] c0c563f4 00000480 eeeb5e00 c11e55da 00000000 c1073985 00000020 f1e3ce80 [ 6948.418742] Call Trace: [ 6948.423574] [<c119acdb>] ? flush_bg_queue+0x7b/0x90 [ 6948.428432] [<c11e55da>] ? blk_finish_plug+0xa/0x30 [ 6948.433309] [<c1073985>] ? read_pages+0x45/0xd0 [ 6948.438140] [<c156fbd2>] ? io_schedule+0x32/0x50 [ 6948.442951] [<c106b715>] ? sleep_on_page_killable+0x5/0x30 [ 6948.447812] [<c1570236>] ? __wait_on_bit_lock+0x46/0x90 [ 6948.452752] [<c106b710>] ? sleep_on_page+0x10/0x10 [ 6948.457712] [<c106b63e>] ? __lock_page_killable+0x9e/0xb0 [ 6948.462758] [<c103e620>] ? autoremove_wake_function+0x40/0x40 [ 6948.467853] [<c106c542>] ? do_generic_file_read.constprop.47+0x292/0x470 [ 6948.473061] [<c106d59d>] ? generic_file_aio_read+0xed/0x270 [ 6948.478308] [<c10a14a0>] ? do_sync_write+0xe0/0xe0 [ 6948.483613] [<c10a154b>] ? do_sync_read+0xab/0xe0 [ 6948.488925] [<c1019955>] ? do_page_fault+0x185/0x440 [ 6948.494225] [<c10a1cdb>] ? vfs_read+0x8b/0x130 [ 6948.499456] [<c10a1dbc>] ? sys_read+0x3c/0x70 [ 6948.504727] [<c15713d0>] ? sysenter_do_call+0x12/0x26 [ 6948.509957] [ 6948.515095] Restarting tasks ... done. [ 6948.557211] video LNXVIDEO:00: Restoring backlight state Next kind of. Again slightly different. on 3.0.0-t43-prod-03124-g81d6743-dirty [188056.511692] PM: Syncing filesystems ... done. [188057.344873] Freezing user space processes ... [188077.360091] Freezing of tasks failed after 20.01 seconds (2 tasks refusing to freeze, wq_busy=0): [188077.360819] collectd D f524f900 0 3453 3443 0x00800004 [188077.361356] f37c1e88 00000082 c1027ef6 f524f900 00000092 00000000 f37c1e38 00000092 [188077.362089] 00000000 f56ccfa0 f37c1e38 f37c1e60 c1026e67 00000000 00000001 00000003 [188077.362821] f56ccfa0 f37c1e78 c10380be f56ccfa0 c1038a6f 00000000 00000000 f56ccfa0 [188077.363553] Call Trace: [188077.363763] [<c1027ef6>] ? check_preempt_curr+0x76/0x90 [188077.364188] [<c1026e67>] ? __wake_up_common+0x47/0x70 [188077.364601] [<c10380be>] ? recalc_sigpending+0xe/0x30 [188077.365013] [<c1038a6f>] ? __set_task_blocked+0x2f/0x80 [188077.365438] [<c103a5ad>] ? set_current_blocked+0xd/0x10 [188077.365863] [<c103a60b>] ? sigprocmask+0x5b/0xd0 [188077.366243] [<c11a7a7d>] ? fuse_request_send+0xbd/0x250 [188077.366669] [<c1043b80>] ? finish_wait+0x50/0x50 [188077.367047] [<c11b0a7c>] ? fuse_statfs+0xdc/0x160 [188077.367434] [<c10cc6d5>] ? statfs_by_dentry+0xb5/0x110 [188077.367853] [<c10cc74a>] ? vfs_statfs+0x1a/0xa0 [188077.368223] [<c10cc808>] ? user_statfs+0x38/0x60 [188077.368601] [<c10cc8ca>] ? sys_statfs64+0x1a/0x40 [188077.368987] [<c15838d0>] ? sysenter_do_call+0x12/0x26 and on more call trace (as two tasks was freezed), probably releated to bug #34572 (see comment #15 there). Yes, this is a known issue with fuse vs. suspend. Please re-open if seen with a modern kernel Has this bug been fixed? In which version? I got the following yesterday with the stock 3.5.0-24-lowlatency kernel of Linux Mint and also got it recently with 3.5.0-19-generic. I bumped into this issue lately and encounter with it from time to time so this definitely seems a standing issue. Feb 21 02:22:23 nitehawk kernel: [ 9359.366095] PM: Syncing filesystems ... done. Feb 21 02:22:23 nitehawk kernel: [ 9359.372465] PM: Preparing system for mem sleep Feb 21 02:22:23 nitehawk kernel: [ 9359.372484] Freezing user space processes ... Feb 21 02:22:23 nitehawk kernel: [ 9379.373235] Freezing of tasks failed after 20.00 seconds (1 tasks refusing to freeze, wq_busy=0): Feb 21 02:22:23 nitehawk kernel: [ 9379.373336] mc D ffff88013fd13b00 0 8583 1 0x00000004 Feb 21 02:22:23 nitehawk kernel: [ 9379.373339] ffff880048bd7be8 0000000000000082 ffff880048b91720 ffff880048bd7fd8 Feb 21 02:22:23 nitehawk kernel: [ 9379.373343] ffff880048bd7fd8 ffff880048bd7fd8 ffff880139b40000 ffff880048b91720 Feb 21 02:22:23 nitehawk kernel: [ 9379.373346] ffff880048bd7bc8 ffff880048b91720 ffff88013fd14398 0000000000000082 Feb 21 02:22:23 nitehawk kernel: [ 9379.373349] Call Trace: Feb 21 02:22:23 nitehawk kernel: [ 9379.373357] [<ffffffff81127260>] ? sleep_on_page+0x20/0x20 Feb 21 02:22:23 nitehawk kernel: [ 9379.373361] [<ffffffff81690a29>] schedule+0x29/0x70 Feb 21 02:22:23 nitehawk kernel: [ 9379.373364] [<ffffffff81690aff>] io_schedule+0x8f/0xd0 Feb 21 02:22:23 nitehawk kernel: [ 9379.373366] [<ffffffff8112726e>] sleep_on_page_killable+0xe/0x40 Feb 21 02:22:23 nitehawk kernel: [ 9379.373369] [<ffffffff8168ef7b>] __wait_on_bit_lock+0x5b/0xc0 Feb 21 02:22:23 nitehawk kernel: [ 9379.373371] [<ffffffff811271c7>] __lock_page_killable+0x67/0x70 Feb 21 02:22:23 nitehawk kernel: [ 9379.373375] [<ffffffff81078e90>] ? autoremove_wake_function+0x40/0x40 Feb 21 02:22:23 nitehawk kernel: [ 9379.373378] [<ffffffff81129625>] generic_file_aio_read+0x475/0x710 Feb 21 02:22:23 nitehawk kernel: [ 9379.373382] [<ffffffff8129fc25>] fuse_file_aio_read+0x75/0x90 Feb 21 02:22:23 nitehawk kernel: [ 9379.373385] [<ffffffff811889c6>] do_sync_read+0xe6/0x120 Feb 21 02:22:23 nitehawk kernel: [ 9379.373388] [<ffffffff812bee52>] ? security_file_permission+0x92/0xb0 Feb 21 02:22:23 nitehawk kernel: [ 9379.373391] [<ffffffff81188e81>] ? rw_verify_area+0x61/0xf0 Feb 21 02:22:23 nitehawk kernel: [ 9379.373393] [<ffffffff81189319>] vfs_read+0xa9/0x180 Feb 21 02:22:23 nitehawk kernel: [ 9379.373395] [<ffffffff8118943a>] sys_read+0x4a/0x90 Feb 21 02:22:23 nitehawk kernel: [ 9379.373398] [<ffffffff816990a9>] system_call_fastpath+0x16/0x1b Still reproducible on Linux 5.10 with debian stretch. Steps to reproduce: - Mount something using fuse sshfs (sshfs 2.8, libfuse 2) - Run "cat /path-to-sshfs-mount/some-big-file > /dev/null" - Try to suspend while above command runs Actual behavior: suspend doesn't work Expected behavior: suspend should work Reproducibility: always Please, reopen the bug |