Bug 195817 - CIFS disregards USER_SESSION_DELETED from remote servers and holds onto dead sessions
Summary: CIFS disregards USER_SESSION_DELETED from remote servers and holds onto dead ...
Status: RESOLVED CODE_FIX
Alias: None
Product: File System
Classification: Unclassified
Component: CIFS (show other bugs)
Hardware: All Linux
: P1 normal
Assignee: fs_cifs
URL:
Keywords:
Depends on:
Blocks:
 
Reported: 2017-05-19 12:26 UTC by Keith Ward
Modified: 2019-02-07 10:02 UTC (History)
2 users (show)

See Also:
Kernel Version: 4.x
Subsystem:
Regression: No
Bisected commit-id:


Attachments

Description Keith Ward 2017-05-19 12:26:23 UTC
The cifs kernel module currently is currently disregarding the nt_status USER_SESSION_DELETED packets sent to it from remote SMB2+ peers and is subsequently holding onto clearly dead SMB2+ sessions as a result.

This is causing issues as it appears that the kernel only reconnects after a delay period, during which time all IO to that network share (with that session id) results in an I/O error for the user; all the while the kernel is repeatedly resubmitting the requests in the background with the server just responding to each of them with the same error.

This is contrasting to the behaviour on Windows clients - whereby if the response the client gets is USER_SESSION_DELETED, it will immediately dump that session and reconnect - transparently to the user.

This causes massive issues multiuser mounts (as is the case for me), as if a users SMB session is deleted by the server for any reason, that user essentially loses all access to that share for extended periods, until the cifs module sees fit to treat that session as dead and renegotiate or an admin intervenes.

Reproducing the issue is easily done, establish an SMB2+ mount to a Windows Server machine, open any directory on client to establish a session, then simply terminate that SMB session from the servers MMC console, if you then try and run any queries against that mountpoint on the client you will just end up with a string of I/O errors because its trying to use a session id which is dead.

For reference I'm testing using the following mount options:
vers=3.02,multiuser,sec=ntlmsspi,file_mode=0750,dir_mode=0755,cache=none,credentials=/etc/smb.credentials.conf

Kernels 4.4 and 4.10 have extended delays until the user is able to regain access to the mountpoint, the latest rc has cut that down significantly and its only a few minutes, but even still if the session has been deleted by the server the client should at least attempt to reconnect and then resubmit the last command, rather than continue to submit packets with dead session ID's and return I/O to the user.
Comment 1 Ronnie Sahlberg 2019-02-07 04:25:43 UTC
This was fixed in :
commit d81243c697ffc71f983736e7da2db31a8be0001f
Author: Mark Syms <mark.syms@citrix.com>
Date:   Thu May 24 09:47:31 2018 +0100

    CIFS: 511c54a2f69195b28afb9dd119f03787b1625bb4 adds a check for session expiry, status STATUS_NETWORK_SESSION_EXPIRED, however the server can also respond with STATUS_USER_SESSION_DELETED in cases where the session has been idle for some time and the server reaps the session to recover resources.
    
    Handle this additional status in the same way as SESSION_EXPIRED.
    
    Signed-off-by: Mark Syms <mark.syms@citrix.com>
    Signed-off-by: Steve French <stfrench@microsoft.com>
    CC: Stable <stable@vger.kernel.org>

Note You need to log in before you can comment on or make changes to this bug.