@gman0
Collaborator

@gman0 gman0 commented Sep 28, 2023

The automount daemon unmounts the autofs root in /cvmfs upon receiving SIGTERM. This makes it impossible to reconnect
the daemon to the mount later, so all consumer Pods lose their CVMFS mounts, with no way of restoring
them (unless these Pods are restarted too). The assumption here is that the nodeplugin is only being restarted and will be
needed again.

automount handles SIGKILL differently: the daemon is forced to skip its cleanup at exit, which leaves the autofs
mount behind and makes it possible to reconnect to it later. We make use of this: unless the admin explicitly asks for cleanup with the AUTOFS_TRY_CLEAN_AT_EXIT env var, no cleanup is done.

Fixes #121

@gman0 gman0 force-pushed the autofs-reconnect-fix branch from d531272 to f1d7ee2 Compare September 30, 2023 15:39
@gman0 gman0 merged commit 22cbc9a into cvmfs-contrib:master Sep 30, 2023
gman0 added a commit to gman0/cvmfs-csi that referenced this pull request Sep 30, 2023
Cherry-pick f1d7ee2 (cvmfs-contrib#122)
gman0 added a commit that referenced this pull request Sep 30, 2023
(Backport #122) automount: shut down automount daemon with SIGKILL
@dimm0
Contributor

dimm0 commented Oct 2, 2023

The mount still disappears from the pods on nodeplugin restart in v2.3.1

@gman0
Collaborator Author

gman0 commented Oct 3, 2023

@dimm0 please open a new issue with logs attached, with logVerbosityLevel set to 5.

Successfully merging this pull request may close these issues.

automount daemon unable to reconnect
