-
Notifications
You must be signed in to change notification settings - Fork 49
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
When there is a lockfile timeout, show the pid of the process that holds the lock #201
Comments
Also a good option could be --retry-forever We have made automatic injection system, that grep the error and loop until it works. |
Retrying forever seems a bit dubious to me. There is a |
can we set PINTO_LOCKFILE_TIMEOUT=-1 or =0 mean forever ? The forever is more because we have numerous to work on pinto, and it has no queue for processing. So trying to steal lock as soon as possible could be great. But we need to complete the operation under an automatic process. |
looks like File::NFSLock supports stale_lock_timeout, could expose it via PINTO_ env and document interaction if ultimate goal is not "wait forever" but "wait suitably long, then steal the lock" also, with a (very) quick look at File::NFSLock internals, i don't see it setting the holder pid on failure - a chicken/egg issue for reporting it on failure |
Yes, it looks like 0 means forever. |
I think we'll just have to parse that from the contents of the lock file itself (or patch File::NFSLock). |
proposed hookbot/File-NFSLock#3, if accepted, it should cover the reporting issue. |
Maybe even offer to steal the lock if the process appears dead. Beware of NFS situations, where the process could be on another server.
The text was updated successfully, but these errors were encountered: