When there is a lockfile timeout, show the pid of the process that holds the lock #201

thaljef · 2015-03-24T07:46:30Z

Maybe even offer to steal the lock if the process appears dead. Beware of NFS situations, where the process could be on another server.

celogeek · 2015-04-10T10:30:02Z

Also a good option could be --retry-forever
So if we can't get the lock, we retry until we got it.

We have made automatic injection system, that grep the error and loop until it works.
We a --retry-forever option it would ease that situation.

thaljef · 2015-04-22T17:32:15Z

Retrying forever seems a bit dubious to me. There is a PINTO_LOCKFILE_TIMEOUT environment variable you can set though. Not sure if that is documented.

celogeek · 2015-04-23T05:32:53Z

can we set PINTO_LOCKFILE_TIMEOUT=-1 or =0 mean forever ?

The forever is more because we have numerous to work on pinto, and it has no queue for processing. So trying to steal lock as soon as possible could be great. But we need to complete the operation under an automatic process.

cakirke · 2015-05-05T01:12:36Z

looks like File::NFSLock supports stale_lock_timeout, could expose it via PINTO_ env and document interaction if ultimate goal is not "wait forever" but "wait suitably long, then steal the lock"

also, with a (very) quick look at File::NFSLock internals, i don't see it setting the holder pid on failure - a chicken/egg issue for reporting it on failure

thaljef · 2015-05-05T02:40:05Z

can we set PINTO_LOCKFILE_TIMEOUT=-1 or =0 mean forever ?

Yes, it looks like 0 means forever.

thaljef · 2015-05-05T02:44:30Z

i don't see it setting the holder pid on failure - a chicken/egg issue for reporting it on failure.

I think we'll just have to parse that from the contents of the lock file itself (or patch File::NFSLock).

cakirke · 2015-05-08T12:06:27Z

proposed hookbot/File-NFSLock#3, if accepted, it should cover the reporting issue.
continuing with the stale lock/steal lock portion of this.

cakirke mentioned this issue May 8, 2015

show lock holder information in errstr hookbot/File-NFSLock#3

Open

cakirke mentioned this issue May 10, 2015

Issue 201 lock #209

Merged

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

When there is a lockfile timeout, show the pid of the process that holds the lock #201

When there is a lockfile timeout, show the pid of the process that holds the lock #201

thaljef commented Mar 24, 2015

celogeek commented Apr 10, 2015

thaljef commented Apr 22, 2015

celogeek commented Apr 23, 2015

cakirke commented May 5, 2015

thaljef commented May 5, 2015

thaljef commented May 5, 2015

cakirke commented May 8, 2015

When there is a lockfile timeout, show the pid of the process that holds the lock #201

When there is a lockfile timeout, show the pid of the process that holds the lock #201

Comments

thaljef commented Mar 24, 2015

celogeek commented Apr 10, 2015

thaljef commented Apr 22, 2015

celogeek commented Apr 23, 2015

cakirke commented May 5, 2015

thaljef commented May 5, 2015

thaljef commented May 5, 2015

cakirke commented May 8, 2015