What is the best way to get the job failure errstr without using wait? #3496

Answered by grondo
SteVwonder asked this question in Q&A
The errstr is not stored anywhere. The job manager generates it when it responds to a wait request after the job has exited.

https://github.com/flux-framework/flux-core/blob/master/src/modules/job-manager/wait.c#L72

The string itself is purely informational and is not meant to be part of job provenance, since it can be recreated from the status in the finish event.

A similar message (probably not exactly the same) is returned by flux job status, which produces it by processing the eventlog, e.g.:

$ flux job status -v ƒczpiyJKR
flux-job: ƒczpiyJKR: exited with exit code 1
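
For anyone who wants to rebuild a message like this themselves, here is a minimal Python sketch of the idea. It assumes the job eventlog is available as text with one JSON event per line, and that the finish event carries a POSIX wait status under context.status. The function names and the sample eventlog are hypothetical, and the wording of the messages is mine, not the exact strings produced by wait.c or flux job status.

```python
import json
import os


def describe_wait_status(status):
    """Turn a POSIX wait status into a short human-readable message.

    The message wording is illustrative only; it is not guaranteed to
    match what the job manager or flux job status prints.
    """
    if os.WIFEXITED(status):
        code = os.WEXITSTATUS(status)
        if code == 0:
            return "exited normally"
        return f"exited with exit code {code}"
    if os.WIFSIGNALED(status):
        return f"killed by signal {os.WTERMSIG(status)}"
    return f"unrecognized wait status {status}"


def finish_message(eventlog_text):
    """Return a message for the first finish event, or None if absent.

    Assumes one JSON object per line with "name" and "context" keys.
    """
    for line in eventlog_text.splitlines():
        if not line.strip():
            continue
        event = json.loads(line)
        if event.get("name") == "finish":
            return describe_wait_status(event["context"]["status"])
    return None


if __name__ == "__main__":
    # Made-up sample eventlog for illustration, not real job data.
    sample = "\n".join([
        '{"timestamp": 1612400000.0, "name": "submit", "context": {}}',
        '{"timestamp": 1612400001.0, "name": "finish", "context": {"status": 256}}',
    ])
    print(finish_message(sample))
```

A wait status of 256 decodes to exit code 1 on Linux, so the sample corresponds to the failed job shown above. How you obtain the eventlog text is left out here on purpose, since that depends on whether you go through the CLI or the bindings.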

SteVwonder Feb 4, 2021
Answer selected by SteVwonder