Skip to content
This repository has been archived by the owner on Aug 15, 2024. It is now read-only.

"Screw up at restart" issue #81

Open
illdopejake opened this issue Nov 11, 2015 · 0 comments
Open

"Screw up at restart" issue #81

illdopejake opened this issue Nov 11, 2015 · 0 comments
Assignees
Labels
Milestone

Comments

@illdopejake
Copy link

Hey all, reporting here an issue I had when preprocessing ~100 ADPD subjects using Niak 13.4b. I saw that the pipeline was still running but several of my workers were idle. Output said this:

screen shot 2015-11-11 at 1 40 00 pm

Also, this from the daemon logs:
screen shot 2015-11-11 at 2 24 36 pm

Looking in logs/workers, psom2-4 (out of 5) were empty. In psom1, PB pointed out that there was .ready file but no new_jobs.mat file, which apparently can happen if a worker is restarted exactly when the manager assigns it new jobs. But, number of worker deaths was puzzling.

psom5 was the only directory with worker.eqsub and worker.oqsub files. They read as follows:
screen shot 2015-11-11 at 2 40 45 pm
screen shot 2015-11-11 at 2 40 19 pm

Path to output: /gs/project/gsf-624-aa/ADPD_October_2015/preproc/fmri_preprocess_all_scrubb05/

Path to script: /gs/project/gsf-624-aa/ADPD_October_2015/scripts/nkim_preprocessing_jake.m

In solidarity,
--Jake

@pbellec pbellec added the bug label Apr 1, 2016
@pbellec pbellec added this to the release 2.0 milestone Apr 1, 2016
@pbellec pbellec self-assigned this Apr 1, 2016
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
Projects
None yet
Development

No branches or pull requests

2 participants