Skip to content
This repository has been archived by the owner on May 1, 2019. It is now read-only.

Timeout threshold might be too low ? #43

Open
mbanani opened this issue Sep 25, 2017 · 5 comments
Open

Timeout threshold might be too low ? #43

mbanani opened this issue Sep 25, 2017 · 5 comments

Comments

@mbanani
Copy link

mbanani commented Sep 25, 2017

I keep getting this error:

Traceback (most recent call last): File "train_viewpoint.py", line 495, in <module> main(args) File "train_viewpoint.py", line 23, in main cc = CrayonClient("focus.eecs.umich.edu") File "/z/home/mbanani/PyTorch2Env/local/lib/python2.7/site-packages/pycrayon/crayon.py", line 29, in __init__ " Server sent: {}.".format(r.text)) RuntimeError: Something went wrong! Server sent: Server: TensorBoard failed to answer request 'logdir'. Done

and I think the reason is that the timeout, specified here is too low when people are training on clusters and trying to connect to a docker on their personal machines. Maybe consider increasing the timeout ?

@mbanani
Copy link
Author

mbanani commented Sep 25, 2017

Issue was caused by something else.

@mbanani mbanani closed this as completed Sep 25, 2017
@vakker
Copy link

vakker commented Oct 16, 2017

I have the same error. Could you elaborate a bit more on what was causing this for you?

@mbanani mbanani reopened this Oct 16, 2017
@mbanani
Copy link
Author

mbanani commented Oct 16, 2017

So I closed the issue before because I thought it was caused by running within a virtual environment, but the issue reappeared and I forgot to reopen the issue.

I am not really sure what's causing the issue, and I moved to using tensorboard directly, as shown here, which proved simpler to use for my purposes.

As for the error, I found that having a firewall that blocks some of the more common network ports or having many experiments running at the same time resulted in the error being produced more often. It would still run from time to time thought, which I found pretty weird, so my guess is that it's caused by the network taking too long to respond. Sorry this isn't more helpful

@vakker
Copy link

vakker commented Oct 18, 2017

Thanks for the info, I'll investigate a bit further. I'm using Crayon in a Docker container, not sure if it has an effect.

@motiwari
Copy link

I had the same issue; for a quick and dirty workaround I simply commented out https://github.com/torrvision/crayon/blob/master/client/python/pycrayon/crayon.py#L32-L44 and it's working now.

Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants