datalad crawl can't connect to s3 and produces changing error messages #47

Closed
TobiasKadelka opened this issue Jul 12, 2019 · 3 comments

TobiasKadelka commented Jul 12, 2019

Running "datalad crawl" on our server with the change from #46 (comment) applied produces this:

❱ datalad crawl  
[INFO   ] Loading pipeline specification from ./.datalad/crawl/crawl.cfg 
[INFO   ] Creating a pipeline for the hcp-openaccess bucket 
[INFO   ] Running pipeline [<datalad_crawler.nodes.s3.crawl_s3 object at 0x7f598e460a58>, switch(default=None, key='datalad_action', mapping=<<{'commit': <function A...>>, re=False)] 
[INFO   ] S3 session: Connecting to the bucket hcp-openaccess with authentication 
[ERROR  ] unorderable types: NoneType() < str() [s3.py:__call__:162] (TypeError) 
Exception ignored in: <bound method AnnexRepo.__del__ of <AnnexRepo path=/home/homeGlobal/tkadelka/hcp/hcp (<class 'datalad.support.annexrepo.AnnexRepo'>)>>
Traceback (most recent call last):
  File "/home/homeGlobal/tkadelka/env/datalad-crawler/datalad.git/datalad/support/annexrepo.py", line 365, in __del__
AttributeError: 'NoneType' object has no attribute 'debug'
Exception ignored in: <function WeakValueDictionary.__init__.<locals>.remove at 0x7f59e9414840>
Traceback (most recent call last):
  File "/home/homeGlobal/tkadelka/env/datalad-crawler/lib/python3.5/weakref.py", line 117, in remove
TypeError: 'NoneType' object is not callable

When I re-run "datalad crawl" without any changes, other error messages sometimes appear in addition to the one above, which is always displayed:

Additional Error 1

❱ datalad crawl
[INFO   ] Loading pipeline specification from ./.datalad/crawl/crawl.cfg
[INFO   ] Creating a pipeline for the hcp-openaccess bucket
[INFO   ] Running pipeline [<datalad_crawler.nodes.s3.crawl_s3 object at 0x7ff943fb0a58>, switch(default=None, key='datalad_action', mapping=<<{'annex': <datalad_cra...>>, re=False)]
[INFO   ] S3 session: Connecting to the bucket hcp-openaccess with authentication
[ERROR  ] unorderable types: NoneType() < str() [s3.py:__call__:162] (TypeError)
Exception ignored in: <bound method AnnexRepo.__del__ of <AnnexRepo path=/home/homeGlobal/tkadelka/hcp/hcp (<class 'datalad.support.annexrepo.AnnexRepo'>)>>
Traceback (most recent call last):
  File "/home/homeGlobal/tkadelka/env/datalad-crawler/datalad.git/datalad/support/annexrepo.py", line 365, in __del__
  File "/home/homeGlobal/tkadelka/env/datalad-crawler/datalad.git/datalad/dochelpers.py", line 328, in exc_str
AttributeError: 'NoneType' object has no attribute 'get'
Exception ignored in: <function WeakValueDictionary.__init__.<locals>.remove at 0x7ff99ef63840>
Traceback (most recent call last):
  File "/home/homeGlobal/tkadelka/env/datalad-crawler/lib/python3.5/weakref.py", line 117, in remove
TypeError: 'NoneType' object is not callable

Additional Error 2

> datalad crawl
[INFO   ] Loading pipeline specification from ./.datalad/crawl/crawl.cfg 
[INFO   ] Creating a pipeline for the hcp-openaccess bucket 
[INFO   ] Running pipeline [<datalad_crawler.nodes.s3.crawl_s3 object at 0x7ffa804fba58>, switch(default=None, key='datalad_action', mapping=<<{'commit': <function A...>>, re=False)] 
[INFO   ] S3 session: Connecting to the bucket hcp-openaccess with authentication 
[ERROR  ] unorderable types: NoneType() < str() [s3.py:__call__:162] (TypeError) 
Exception ignored in: <bound method AnnexRepo.__del__ of <AnnexRepo path=/home/homeGlobal/tkadelka/hcp/hcp (<class 'datalad.support.annexrepo.AnnexRepo'>)>>
Traceback (most recent call last):
  File "/home/homeGlobal/tkadelka/env/datalad-crawler/datalad.git/datalad/support/annexrepo.py", line 362, in __del__
  File "/home/homeGlobal/tkadelka/env/datalad-crawler/datalad.git/datalad/support/gitrepo.py", line 965, in __del__
  File "/home/homeGlobal/tkadelka/env/datalad-crawler/lib/python3.5/genericpath.py", line 19, in exists
AttributeError: 'NoneType' object has no attribute 'stat'

@yarikoptic
Member

All of these runs hit the same error:

[ERROR ] unorderable types: NoneType() < str() [s3.py:__call__:162] (TypeError)

Could you rerun with --dbg and then print the entire stack (bt)?

The ones from __del__ can be ignored; I will silence them when I get to my laptop.

@yarikoptic
Member

Oh, this one is probably a duplicate of #44, which reports [ERROR ] '<' not supported between instances of 'NoneType' and 'str' [s3.py:__call__:162] (TypeError), and for which @mih submitted #45, which I just merged. I will retitle this issue and close it with a fix for __del__.
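
For readers comparing the two messages: "unorderable types: NoneType() < str()" is how Python 3.5 words the TypeError that Python 3.6 and later report as "'<' not supported between instances of 'NoneType' and 'str'". Both mean that None was ordered against a string. The sketch below reproduces that failure and shows one generic way to make such a comparison safe. It is an illustration only, not the change made in #45; the helper names `none_safe_key` and `comparison_error_name` are hypothetical and do not come from datalad.

```python
def comparison_error_name():
    """Return the name of the exception raised by ordering None against a str."""
    try:
        None < "some/key"  # Python 3 refuses to order None against a string
    except TypeError as exc:
        return type(exc).__name__
    return "no error"


def none_safe_key(value):
    """Sort key (hypothetical helper) that places None before every string."""
    # Tuples compare element by element: False sorts before True, so None
    # entries come first and are never compared against a string directly.
    return (value is not None, value or "")


names = ["b.txt", None, "a.txt"]
sorted_names = sorted(names, key=none_safe_key)
print(comparison_error_name())
print(sorted_names)
```

Calling `sorted(names)` without the key raises the same TypeError as in the logs above, because the list mixes None with strings.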

@yarikoptic
Member

Actually, it is in the __del__ of datalad core, so there is nothing to be done here (I will fix it there). Closing.
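
As background on the "Exception ignored in: ... __del__" noise: while the interpreter shuts down, CPython may already have replaced module-level names with None by the time a finalizer runs. That would explain messages such as "'NoneType' object has no attribute 'debug'". A minimal sketch of a finalizer that tolerates this follows. The class `Resource` and the logger `lgr` here are assumptions made for illustration; this is not datalad's AnnexRepo code and not the fix that was applied in datalad core.

```python
import logging

# Module-level logger; during interpreter shutdown this name may become None.
lgr = logging.getLogger("example")


class Resource:
    """Hypothetical object that cleans up after itself when garbage collected."""

    def __init__(self):
        self.closed = False

    def close(self):
        self.closed = True
        lgr.debug("resource closed")  # fails if `lgr` was already torn down

    def __del__(self):
        try:
            self.close()
        except Exception:
            # Swallow errors caused by globals that have already been torn
            # down; there is nothing useful left to do at that point.
            pass


res = Resource()
res.close()
print(res.closed)
```

The trade-off is that a blanket `except Exception` in `__del__` also hides genuine cleanup bugs, so calling an explicit `close()` before shutdown is the more reliable path.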
