EDIT: sorted, haha, but the error mentioned at the bottom is still worth looking at, so I am renaming this to
ignore and continue: STOPPED ('The read operation timed out',)
old:
for each retweet, the image is redownloaded. And some of the #devcon tweets are retweeted dozens of times, so I have tons of duplicate images now.
Do you have any idea how to prevent that? Perhaps keep some sqlite of which tweet's image is already downloaded, and then ignore that image for all of its retweets?
For my one task, all the #devcon3 images, I will now just be patient and hope it won't stop prematurely with the error (*) described below; afterwards I will use a photo sorting program to remove the duplicates. But for general usefulness, it would be nice to have a switch -noretweeetimages which still prints the tweet texts, but does not download the image again.
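A minimal sketch of what such a switch could do, assuming the tool handles each tweet as a dict in the Twitter API v1.1 JSON shape, where a retweet carries the original tweet under `retweeted_status` and images are listed under `entities.media`. The names `handle_tweet`, `download_image` and `skip_retweet_images` are hypothetical and not taken from the tool:

```python
def is_retweet(tweet):
    """Return True if the tweet dict embeds an original tweet (v1.1 JSON shape)."""
    return "retweeted_status" in tweet


def handle_tweet(tweet, download_image, skip_retweet_images=True):
    """Print the tweet text; download its images unless it is a skipped retweet.

    `download_image` is a hypothetical callback taking one image URL.
    Returns True if the images were handed to the callback, False if skipped.
    """
    print(tweet.get("text", ""))
    if skip_retweet_images and is_retweet(tweet):
        return False  # text was printed, image download skipped
    for media in tweet.get("entities", {}).get("media", []):
        download_image(media["media_url_https"])
    return True
```

With the switch on, a retweet still shows up in the text output, but its image is only fetched when the original tweet itself is processed.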
But hey - great tool, I am very very happy.
mentioned error (*)
BobSummerwill: RT @slockitproject: USN Architecture Diagram - want to work with us? [email protected]. #devcon #ethereum #blockchain https://t.co/7hjpKUbjHU
Sun Nov 05 09:12:39 +0000 2017
*** STOPPED ('The read operation timed out',)
probably the gps lookup failed?
in most cases, we would like to (perhaps retry once, then) simply catch-and-ignore that exception; so I am waiting for it to happen again, to see which exact exception to catch: drandreaskrueger@0deb046
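A rough sketch of the retry-once-then-ignore idea. Which exception the tool actually raises is still unknown at this point, so the default tuple below (`socket.timeout`, `ssl.SSLError`) is only a guess based on the message text, and `call_with_retry` is a hypothetical helper name:

```python
import socket
import ssl
import time

# Assumption: the timeout surfaces as one of these; adjust once the real type is known.
TRANSIENT = (socket.timeout, ssl.SSLError)


def call_with_retry(func, *args, retries=1, delay=0.0, transient=TRANSIENT):
    """Call func(*args); on a transient error retry, then give up and return None."""
    for attempt in range(retries + 1):
        try:
            return func(*args)
        except transient as exc:
            if attempt < retries:
                time.sleep(delay)  # optional pause before the retry
                continue
            # Out of retries: report and carry on instead of stopping the run.
            print("*** IGNORED %r" % (exc,))
            return None
```

Any other exception type still propagates, so unexpected errors are not silently swallowed.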
Hi - Glad you are liking the tool. I'll follow up on your comments and patches in the next couple of days. Keep them coming!
Jonas
drandreaskrueger changed the title from "(feature request) retweets - do not re-download image - any ideas?" to "ignore and continue: STOPPED ('The read operation timed out',)" on Nov 5, 2017
The hashtag #devcon3 has already resulted in 450 images over the past 3 days, so I cannot afford to have the tool stop and restart from scratch. I would rather lose a few pics in between.
Idea: Store tweet IDs, and if a tweet has already been stored, skip it - then the tool could simply be restarted, and just those tweets are added which are not in the database yet.
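One way this could look, as a sketch only: a small SQLite table of tweet IDs that have already been handled. The file name `seen_tweets.sqlite` and the functions `open_seen_db` and `mark_if_new` are made up for illustration:

```python
import sqlite3


def open_seen_db(path="seen_tweets.sqlite"):
    """Open (or create) the database holding already-processed tweet IDs."""
    conn = sqlite3.connect(path)
    conn.execute("CREATE TABLE IF NOT EXISTS seen (tweet_id INTEGER PRIMARY KEY)")
    conn.commit()
    return conn


def mark_if_new(conn, tweet_id):
    """Record tweet_id; return True only the first time this ID is seen."""
    cur = conn.execute(
        "INSERT OR IGNORE INTO seen (tweet_id) VALUES (?)", (tweet_id,)
    )
    conn.commit()
    # rowcount is 1 when the row was inserted, 0 when the ID already existed.
    return cur.rowcount == 1
```

After a restart, the tool would call `mark_if_new` for every incoming tweet and only store and download when it returns True. Passing the ID of the original tweet for retweets would also avoid fetching the same image again.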