Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Clean up temporary files underway #252

Open
tokee opened this issue Jun 7, 2021 · 5 comments · Fixed by #254
Open

Clean up temporary files underway #252

tokee opened this issue Jun 7, 2021 · 5 comments · Fixed by #254

Comments

@tokee
Copy link
Collaborator

tokee commented Jun 7, 2021

It seems that calling warc-indexer with thousands of WARC-files causes the tmp folder to fill up (maybe due to DROID temporary files). It should possible to clean up underway.

@anjackson
Copy link
Contributor

I think this likely relates to this issue: openpreserve/nanite#36

Unfortunately, the pull was full of whitespace changes and I couldn't work out what was happening. I'll have to try and fix it up.

@anjackson
Copy link
Contributor

Hm, also openpreserve/nanite#40 and this part of the code seems to be a bit of a mess as those two pulls were a bit out of sync, so I'll try to tidy up.

@anjackson
Copy link
Contributor

Well, that was messy, but I think the Nanite code is better now. Just released 1.4.1-97 and will update this project when it becomes available.

@anjackson
Copy link
Contributor

Actually lets leave this open until we've proved the Nanite update resolved the issue.

@anjackson anjackson added this to the 3.1.1 Bugfix release milestone Aug 3, 2022
@anjackson
Copy link
Contributor

Note that Tika < 1.25 has also been reported as generating a lot of tmp files (https://issues.apache.org/jira/browse/TIKA-3203) so that might also be the issue. I've updated to 1.28.5 and I'm looking at getting to Tika 2.7.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants