Maintenance 09 2024 #39
* Update to 3.12 in .python-version, Dockerfile, and Pipfile
* Add help command to Makefile
* Update and reorder dependencies in Pipfile
* Update pyproject.toml
Before diving into the rest of the PR (which looks great at a glance; I'm not anticipating any requests there), I did have one request.
While attempting to run a local harvest test via:
make dist-local
make run-harvest-local
It was discovered that the testing configurations here are pointing at a version of the website no longer available.
A more stable version was shared, which would allow updating that YAML to the following:
generateCDX: true
generateWACZ: true
text: to-pages
# prevent PAGES from getting crawled; scoping
exclude:
- ".*lib.mit.edu/search/.*"
- ".*mit.primo.exlibrisgroup.com/.*"
# prevent RESOURCES / ASSETS from getting retrieved; URL requests
blockRules:
- ".*googlevideo.com.*"
- ".*cdn.libraries.mit.edu/media/.*"
- "\\.(jpg|png)$"
depth: 1
maxPageLimit: 20
timeout: 30
scopeType: "domain"
seeds:
- url: https://www-test.libraries.mit.edu/sitemap.xml
  sitemap: https://www-test.libraries.mit.edu/sitemap.xml
Can we make this change? Thanks!
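As a quick sanity check on the exclude and blockRules patterns above, here is a minimal Python sketch. It assumes the crawler treats each pattern as an unanchored regular expression searched anywhere in the URL, which this thread does not confirm. The sample URLs and the helper name matches_any are hypothetical, chosen only for illustration.

```python
import re

# Patterns copied from the proposed YAML. The double-quoted YAML string
# "\\.(jpg|png)$" corresponds to the regex \.(jpg|png)$
EXCLUDE = [
    r".*lib.mit.edu/search/.*",
    r".*mit.primo.exlibrisgroup.com/.*",
]
BLOCK_RULES = [
    r".*googlevideo.com.*",
    r".*cdn.libraries.mit.edu/media/.*",
    r"\.(jpg|png)$",
]


def matches_any(url: str, patterns: list[str]) -> bool:
    """Return True if any pattern is found in the URL.

    Assumption: search semantics (a match anywhere in the string),
    not a full-string match.
    """
    return any(re.search(pattern, url) for pattern in patterns)


if __name__ == "__main__":
    # Hypothetical URLs, for illustration only.
    samples = [
        "https://lib.mit.edu/search/?q=maps",
        "https://www-test.libraries.mit.edu/about/",
        "https://cdn.libraries.mit.edu/media/intro.mp4",
        "https://www-test.libraries.mit.edu/logo.png",
        "https://www-test.libraries.mit.edu/logo.png?v=2",
    ]
    for url in samples:
        print(url, matches_any(url, EXCLUDE), matches_any(url, BLOCK_RULES))
```

Under that assumption, the trailing $ in the image rule means a URL ending in a query string, such as logo.png?v=2, would not be blocked. The unescaped dots in the other patterns match any character, so they are slightly broader than the literal hostnames.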
@ghukill Good catch, made the change!
Approved! Able to run a local crawl/harvest, all looks good.
Purpose and background context
Updates app according to our maintenance week documentation.
How can a reviewer manually see the effects of these changes?
Run make test and make lint to confirm they still pass.
Includes new or updated dependencies?
YES
Changes expectations for external applications?
NO
Developer
Code Reviewer(s)