-
Notifications
You must be signed in to change notification settings - Fork 3
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
[bug] intermittent 409 Client Error
message appears during the final create_stage_index
(harvest dag) & publish_collection
(publish dag) tasks; re-running the step in Airflow runs in success
#1095
Labels
Comments
gamontoya
changed the title
[bug] UCSD re-harvesting issue at
[bug] UCSD re-harvesting issue at Aug 12, 2024
create_stage_index
task instance (32 collections)create_stage_index
task instance
christinklez
changed the title
[bug] UCSD re-harvesting issue at
[bug] intermittent Aug 13, 2024
create_stage_index
task instance409 Client Error
message appears during the final create_stage_index
(harvest dag) & publish_collection
(publish dag) tasks; re-running the step runs in success
christinklez
changed the title
[bug] intermittent
[bug] intermittent Aug 13, 2024
409 Client Error
message appears during the final create_stage_index
(harvest dag) & publish_collection
(publish dag) tasks; re-running the step runs in success409 Client Error
message appears during the final create_stage_index
(harvest dag) & publish_collection
(publish dag) tasks; re-running the step in Airflow runs in success
This is an issue with OpenSearch not quite indexing documents fast enough, but as long as the counts are the same, it is harmless. Adding some more logging output here for clarification:
|
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment
Example error message:
Page to the first attempt for the log:
Example from Harvest DAG
create_stage_index
https://7a8067cb-3b99-477e-a883-7e311175a9b4.c3.us-west-2.airflow.amazonaws.com/dags/harvest_collection/grid?dag_run_id=manual__2024-08-07T20%3A42%3A49%2B00%3A00&num_runs=365&task_id=create_stage_index&tab=logs
[2024-08-07, 21:17:05 UTC] {{standard_task_runner.py:104}} ERROR - Failed to execute job 148437 for task create_stage_index (409 Client Error: Conflict for url: https://search-rikolti-2-xxbcriyfw5iqysaj7p3fhhscae.us-west-2.es.amazonaws.com/rikolti-stg-2024-07-11-t15_35_50/_delete_by_query; 1593)
Example from Publish DAG
publish_collection
https://7a8067cb-3b99-477e-a883-7e311175a9b4.c3.us-west-2.airflow.amazonaws.com/dags/publish_collection/grid?dag_run_id=manual__2024-08-12T20%3A34%3A21%2B00%3A00&task_id=publish_collection&num_runs=365&tab=logs
[2024-08-12, 21:10:03 UTC] {{standard_task_runner.py:104}} ERROR - Failed to execute job 149566 for task publish_collection (409 Client Error: Conflict for url: https://search-rikolti-2-xxbcriyfw5iqysaj7p3fhhscae.us-west-2.es.amazonaws.com/rikolti-prd-2024-07-11-t16_19_42/_delete_by_query; 7953)
==
Some notes:
Gabriela initiated batches of 100 collections from the Registry, and experienced that most would run successfully, while some would fail at the final step.
From Airflow, re-running the final task resulted in success.
These steps apply to both the Harvest (to -stage) DAG as well as the Publish (to -prod) DAG.
Another note:
Gabriela initiated 3 collections from the registry, to publish collections to -prod, and also experienced this error. So it seems that this error is not just a huge batch size issue. (Re-running the final task in Airflow works fine though.)
The text was updated successfully, but these errors were encountered: