Add runbook for push image failures #80

Open
robrap opened this issue Jan 6, 2025 · 0 comments

robrap commented Jan 6, 2025

Notifications for these image push failures are currently sent to many teams, but it is unclear what needs to be done and who is responsible for fixing the issues. See:

If we create and point to a common runbook in Confluence, then:

  1. We can comment on how to deal with common errors (if any arise), and
  2. Based on which errors turn out to be common, we can determine who should primarily be fixing these issues. If the errors are typically not unique to a service, then maybe it makes sense for Arbi-BOM or SRE to own these notifications?

The runbook can provide a place for us to gather data, and for the team notified to better understand what they should do.

Note: When I got the email about the following error, https://github.com/edx/public-dockerfiles/actions/runs/12461004340/job/34779989143, it had already resolved elsewhere, so I ignored this issue. Maybe it was failing on:

56.83 Error: retrieving gpg key timed out.

I do not know whether re-running the job would have resolved the failure (I didn't need to, so I never re-ran it), and I don't know whether the job could retry the failing step so that this failure happens less often.
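To make the retry idea concrete, here is a minimal sketch of wrapping a flaky command in a retry with exponential backoff. It is not taken from the actual workflow: the helper name `run_with_retry`, its parameters, and the wrapped command are all hypothetical, and whether a retry would actually clear the gpg key timeout is untested.

```python
import subprocess
import sys
import time


def run_with_retry(cmd, attempts=3, base_delay=1.0, sleep=time.sleep):
    """Run cmd, retrying on a non-zero exit code with exponential backoff.

    Hypothetical helper for illustration only. Returns the number of the
    attempt that succeeded, or raises RuntimeError once all attempts fail.
    """
    for attempt in range(1, attempts + 1):
        result = subprocess.run(cmd)
        if result.returncode == 0:
            return attempt
        print(
            f"attempt {attempt}/{attempts} failed with exit code {result.returncode}",
            file=sys.stderr,
        )
        if attempt < attempts:
            # Wait base_delay, then 2x, then 4x, ... between attempts.
            sleep(base_delay * 2 ** (attempt - 1))
    raise RuntimeError(f"command failed after {attempts} attempts: {cmd}")
```

A job step could call this around whichever command fetches the key or builds the image, for example `run_with_retry(["docker", "build", "."])`, where the command shown is only a placeholder. The runbook could then record whether failures persist after the retries, which would help separate transient network errors from service-specific problems.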

@robrap robrap changed the title from "Add runbook with more push image failures" to "Add runbook for push image failures" Jan 6, 2025
@robrap robrap added this to Arbi-BOM Jan 6, 2025
@github-project-automation github-project-automation bot moved this to Todo in Arbi-BOM Jan 6, 2025