From 73611c9722c41987adb7d01d8a1eb40271b6fb19 Mon Sep 17 00:00:00 2001
From: Arik Alon
Date: Wed, 13 Nov 2024 09:07:28 -0700
Subject: [PATCH 1/2] test runbook

---
 docs/how-it-works/HighCheckoutErrorRate.rst | 13 +++++++++++++
 1 file changed, 13 insertions(+)
 create mode 100644 docs/how-it-works/HighCheckoutErrorRate.rst

diff --git a/docs/how-it-works/HighCheckoutErrorRate.rst b/docs/how-it-works/HighCheckoutErrorRate.rst
new file mode 100644
index 000000000..cae4e612f
--- /dev/null
+++ b/docs/how-it-works/HighCheckoutErrorRate.rst
@@ -0,0 +1,13 @@
+Troubleshooting runbook: HighCheckoutErrorRate
+##################################################
+
+This alert fires when the error rate on checkout requests is high
+
+Troubleshooting steps
+----------------------
+
+1. Check the account-service workload logs for errors
+2. Verify that the nginx-deployment is working properly
+3. Check that the redis workload is working properly
+
+If any of the above has an error, this is most likely the issue

From cc09e01cb66a7cdb98170d2140d9e6b885d1b23e Mon Sep 17 00:00:00 2001
From: Arik Alon
Date: Wed, 13 Nov 2024 09:30:56 -0700
Subject: [PATCH 2/2] test runbook

---
 docs/how-it-works/HighCheckoutErrorRate.rst | 5 +++--
 1 file changed, 3 insertions(+), 2 deletions(-)

diff --git a/docs/how-it-works/HighCheckoutErrorRate.rst b/docs/how-it-works/HighCheckoutErrorRate.rst
index cae4e612f..98f220a61 100644
--- a/docs/how-it-works/HighCheckoutErrorRate.rst
+++ b/docs/how-it-works/HighCheckoutErrorRate.rst
@@ -7,7 +7,8 @@ Troubleshooting steps
 ----------------------
 
 1. Check the account-service workload logs for errors
-2. Verify that the nginx-deployment is working properly
-3. Check that the redis workload is working properly
+2. Verify that the nginx-deployment is in a running state and doesn't have any related issues.
+   Investigate if this workload has any issue.
+3. Check that the redis workload is in a running state and doesn't have any related issues. Investigate if this workload has any issue.
 
 If any of the above has an error, this is most likely the issue
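
The three troubleshooting steps in the runbook could be scripted roughly as follows. This is a minimal sketch, not part of the patch: it assumes the workloads run as Kubernetes Deployments reachable through `kubectl`, and the namespace `default`, the 200-line log tail, and every function name here are hypothetical. Only the workload names come from the runbook.

```python
import json
import subprocess

# Workload names are taken from the runbook. The namespace, and the idea that
# each workload is a Kubernetes Deployment, are assumptions for illustration.
NAMESPACE = "default"
WORKLOADS = ["account-service", "nginx-deployment", "redis"]


def logs_command(workload, namespace=NAMESPACE, tail=200):
    """Build the kubectl command that fetches recent logs for a workload."""
    return ["kubectl", "logs", f"deployment/{workload}",
            "-n", namespace, f"--tail={tail}"]


def status_command(workload, namespace=NAMESPACE):
    """Build the kubectl command that returns the Deployment as JSON."""
    return ["kubectl", "get", "deployment", workload,
            "-n", namespace, "-o", "json"]


def is_healthy(deployment):
    """Return True when every desired replica reports ready."""
    desired = deployment.get("spec", {}).get("replicas", 1)
    # readyReplicas is omitted from the status when no replica is ready.
    ready = deployment.get("status", {}).get("readyReplicas", 0)
    return desired > 0 and ready >= desired


def error_lines(log_text):
    """Return the log lines that mention an error (case-insensitive)."""
    return [line for line in log_text.splitlines() if "error" in line.lower()]


def run(command):
    """Run a command and return its standard output as text."""
    result = subprocess.run(command, capture_output=True, text=True, check=True)
    return result.stdout


def triage():
    """Print a one-line health summary for each workload in the runbook."""
    for workload in WORKLOADS:
        healthy = is_healthy(json.loads(run(status_command(workload))))
        errors = error_lines(run(logs_command(workload)))
        print(f"{workload}: healthy={healthy}, error_lines={len(errors)}")
```

Calling `triage()` on a machine with cluster access prints one summary line per workload. A workload that is unhealthy or shows error lines is the first candidate to investigate, in line with the runbook's closing sentence.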