Hey, these are excellent questions. Let me know if this already clears things up.
-
Hello,
I'm in the process of testing out Pyrra to replace Sloth to monitor our SLOs.
I'm trying to define a very simple "availability" SLI, based on nginx requests. The spec looks like this:
That's a very simple example!
In Sloth, we used to have 2 "hacks":

1. Add `or on() vector(0)` to the "error" query: when a service has no errors, the original "error" query returns no samples, which produces empty metrics (instead of error = 0) and in turn makes a lot of dashboards rather useless. To compensate, we added `or on() vector(0)` to all the Sloth error queries.
2. In addition to the hack on the error query, we (sometimes) had the same hack on the "total" query, to compensate for services that don't receive requests all the time. In this situation, the original "total" query returns no samples, creating holes in the final metrics and dashboards. We tried to compensate for this by adding `or on() vector(1)` (described a bit in "NaN in SLO dashboard", slok/sloth#531 (comment)), although this one is a bit more doubtful.

I initially tried to configure the same with Pyrra, but by default, Pyrra returns a `ratio total metric is not a VectorSelector` error.

Thanks for your help!
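
For illustration, the two fallbacks described above would look roughly like the sketch below. The metric name `http_requests_total`, the `status` label, and the `5m` window are hypothetical placeholders, not the real queries from our setup.

```promql
# Hypothetical "error" query: if no 5xx series exists, the left-hand side
# is empty and `or on() vector(0)` supplies a single sample with value 0.
sum(rate(http_requests_total{status=~"5.."}[5m])) or on() vector(0)

# Hypothetical "total" query: if there is no traffic at all, the left-hand
# side is empty and `or on() vector(1)` supplies a single sample with value 1.
sum(rate(http_requests_total[5m])) or on() vector(1)
```

With both fallbacks in place, a window with no traffic evaluates to an error ratio of 0 / 1 = 0, so it is counted as fully available. That is the part that feels doubtful, since "no requests" is reported as "no failures".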