Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[consul] Added maintenance status for metrics #2496

Closed
wants to merge 1 commit into from

Conversation

brumschlag
Copy link

When a node is in 'maintenance' mode, it is a different measure than if a service is critical.

This PR adds a 'maintenance' status -- it checks for a check named _node_maintenance. If it is critical, it marks the node and services as 'maintenance' status instead of critical.

This is helpful for cases when new nodes come up in a maintenance mode, ready to be turned on.
Also useful for alerting only on 'critical' cases - not 'maintenance'.

@truthbk truthbk changed the title Added maintenance status for metrics [consul] Added maintenance status for metrics Jan 30, 2017
@gmmeyer gmmeyer self-assigned this Jan 31, 2017
@gmmeyer
Copy link
Contributor

gmmeyer commented Feb 7, 2017

@brumschlag, thank you very much for your contribution. It's very much appreciated!

Unfortunately, we're not going to be able to add any more new features before the next release. It takes too long to test all of them, especially when we're in the middle of moving to a new model for how we package and distribute checks. This feature makes sense, however, and I think we'd happily add it into the check afterwards.

@gmmeyer gmmeyer assigned irabinovitch and unassigned gmmeyer Feb 7, 2017
diogokiss pushed a commit to diogokiss/integrations-core that referenced this pull request Mar 16, 2018
Currently, if a node in Consul is set into maintenance state, it is reported
as a node in critical state to Datadog metrics. This makes it confusing to
determine by the metric whether it is a node failing with problems or an planned
intervention. A user made a PR to Datadog some time ago, but it was not merged due to
Datadog code organization changes and got forgotten (DataDog/dd-agent#2496).
I'm pushing the change forward. I've tested it using dd-agent
version 5.8.0.

This PR also depends on DataDog/dd-agent#3708
diogokiss pushed a commit to diogokiss/integrations-core that referenced this pull request Mar 16, 2018
Currently, if a node in Consul is set into maintenance state, it is reported
as a node in critical state to Datadog metrics. This makes it confusing to
determine by the metric whether it is a node failing with problems or an planned
intervention. A user made a PR to Datadog some time ago, but it was not merged due to
Datadog code organization changes and got forgotten (DataDog/dd-agent#2496).
I'm pushing the change forward. I've tested it using dd-agent
version 5.8.0.

This PR also depends on DataDog/dd-agent#3708
diogokiss pushed a commit to diogokiss/integrations-core that referenced this pull request Mar 19, 2018
Currently, if a node in Consul is set into maintenance state, it is reported
as a node in critical state to Datadog metrics. This makes it confusing to
determine by the metric whether it is a node failing with problems or an planned
intervention. A user made a PR to Datadog some time ago, but it was not merged due to
Datadog code organization changes and got forgotten (DataDog/dd-agent#2496).
I'm pushing the change forward. I've tested it using dd-agent
version 5.8.0.

This PR also depends on DataDog/dd-agent#3708
diogokiss pushed a commit to diogokiss/integrations-core that referenced this pull request Mar 19, 2018
Currently, if a node in Consul is set into maintenance state, it is reported
as a node in critical state to Datadog metrics. This makes it confusing to
determine by the metric whether it is a node failing with problems or an planned
intervention. A user made a PR to Datadog some time ago, but it was not merged due to
Datadog code organization changes and got forgotten (DataDog/dd-agent#2496).
I'm pushing the change forward. I've tested it using dd-agent
version 5.8.0.

This PR also depends on DataDog/dd-agent#3708
diogokiss pushed a commit to diogokiss/integrations-core that referenced this pull request Mar 19, 2018
Currently, if a node in Consul is set into maintenance state, it is reported
as a node in critical state to Datadog metrics. This makes it confusing to
determine by the metric whether it is a node failing with problems or an planned
intervention. A user made a PR to Datadog some time ago, but it was not merged due to
Datadog code organization changes and got forgotten (DataDog/dd-agent#2496).
I'm pushing the change forward. I've tested it using dd-agent
version 5.8.0.

This PR also depends on DataDog/dd-agent#3708
@masci
Copy link
Contributor

masci commented Apr 12, 2018

closing as superseded by DataDog/integrations-core#1267

@masci masci closed this Apr 12, 2018
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Projects
None yet
Development

Successfully merging this pull request may close these issues.

5 participants