Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

HP DL360 with failing dimms gets HTTP 500 error #8

Open
brad-marshall opened this issue Mar 1, 2024 · 0 comments
Open

HP DL360 with failing dimms gets HTTP 500 error #8

brad-marshall opened this issue Mar 1, 2024 · 0 comments

Comments

@brad-marshall
Copy link

I have some systems with failed dimms that are getting a 500 error from the redfish exporter when being scraped, and we get the following errors in the logs:

2024-02-29 22:31:19 [FALCON] [ERROR] GET /health?job=redfish%2Fhardware&target=redacted => Traceback (most recent call last):
File "falcon/app.py", line 365, in falcon.app.App.call
File "/redfish_exporter/handler.py", line 103, in on_get
resp.body = generate_latest(registry)
File "/usr/local/lib/python3.10/dist-packages/prometheus_client/exposition.py", line 251, in generate_latest
for metric in registry.collect():
File "/redfish_exporter/collector.py", line 361, in collect
metrics.collect()
File "/redfish_exporter/collectors/health_collector.py", line 369, in collect
self.get_memory_health()
File "/redfish_exporter/collectors/health_collector.py", line 256, in get_memory_health
if type(dimm_info["Status"]) == str:
KeyError: 'Status'

This causes no data to be returned at all for the host. The ilo firmwares are currently downrev, I'll see what I can do about getting them upgraded to see if that helps.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant