Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

slurm-web-agent not working on rhel 8 #419

Closed
jonoharms opened this issue Dec 3, 2024 · 6 comments · Fixed by #420
Closed

slurm-web-agent not working on rhel 8 #419

jonoharms opened this issue Dec 3, 2024 · 6 comments · Fixed by #420
Assignees
Labels
bug This was supposed to work in this case and clearly it is not.
Milestone

Comments

@jonoharms
Copy link

I have been trying to get slurm-web working on rhel8. I have successfully installed everything using the rackslab repo. Racksdb and slurm-web-gateway both seem to be working fine. However, the slurm-web-agent will not start. It seems like the python3-werkzeug package that is installed is not recent enough, because it does not contain werkzeug.middleware. The installed version is 0.12.2.

werkzeug.middleware was added in 0.15

This is the output from sudo journalctl -u slurm-web-agent:

Dec 03 16:19:57 hostname systemd[1]: Started Slurm-web HPC dashboard agent.
Dec 03 16:19:57 hostname python3.6[267350]: detected unhandled Python exception in '/usr/libexec/slurm-web/slurm-web-agent'
Dec 03 16:19:57 hostname slurm-web-agent[267350]: Traceback (most recent call last):
Dec 03 16:19:57 hostname slurm-web-agent[267350]:   File "/usr/libexec/slurm-web/slurm-web-agent", line 11, in <module>
Dec 03 16:19:57 hostname slurm-web-agent[267350]:     load_entry_point('Slurm-web==4.0.0', 'console_scripts', 'slurm-web-agent')()
Dec 03 16:19:57 hostname slurm-web-agent[267350]:   File "/usr/lib/python3.6/site-packages/pkg_resources/__init__.py", line 476, in load_entry_point
Dec 03 16:19:57 hostname slurm-web-agent[267350]:     return get_distribution(dist).load_entry_point(group, name)
Dec 03 16:19:57 hostname slurm-web-agent[267350]:   File "/usr/lib/python3.6/site-packages/pkg_resources/__init__.py", line 2700, in load_entry_point
Dec 03 16:19:57 hostname slurm-web-agent[267350]:     return ep.load()
Dec 03 16:19:57 hostname slurm-web-agent[267350]:   File "/usr/lib/python3.6/site-packages/pkg_resources/__init__.py", line 2318, in load
Dec 03 16:19:57 hostname slurm-web-agent[267350]:     return self.resolve()
Dec 03 16:19:57 hostname slurm-web-agent[267350]:   File "/usr/lib/python3.6/site-packages/pkg_resources/__init__.py", line 2324, in resolve
Dec 03 16:19:57 hostname slurm-web-agent[267350]:     module = __import__(self.module_name, fromlist=['__name__'], level=0)
Dec 03 16:19:57 hostname slurm-web-agent[267350]:   File "/usr/lib/python3.6/site-packages/slurmweb/exec/agent.py", line 14, in <module>
Dec 03 16:19:57 hostname slurm-web-agent[267350]:     from ..apps.agent import SlurmwebAppAgent
Dec 03 16:19:57 hostname slurm-web-agent[267350]:   File "/usr/lib/python3.6/site-packages/slurmweb/apps/agent.py", line 13, in <module>
Dec 03 16:19:57 hostname slurm-web-agent[267350]:     from werkzeug.middleware import dispatcher
Dec 03 16:19:57 hostname slurm-web-agent[267350]: ModuleNotFoundError: No module named 'werkzeug.middleware'
Dec 03 16:19:57 hostname systemd[1]: slurm-web-agent.service: Main process exited, code=exited, status=1/FAILURE
Dec 03 16:19:57 hostname systemd[1]: slurm-web-agent.service: Failed with result 'exit-code'.
Dec 03 16:19:57 hostname systemd[1]: slurm-web-agent.service: Service RestartSec=100ms expired, scheduling restart.
Dec 03 16:19:57 hostname systemd[1]: slurm-web-agent.service: Scheduled restart job, restart counter is at 1.
Dec 03 16:19:57 hostname systemd[1]: Stopped Slurm-web HPC dashboard agent.
Dec 03 16:19:57 hostname systemd[1]: Started Slurm-web HPC dashboard agent.
@rezib
Copy link
Contributor

rezib commented Dec 4, 2024

Hello @jonoharms, thank you very much for reporting! I consider this a serious bug.

I will work on a fix ASAP.

@rezib rezib self-assigned this Dec 4, 2024
@rezib rezib added the bug This was supposed to work in this case and clearly it is not. label Dec 4, 2024
@rezib rezib added this to the v4.1.0 milestone Dec 4, 2024
rezib added a commit that referenced this issue Dec 4, 2024
Install werkzeug library in the version distributed in el8, in order to
be as close as this environment as possible and try reproduce #419.
rezib added a commit that referenced this issue Dec 4, 2024
Install Werkzeug library in the version distributed in el8, in order to
be as close as possible to this environment and reproduce #419.

This also requires to downgrade Flask to version <1.0 to avoid pulling
more recent version of Werkzeug.
@rezib
Copy link
Contributor

rezib commented Dec 4, 2024

I managed to reproduce it in CI.

@rezib
Copy link
Contributor

rezib commented Dec 4, 2024

@jonoharms, I just published in the repo RPM packages 4.0.0-2 for el8 with the patch developed in #420 included. Can you please try to update and confirm it works for you?

@jonoharms
Copy link
Author

Thanks rezib, I will try when I'm back at work tomorrow morning (Australian time).

@jonoharms
Copy link
Author

it is working now! thankyou very much for the quick turnaround.

@rezib
Copy link
Contributor

rezib commented Dec 5, 2024

Thank you @jonoharms for this confirmation, and thanks again for reporting the issue.

I will let the issue open until #420 is merged and the fix lands in next release.

rezib added a commit that referenced this issue Jan 2, 2025
Install Werkzeug library in the version distributed in el8, in order to
be as close as possible to this environment and reproduce #419.

This also requires to downgrade Flask to version <1.0 to avoid pulling
more recent version of Werkzeug.
rezib added a commit that referenced this issue Jan 2, 2025
Install Werkzeug library in the version distributed in el8, in order to
be as close as possible to this environment and reproduce #419.

This also requires to downgrade Flask to version <1.0 to avoid pulling
more recent version of Werkzeug.
@rezib rezib closed this as completed in 4bed033 Jan 2, 2025
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
bug This was supposed to work in this case and clearly it is not.
Projects
None yet
Development

Successfully merging a pull request may close this issue.

2 participants