Add role to install NVIDIA DOCA on top of an existing "fat" image #492
Fat image build: https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/12277251098. NB: doca workflows fail here b/c previous fat image has
Note the above didn't actually build DOCA for some reason, even though the workflow ran with the right groups.
Tested nightly build here: https://github.com/stackhpc/ansible-slurm-appliance/actions/runs/12280868395
Packer and Workflows LGTM
Docs LGTM
Adds new `doca` role to install NVIDIA DOCA, applied when `inventory_groups` includes `doca`.
This role should be preferred over the `ofed` role, which may be deprecated at a later date.
NB: This uses doca packages from upstream repos, not StackHPC's ark.
Adds a workflow to test the DOCA build during CI on the current RL8/RL9 fat images.
Unless run manually, the built image is deleted on completion.
NB: the resulting DOCA image is not tested by CI.
Simplifies the configuration for packer builds: groups are now set via `inventory_groups`, taking a comma-separated list (instead of `groups`, which took a map).
All image build workflows have been adjusted to use the new packer configuration.
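As a rough illustration of the comma-separated form, the sketch below splits such a value into group names and checks whether `doca` is among them. It is not code from this PR: the helper names `parse_inventory_groups` and `role_enabled` and the example group names other than `doca` are hypothetical, and the appliance's actual handling of `inventory_groups` may differ.

```python
def parse_inventory_groups(value: str) -> list[str]:
    """Split a comma-separated group string into a list of group names.

    Hypothetical helper: whitespace around names is ignored and empty
    entries (e.g. from a trailing comma) are dropped.
    """
    return [name.strip() for name in value.split(",") if name.strip()]


def role_enabled(value: str, group: str = "doca") -> bool:
    """Return True if `group` appears in the comma-separated list."""
    return group in parse_inventory_groups(value)


if __name__ == "__main__":
    # "update" and "openhpc" are placeholder group names for illustration only.
    example = "update, openhpc, doca"
    print(parse_inventory_groups(example))
    print(role_enabled(example))
```

A single string of this kind is easier to pass as one workflow input or command-line variable than a map, which is presumably part of the motivation for the change.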
The following manually-triggered checks related to packer configuration changes have been completed:
Ticket: PLATFORM-537