Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Add AMD GPU support #45

Open
erikhuck opened this issue Sep 24, 2024 · 1 comment
Open

Add AMD GPU support #45

erikhuck opened this issue Sep 24, 2024 · 1 comment

Comments

@erikhuck
Copy link
Collaborator

Currently, gpu-tracker assumes an Nvidia gpu and calls nvidia-smi. There should be an additional parameter for gpu_branch that accepts 'nvidia', 'amd', or None which is the default. If None, no GPU info is tracked, otherwise it tracks according to the specified branch. We may want to separate the GPU tracking into separate classes with NvidiaGPUTracker, AMDGPUTracker, and a base GPUTracker class. The common interface from the base class can be used by the Tracker class and it doesn't need to be concerned about which brand of GPU is used i.e. polymorphism.

To prototype GPU support, we'll need to access a cloud cluster with GPU, ideally with ssh. The National Science Foundation (NSF), via their ACCESS website, provides clusters called "resources" and the DELTA resource has AMD GPUs. I've asked Vikram for assistance with accessing this ACCESS resource.

@erikhuck
Copy link
Collaborator Author

erikhuck commented Oct 18, 2024

This CLI might be what we need as a replacement for nvidia-smi, though apparently it has a corresponding python package.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant