Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Improve CUDA device detection #280

Open
makortel opened this issue Mar 11, 2019 · 2 comments
Open

Improve CUDA device detection #280

makortel opened this issue Mar 11, 2019 · 2 comments

Comments

@makortel
Copy link

In #100 when SwitchProducerCUDA checks whether there are any CUDA devices on the system, it requires that all devices must have compute capability above a certain level. If any device is below, the switch decides that there are no CUDA devices.

There are many ways how this logic could be improved

  • Disable the use of only those devices under the threshold.
    • Would require communication of the devices to CUDAService, or duplication of the logic
  • Figure out the supported architectures from SCRAM CUDA flags
@fwyzard
Copy link

fwyzard commented Mar 13, 2019

An other option:

  • Try to run a simple kernel, and if that fails (because we did not compile for that gpu architecture) drop the device from the list of available device
    • this may try to initialise a device that we would otherwise avoid, and/or before it has been properly configured ?
    • to address that, is it enough to call cudaDeviceReset() before the explicit configuration and initialisation in the CUDAService ?

@fwyzard
Copy link

fwyzard commented Mar 14, 2019

See #286 .

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants