Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[ENHANCEMENT] Enable CPU-only Deployment Option #117

Open
movchan74 opened this issue Jun 14, 2024 · 0 comments
Open

[ENHANCEMENT] Enable CPU-only Deployment Option #117

movchan74 opened this issue Jun 14, 2024 · 0 comments
Labels
enhancement New feature or request

Comments

@movchan74
Copy link
Contributor

Enhancement Description

The current implementation of example applications in the repository runs exclusively on GPU. While this is necessary for certain applications like chat_with_video or llama2, other applications such as whisper can run effectively on a CPU, albeit at a slower speed. The existing deployment configuration, found in deployments.py, mandates specifying the number of GPUs (num_gpus) for all deployments. This limitation necessitates code modification to run applications on a CPU, which is not ideal. This enhancement proposes adding a feature to override the num_gpus setting without altering the code.

Advantages

Allows users to run applications on either CPU or GPU without modifying the existing codebase.

Possible Implementation

  • Introduce a flag or an environment variable that, when set, automatically removes the num_gpus configuration. This could be implemented as follows:
    • Environment Variable: Create an environment variable, e.g., USE_CPU_ONLY. When this variable is set, the deployment script should remove the num_gpus setting.
    • Configuration Flag: Add a configuration flag, e.g., use_cpu_only. When this flag is enabled, the deployment logic should remove the num_gpus parameter.

By implementing this enhancement, the deployment process will become more adaptable, catering to a wider range of hardware configurations without necessitating direct code changes.

@movchan74 movchan74 added the enhancement New feature or request label Jun 14, 2024
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
enhancement New feature or request
Projects
None yet
Development

No branches or pull requests

1 participant