Voxii is a Python-based project that uses the ChatGPT API, the DeepL API, and VoiceVox to build a voice translation system. It combines speech recognition, translation, and voice synthesis: users record a spoken phrase in one language, the phrase is translated into another language, and VoiceVox converts the translated text into a synthetic character voice. Recording is controlled by a push-to-talk mechanism.
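
The flow described above (record, transcribe, translate, synthesize) can be pictured as a small chain of stages. The sketch below is only an illustration of that idea, not the project's actual code: `VoicePipeline` and its three callables are hypothetical names, and the stages are stand-ins so that it runs without any API key or a running VoiceVox.

```python
from dataclasses import dataclass
from typing import Callable


@dataclass
class VoicePipeline:
    """Chains the three stages; each stage is a plug-in callable (hypothetical design)."""

    transcribe: Callable[[bytes], str]   # recorded audio -> source-language text
    translate: Callable[[str], str]      # source-language text -> translated text
    synthesize: Callable[[str], bytes]   # translated text -> audio bytes

    def run(self, recorded_audio: bytes) -> bytes:
        text = self.transcribe(recorded_audio)
        if not text.strip():
            # Nothing was recognized, so there is nothing to translate or speak.
            return b""
        translated = self.translate(text)
        return self.synthesize(translated)


if __name__ == "__main__":
    # Stand-in stages: decode bytes, upper-case the text, encode it again.
    demo = VoicePipeline(
        transcribe=lambda audio: audio.decode("utf-8"),
        translate=lambda text: text.upper(),
        synthesize=lambda text: text.encode("utf-8"),
    )
    print(demo.run(b"hello"))
```

Keeping each stage behind a plain callable would make it easy to swap one speech recognition or translation service for another without touching the rest of the flow.
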
## Setup
- If you don't have Python installed, install it from python.org.
- Clone this repository.
- Navigate into the project directory:

      $ cd Aya_Voxii
- Create a new virtual environment and activate it:

      $ python -m venv venv
      $ . venv/bin/activate
- Install the requirements:

      $ pip install -r requirements.txt
- Create the environment variables file with your API keys, following the .env_example file. The env file needs the following API keys filled in:
  - a ChatGPT API key or a Deepgram API key
  - a DeepTranslate API key from RapidAPI
- Install VoiceVox and keep it running in the background.
- Run the app:

      $ python main.py
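
Before running the app, it can help to confirm that the env file actually contains the API keys listed in the setup steps. The sketch below is a minimal check under stated assumptions: the file name `.env` and the key names in `REQUIRED_KEYS` are hypothetical, so replace them with the names used in .env_example. It uses only the standard library.

```python
from pathlib import Path
from typing import Dict, List, Sequence

# Hypothetical key names; use the ones that appear in .env_example.
REQUIRED_KEYS = ("OPENAI_API_KEY", "DEEP_TRANSLATE_API_KEY")


def parse_env(text: str) -> Dict[str, str]:
    """Parse simple KEY=VALUE lines, skipping blanks, comments, and malformed lines."""
    values: Dict[str, str] = {}
    for raw in text.splitlines():
        line = raw.strip()
        if not line or line.startswith("#") or "=" not in line:
            continue
        key, _, value = line.partition("=")
        # Drop surrounding whitespace and optional quotes around the value.
        values[key.strip()] = value.strip().strip('"').strip("'")
    return values


def missing_keys(values: Dict[str, str], required: Sequence[str] = REQUIRED_KEYS) -> List[str]:
    """Return the required keys that are absent or empty."""
    return [key for key in required if not values.get(key)]


if __name__ == "__main__":
    env_path = Path(".env")  # assumed file name
    text = env_path.read_text(encoding="utf-8") if env_path.exists() else ""
    missing = missing_keys(parse_env(text))
    print("Missing keys:", ", ".join(missing) if missing else "none")
```
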
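
Since the app expects VoiceVox to be running in the background, a quick reachability check can save some debugging. The sketch below assumes the VoiceVox engine is listening at its usual local address, `http://127.0.0.1:50021`, and uses its `version`, `audio_query`, and `synthesis` HTTP endpoints. The helper names and the speaker id `1` are illustrative choices, not taken from this project.

```python
import urllib.error
import urllib.parse
import urllib.request

# Assumption: the VoiceVox engine runs locally on its usual port.
VOICEVOX_URL = "http://127.0.0.1:50021"


def endpoint(path: str, base_url: str = VOICEVOX_URL, **params) -> str:
    """Build an engine URL, appending URL-encoded query parameters if any are given."""
    url = base_url.rstrip("/") + "/" + path.lstrip("/")
    query = urllib.parse.urlencode(params)
    return f"{url}?{query}" if query else url


def is_running(base_url: str = VOICEVOX_URL, timeout: float = 2.0) -> bool:
    """Return True if the engine answers on its version endpoint."""
    try:
        with urllib.request.urlopen(endpoint("version", base_url), timeout=timeout) as resp:
            return resp.status == 200
    except (urllib.error.URLError, OSError):
        return False


def synthesize(text: str, speaker: int = 1, base_url: str = VOICEVOX_URL) -> bytes:
    """Two-step synthesis: request an audio query, then turn it into WAV bytes."""
    query_req = urllib.request.Request(
        endpoint("audio_query", base_url, text=text, speaker=speaker), method="POST"
    )
    with urllib.request.urlopen(query_req) as resp:
        audio_query = resp.read()  # JSON describing how the text should be spoken

    synth_req = urllib.request.Request(
        endpoint("synthesis", base_url, speaker=speaker),
        data=audio_query,
        headers={"Content-Type": "application/json"},
        method="POST",
    )
    with urllib.request.urlopen(synth_req) as resp:
        return resp.read()


if __name__ == "__main__":
    print("VoiceVox reachable:", is_running())
```

If `is_running()` reports `False`, start VoiceVox first and then launch the app again.
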