This add-on designed to seamlessly integrate the capabilities of the Open AI API into your workflow. Whether you're looking to craft comprehensive text, translate passages with precision, concisely summarize documents, or even interpret and describe visual content, this add-on does it all with ease.
- Navigate to the releases page to find the latest version of the add-on.
- Download the latest release from the provided link.
- Execute the installer to add the add-on to your NVDA environment.
In order to utilize the full functionality of the OpenAI NVDA add-on, an API key from OpenAI is required. Follow these steps to set it up:
- Acquire an API key by registering for an OpenAI account at https://platform.openai.com/api-keys.
- With the API key ready, you have two options for configuration:
- Through the NVDA settings dialog:
- Access the NVDA menu and navigate to the "Preferences" submenu.
- Open the "Settings" dialog and select the "Open AI" category.
- Input your API key in the provided field and click "OK" to confirm.
- Using environment variables:
- Press
Windows+Pause
to open System Properties. - Click on "Advanced system settings" and select "Environment Variables".
- Create a new variable under "User variables":
- Click on "New".
- Enter
OPENAI_API_KEY
as the variable name and paste your API key as the value.
- Click "OK" to save your changes.
- Press
- Through the NVDA settings dialog:
You are now equipped to explore the features of the OpenAI NVDA add-on!
The functionality of the add-on is housed within a central dialog that can be opened using the shortcut NVDA+g
. This dialog provides access to the majority of the add-on's features, enabling you to:
- Engage in conversation with the AI model.
- Get descriptions of images from image files.
- Transcribe spoken content from audio files or through a microphone.
- Use the text-to-speech feature to vocalize written text in the prompt.
Some commands are available in the main dialog for different elements.
- When the Prompt Field is focused:
Ctrl+Enter
: Submit the text you've entered.Ctrl+Up Arrow
: Retrieve and place the most recently entered prompt into the current field for review or re-use.
- When the History Field is focused:
Alt+Right Arrow
: Copy the user's text to the prompt.Alt+Left Arrow
: Copy the assistant's response to the system.Ctrl+C
: Copy the assistant's response or the user's text depending on the cursor's position.Ctrl+Shift+Up Arrow
: Move to the text block of the user or assistant above the current block.Ctrl+Shift+Down Arrow
: Move to the text block of the user or assistant below the current block.
These commands can be used to trigger actions from anywhere on your computer. It is possible to reassign them from Input Gestures dialog under Open AI category.
NVDA+e
: Take a screenshot and describe it.NVDA+o
: Grab the current navigator object and describe it.NVDA+:
: Toggle the microphone recording and transcribe the audio from anywhere.
The add-on comes bundled with the following essential dependencies:
- OpenAI: The official Python library for the openai API.
- MSS: An ultra fast cross-platform multiple screenshots module in pure python using ctypes.
- Pillow: The user-friendly fork of the Python Imaging Library, used for image resizing.
- sounddevice: Play and Record Sound with Python.