Prompt Example

Speech

Text-To-Speech

Input Example : Generate a speech with text "here we go"
Output:

Audio:

Style Transfer Text-To-Speech

First upload your audio(.wav)
Input Example : Speak using the voice of this audio. The text is "here we go".
Output:

Speech Recognition

First upload your audio(.wav)
Audio Example :

Input Example : Generate the text of this speech
Output:

Sing

Text-To-Sing

Input example : please generate a piece of singing voice. Text sequence is 小酒窝长睫毛AP是你最美的记号. Note sequence is C#4/Db4 | F#4/Gb4 | G#4/Ab4 | A#4/Bb4 F#4/Gb4 | F#4/Gb4 C#4/Db4 | C#4/Db4 | rest | C#4/Db4 | A#4/Bb4 | G#4/Ab4 | A#4/Bb4 | G#4/Ab4 | F4 | C#4/Db4. Note duration sequence is 0.407140 | 0.376190 | 0.242180 | 0.509550 0.183420 | 0.315400 0.235020 | 0.361660 | 0.223070 | 0.377270 | 0.340550 | 0.299620 | 0.344510 | 0.283770 | 0.323390 | 0.360340.
Output:

Audio:

Audio

Text-To-Audio

Input Example : Generate an audio of a piano playing
Output:

Audio:

Audio Inpainting

First upload your audio(.wav)
Audio Example :

Input Example : I want to inpaint this audio.
Output:

Then you can press the "Predict Masked Place" button
Output:

Output Audio:

Image-To-Audio

First upload your image(.png)
Input Example : Generate the audio of this image
Output:

Audio:

Audio-To-Text

First upload your audio(.wav)
Audio Example :

Input Example : Please tell me the text description of this audio.
Output:

Sound Detection

First upload your audio(.wav)
Audio Example :

Input Example : What events does this audio include?
Output:

Mono audio to Binaural Audio

First upload your audio(.wav)

Input Example: Transfer the mono speech to a binaural one audio.
Output:

Target Sound Detection

Fisrt upload your audio(.wav)

Input Example: please help me detect the target sound in the audio based on desription: “I want to detect Applause event”
Output:

Sound Extraction

First upload your audio(.wav)

Input Example: Please help me extract the sound events from the audio based on the description: "a person shouts nearby and then emergency vehicle sirens sounds"
Output:

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

README.md

README.md

Prompt Example

Speech

Text-To-Speech

Style Transfer Text-To-Speech

Speech Recognition

Sing

Text-To-Sing

Audio

Text-To-Audio

Audio Inpainting

Image-To-Audio

Audio-To-Text

Sound Detection

Mono audio to Binaural Audio

Target Sound Detection

Sound Extraction

Files

README.md

Latest commit

History

README.md

File metadata and controls

Prompt Example

Speech

Text-To-Speech

Style Transfer Text-To-Speech

Speech Recognition

Sing

Text-To-Sing

Audio

Text-To-Audio

Audio Inpainting

Image-To-Audio

Audio-To-Text

Sound Detection

Mono audio to Binaural Audio

Target Sound Detection

Sound Extraction