Any way to tie this to Ooba or Silly Tavern? #57
Replies: 4 comments 3 replies
-
It should be possible; the APIs just need to be connected. I haven't used
ooba recently. Do you know if TTS is built in, or is it a plugin?
Also, some settings will probably need to change, such as CPU offloading,
assuming that ooba eats up most of the VRAM.
Just a quick mention: tortoise can be run quicker. With the new options, if
you set samples to 4 and iterations to 4-16, you'll get much faster results.
(Iterations improve output quality more cheaply than samples.)
The quality does drop, though, and bark, when it behaves properly, is more
impressive.
On Sat, Jul 1, 2023, 6:37 AM St33lMouse wrote:
Is there any way to get Ooba or Silly Tavern to use voices from this repo?
The Bark voice cloning here is excellent. If this repo can act as the voice
output for Ooba or Silly Tavern, we'd finally have the ability to output
tts in Ooba and have the AI read to you in any voice you want quickly and
at high quality. Same goes for Silly Tavern. You could give any Silly
Tavern character the voice of your choosing.
All other efforts have been limited so far. You can't clone voices in
Silero, Tortoise is slow, etc. This repo is the magic ingredient to making
characters come alive.
-
Both Ooba and Silly Tavern have voice. Ooba has Silero, ElevenLabs, and a Bark extension here: https://github.com/wsippel/bark_tts I don't know if the extension uses the same version of Bark you are using. Quality seems different, but maybe that's just me. There are other ooba voice projects underway, including a Coqui and Tortoise project. I don't think it has been merged to main yet, and it's been a while. Silly Tavern has Silero and maybe a couple of other choices, but not Bark. Bark is clearly the strongest. An API to Silly Tavern would be most welcome. I'll play around with Tortoise on your repo. There is a repo that made voice cloning possible with Tortoise by finetuning the voice models here: https://git.ecker.tech/mrq/ai-voice-cloning Quality was quite good, with a strong resemblance, but it was slower than Bark.
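For anyone wondering what "an API" between a chat front end and a TTS server might look like, here is a minimal sketch, not something this repo ships. The URL, the JSON field names (`text`, `voice`), and the assumption that the server answers with raw audio bytes are all hypothetical; a real integration would have to match whatever endpoint the TTS side actually exposes.

```python
import json
import urllib.request

# Hypothetical endpoint: neither this URL nor the payload fields come from the repo.
TTS_URL = "http://127.0.0.1:8000/hypothetical/tts"


def build_tts_request(text: str, voice: str, url: str = TTS_URL) -> urllib.request.Request:
    """Build (but do not send) a POST request asking a TTS server to speak `text`."""
    payload = json.dumps({"text": text, "voice": voice}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize(text: str, voice: str, url: str = TTS_URL, timeout: float = 120.0) -> bytes:
    """Send the request and return the response body (assumed here to be audio bytes)."""
    request = build_tts_request(text, voice, url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        return response.read()
```

The front end would call something like `synthesize(reply_text, "my_cloned_voice")` after each generated message and play the returned audio. Building the request separately from sending it makes the payload easy to inspect without a running server.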
-
As for the speed of tortoise, apparently you can get something "passable" with just 4 samples and ~30-50 iterations. Hopefully things will speed up once bark switches to HF's transformers.
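To keep the numbers from this thread in one place, here is a small sketch that records the two suggested speed settings (4 samples with 4-16 iterations, and 4 samples with ~30-50 iterations). The class and constant names are made up for illustration. The keyword names in `to_kwargs` are an assumption about how these values map onto tortoise-tts's generation arguments, so check them against the version you have installed.

```python
from dataclasses import dataclass


@dataclass(frozen=True)
class TortoiseSpeedSettings:
    """Hypothetical container for the two speed/quality knobs discussed in the thread."""

    samples: int      # autoregressive candidates generated per utterance
    iterations: int   # diffusion steps used to refine the audio

    def __post_init__(self) -> None:
        if self.samples < 1 or self.iterations < 1:
            raise ValueError("samples and iterations must both be at least 1")

    def to_kwargs(self) -> dict:
        # Assumed mapping onto tortoise-tts keyword names; verify before use.
        return {
            "num_autoregressive_samples": self.samples,
            "diffusion_iterations": self.iterations,
        }


# Values quoted in this thread; no quality or timing claims are implied.
FASTEST = TortoiseSpeedSettings(samples=4, iterations=4)
FAST = TortoiseSpeedSettings(samples=4, iterations=16)
PASSABLE_LOW = TortoiseSpeedSettings(samples=4, iterations=30)
PASSABLE_HIGH = TortoiseSpeedSettings(samples=4, iterations=50)
```

Starting from `FAST` and raising only `iterations` follows the earlier advice that iterations buy quality more cheaply than samples.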
-
OK, so I took a look at the bark_tts plugin, and it seems to do several things specifically to enhance quality when running with oobabooga. For now, I'd suggest finding the "optimal" parameters using tts-generation-webui and then dialing them into the plugin.