Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Training code? #4

Open
GXcells opened this issue Oct 18, 2024 · 2 comments
Open

Training code? #4

GXcells opened this issue Oct 18, 2024 · 2 comments

Comments

@GXcells
Copy link

GXcells commented Oct 18, 2024

You did an amazing work.
I know it is very early but do you know if it will be possible to fine-tune the model and if yes are you planning to release a code for it or should I wait for community to work on it?

Thanks

@Nehc
Copy link

Nehc commented Oct 20, 2024

Yes, fine-tune of such a model is very interesting!

@yukiarimo
Copy link

I saw the fine-tuning was merged, but what does it mean no support to fine-tune the generation part? Is it possible train it on the Mel-spectrograms to turn into a text-to-speech-ish model like:

<text>Hello, this is a demo voice</text>
<audio>{audio_tokens_for_further_convertion_with_any_vocoder}

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

3 participants