-
Notifications
You must be signed in to change notification settings - Fork 38
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
How to improve the synthesized results? #19
Comments
@sanjeevani279 I have same problem with 22050 hz, while 1600hz is ok. Did you resolve this problem ? |
@chazo1994 Are you using 1600Hz and the batchsize is 16? How is the synthesis effect? |
@Summerxu86 I train two model 22050khz and 16khz, both use batchsize 48. Model 16k is faster convergence, and the synthesized audio at 200k step of model 16khz is much better than model 22k. |
What vocoder did you use? in the case of 16k and 22050? Did you use a different pre trained vocoder for each sampling rate? |
I have trained the model for 200k steps, and still, the synthesised results are extremely bad. The sampling rate I have used is 22050 Hz and the batch size used is 16.
This is how my loss curve looks after 200k steps. Can you help me with what can I do now to improve my synthesized audio results?
The text was updated successfully, but these errors were encountered: