
Can the GPU be used to create WAV files instead of the CPU? #598

Open

haydonryan opened this issue Sep 10, 2024 · 6 comments

@haydonryan

Love everyone's work here!

Reading the README: can the GPU be used to create WAV files by passing the --cuda parameter to the Python version?

Or is the Python version / CUDA only for training?

Thanks in advance. I'm trying to get the fastest TTS I can for converting large documents.
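For concreteness, here's the kind of thing I mean (a minimal sketch assuming the piper-tts Python package's PiperVoice API; I haven't verified the exact signatures, and the voice model name is just an example):

```python
import wave

from piper import PiperVoice  # assumed API from the piper-tts Python package

# Example voice model name -- substitute your own .onnx voice.
voice = PiperVoice.load("en_US-lessac-medium.onnx", use_cuda=True)

# Write the synthesized audio out as a WAV file.
with wave.open("hello.wav", "wb") as wav_file:
    voice.synthesize("Hello world", wav_file)
```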

@FrontierDK commented Sep 10, 2024

It can, but it's slower, at least for a single inference.

I got it working on Ubuntu (in a virtual machine with access to an RTX 3060 via PCIe passthrough). If I had to guess, the slowdown is due to all the overhead of setting up and preparing the GPU before finally running the inference.
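One way to check where the time goes is to measure the one-time load separately from the synthesis itself (a sketch using the same assumed PiperVoice API as above; numbers will obviously vary by hardware):

```python
import time
import wave

from piper import PiperVoice  # same assumed piper-tts API as above

t0 = time.perf_counter()
# One-time setup: model load plus GPU session initialization.
voice = PiperVoice.load("en_US-lessac-medium.onnx", use_cuda=True)
t1 = time.perf_counter()

# Per-utterance cost: a single synthesis pass.
with wave.open("bench.wav", "wb") as wav_file:
    voice.synthesize("The quick brown fox jumps over the lazy dog.", wav_file)
t2 = time.perf_counter()

print(f"load: {t1 - t0:.2f}s, synthesize: {t2 - t1:.2f}s")
```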

@haydonryan (Author)

Ah yeah, that makes sense, but it's unfortunate. My 5950X is getting a solid workout here.

@rajuaryan21

I tried, but it didn't work, I guess. I appended the --cuda parameter to the audio generation command and still see the CPU doing all the work instead of the RTX 4060. Is there something I am missing? I am using Windows 11 and piper.exe to test this. Would love to hear your insights.
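One thing worth checking, at least for the Python version (piper.exe is a separate native build, so this may not apply there): I believe --cuda relies on onnxruntime's CUDA execution provider, and if only the CPU build of onnxruntime is installed, inference can quietly fall back to the CPU. A quick check:

```python
import onnxruntime

# Prints the execution providers onnxruntime can actually use. If
# "CUDAExecutionProvider" is not in the list, only the CPU build of
# onnxruntime is installed and --cuda has nothing to run on.
print(onnxruntime.get_available_providers())
```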

@thetznecker

It does work, yeah. But generation time, even after the model is loaded into VRAM, is several times longer than on the CPU, unfortunately.

@haydonryan (Author)

I wonder why that is! Sadly, I'm not up on CUDA optimization enough (yet) to understand why this might be the case. Would love to see some focus on that (but I understand it's not really the main direction of the project).

@BryceBarbara

Just to confirm I understand: for use cases like the read-aloud extension, where it needs to run multiple generations one after the other, the GPU would likely be faster since it could reuse the same state for subsequent generations, right?
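Something like this is what I have in mind (a sketch reusing the assumed PiperVoice API from the comments above; whether the GPU actually comes out ahead would still need measuring):

```python
import wave

from piper import PiperVoice  # same assumed piper-tts API as in the sketches above

# Load once (the expensive part), then reuse the loaded voice for every chunk.
voice = PiperVoice.load("en_US-lessac-medium.onnx", use_cuda=True)

chunks = ["First paragraph of the document...", "Second paragraph..."]
for i, chunk in enumerate(chunks):
    with wave.open(f"chunk_{i}.wav", "wb") as wav_file:
        voice.synthesize(chunk, wav_file)
```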
