-
Notifications
You must be signed in to change notification settings - Fork 329
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Properly init WhisperKit regarding Model per Device management #254
Comments
Hi @Alonnasi!
It depends on the file (feel free to share a link) but dozens of transcription segments is not out of the ordinary. You can always cross-reference the results from our TestFlight app to what you are observing in your project as a sanity check.
You can always pre-download and bundle the models but your app download size will bloat so the trade-off is yours to make. Word timestamps are supported on all models. tiny and base variants are supported on all Apple Silicon Macs + iPhone XS and newer. Please use the GPU for the AudioEncoder model on iPhone XS, XR and 11. We will make these presets available soon so you shouldn't have to make device-specific defaults on your side. |
Thank you so much for the quick response! 😇 I've managed to make tests on Various devices which I tried almost Every model on each device (to compare results):
Thank you again 🙌 |
Can you please elaborate about the AudioEncoder usage? Will it help with running transcribe tasks on iPhone 11 / XS / XR? For WhisperKit's computeOptions, you will need to set audioEncoderCompute to be |
Hello everybody 😇
Hoping I will get some guidance here 🙏
I'm trying to manage the Models in terms of Downloading, Storing, and Device/Model Management, and I have some issues/questions that, in hope, will help me & others understand better and make the best of this awesome Kit.
But, I'm facing some error that comes from the Kit logger, right after the Model finishes downloading.
This error comes and goes and is not consistent, it can come when initiating the same Model that once worked, and different ones:
What am I doing wrong?
I'm adding a screenshot of the Transcribe func:
Is there a way for me to download the Model files straight into the App Project, and init them locally, with no need to download any Model? Is there a Model that can calculate Word Timestamps AND run on all devices?
Thanks so much for any help 🙌
The text was updated successfully, but these errors were encountered: