I've been using Whisper to subtitle videos, and translate them to English, when I could not find any subtitles for them. The only issue I've seen is that, while OpenAI's implementation generates subtitles, their timestamps are sometimes not great. But there are WhisperX and whisper-timestamped, which improve on that.
To be fair, I got here because it showed up in GitHub's "Explore repositories" and I thought "What does it do better than Whisper?", then I saw that it's just an older project. At this point I don't think it's a case of incorporating Whisper into AutoSub anymore: those projects (including OpenAI's implementation) already generate subtitles and seem to use a lot of tricks to improve performance. They are a lot faster, in fact: faster-whisper's speed is 54 s for 13 minutes of audio, about 4 seconds per minute of audio, against AutoSub's roughly 34 seconds per minute (40 minutes for 70 minutes of audio). Maybe AutoSub would be faster if the same hardware were used, but the readme makes it seem quite a bit slower.
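For anyone who wants to check the arithmetic behind that comparison, here is a rough sketch. The numbers are just the figures quoted above (54 s for 13 minutes, 40 minutes for 70 minutes); the helper name `seconds_per_audio_minute` is made up for illustration, and since the two figures were most likely measured on different hardware, the ratio is only indicative.

```python
def seconds_per_audio_minute(processing_seconds, audio_minutes):
    """Processing time spent per minute of input audio (hypothetical helper)."""
    return processing_seconds / audio_minutes


# Figures as quoted in this thread, not measured on the same hardware.
faster_whisper = seconds_per_audio_minute(54, 13)       # 54 s for 13 min of audio
autosub = seconds_per_audio_minute(40 * 60, 70)         # 40 min for 70 min of audio

# How many times longer AutoSub takes per minute of audio, under these figures.
ratio = autosub / faster_whisper

print(f"faster-whisper: {faster_whisper:.1f} s per minute of audio")
print(f"AutoSub:        {autosub:.1f} s per minute of audio")
print(f"ratio:          {ratio:.1f}x")
```

With those inputs the two rates come out at about 4.2 and 34.3 seconds per minute of audio, so roughly an 8x gap before accounting for hardware differences.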
OpenAI just released probably the best model that there is for speech recognition right now.
It would be great to incorporate this into this project!
More info: https://openai.com/blog/whisper/