Output transcription speed as a multiple of real-time in Whisper example #403

robertknight · 2024-11-09T13:58:53Z

This allows comparing performance across different audio clip lengths. Also this is a number that is often reported for other inference engines.

As a data point, on my Intel i5 laptop I get:

whisper-base: ~18x realtime
whisper-small: ~5x realtime
whisper-medium: ~2x realtime

This allows comparing performance across different audio clip lengths.

Output transcription speed as a multiple of real-time in Whisper example

f560fce

This allows comparing performance across different audio clip lengths.

robertknight merged commit aaaa8ce into main Nov 9, 2024
2 checks passed

robertknight deleted the whisper-real-time-factor branch November 9, 2024 14:04

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Output transcription speed as a multiple of real-time in Whisper example #403

Output transcription speed as a multiple of real-time in Whisper example #403

robertknight commented Nov 9, 2024 •

edited

Loading

Output transcription speed as a multiple of real-time in Whisper example #403

Output transcription speed as a multiple of real-time in Whisper example #403

Conversation

robertknight commented Nov 9, 2024 • edited Loading

robertknight commented Nov 9, 2024 •

edited

Loading