
linto-ai/whisper-timestamped

A Python library that extends OpenAI Whisper to generate word-level timestamps and confidence scores for multilingual automatic speech recognition.

2.8k stars · Python · Image · Video · Audio
Velocity (7d): +2.3 ★ / day · Trend: steady

The repository provides an implementation that builds upon OpenAI’s Whisper models to predict word-level timestamps, which the base Whisper model does not natively support. It assigns a confidence score to each word and segment during transcription. The project uses PyTorch and the transformers library, supporting CPU and GPU inference with optional Voice Activity Detection preprocessing.
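As a rough illustration of the word-level output described above, here is a minimal sketch in Python. It assumes the package is importable as `whisper_timestamped` and exposes `load_audio`, `load_model`, and `transcribe`, and that the result is a dictionary whose `segments` each carry a `words` list with `text`, `start`, `end`, and `confidence` keys. The helper names, the default model size, and the 0.5 threshold are hypothetical choices made for this example, not part of the project.

```python
from typing import Any, List, Tuple


def transcribe_with_word_timestamps(path: str, model_name: str = "tiny",
                                    device: str = "cpu") -> dict:
    """Transcribe one audio file; needs whisper-timestamped and model weights."""
    # Imported lazily so the helper below can be used without the package.
    import whisper_timestamped as whisper

    audio = whisper.load_audio(path)
    model = whisper.load_model(model_name, device=device)
    return whisper.transcribe(model, audio)


def low_confidence_words(result: dict, threshold: float = 0.5) -> List[Tuple[Any, Any, Any]]:
    """Return (text, start, end) for every word scored below the threshold.

    Assumes the result layout described in the lead-in; words without a
    confidence value are treated as confident and skipped.
    """
    flagged = []
    for segment in result.get("segments", []):
        for word in segment.get("words", []):
            if word.get("confidence", 1.0) < threshold:
                flagged.append((word["text"], word["start"], word["end"]))
    return flagged
```

A caller might run `low_confidence_words(transcribe_with_word_timestamps("AUDIO.wav"))` to list the words worth a manual check, where `AUDIO.wav` is a placeholder path.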
