linto-ai/whisper-timestamped
A Python library that extends OpenAI Whisper to generate word-level timestamps and confidence scores for multilingual automatic speech recognition.

Velocity · 7d
+2.3
★ / day
Trend
→steady
star history
The repository provides an implementation that builds upon OpenAI’s Whisper models to predict word-level timestamps, which the base Whisper model does not natively support. It assigns a confidence score to each word and segment during transcription. The project uses PyTorch and the transformers library, supporting CPU and GPU inference with optional Voice Activity Detection preprocessing.