← all repositories

jianfch/stable-ts

A Python library that modifies OpenAI's Whisper model to produce more reliable timestamps and provides transcription, forced alignment, and audio indexing utilities.

stable-ts
Velocity · 7d
+1.7
★ / day
Trend
steady
star history

This library extends OpenAI’s Whisper speech recognition model to generate more reliable and accurate word-level timestamps. It provides utilities for forced alignment of audio to text, audio indexing, silence suppression, and word regrouping. The library offers both standard and “whisperless” installation options, allowing users to integrate it with existing Whisper installations or use it as a standalone tool for timestamp refinement on audio transcriptions.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.