KoljaB/RealtimeSTT
A Python library for real-time speech-to-text transcription using ML models like Whisper.

Velocity · 7d
+9.7
★ / day
Trend
→steady
star history
RealtimeSTT provides low-latency speech-to-text with voice activity detection, wake word activation, and instant transcription. It primarily uses faster-whisper as its backend and supports multiple ASR engines including kroko_onnx for local streaming. The library integrates Silero VAD for voice activity detection and offers direct audio stream access for developers building voice-enabled applications.