
KoljaB/RealtimeSTT

A Python library for real-time speech-to-text transcription using ML models like Whisper.

9.9k stars · Python · Image · Video · Audio
Velocity (7d): +9.7 ★ / day · Trend: steady

RealtimeSTT provides low-latency speech-to-text with voice activity detection, wake word activation, and instant transcription. It primarily uses faster-whisper as its backend and supports multiple ASR engines including kroko_onnx for local streaming. The library integrates Silero VAD for voice activity detection and offers direct audio stream access for developers building voice-enabled applications.
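To make the idea of gating transcription on voice activity concrete, here is a minimal sketch in plain Python. It is not RealtimeSTT's API or implementation: a simple energy threshold stands in for Silero VAD, and the names `frame_energy`, `segment_speech`, `transcribe_stream`, `threshold`, and `max_silence_frames` are all hypothetical, chosen only for illustration. The `transcribe` callback is a placeholder for whatever ASR backend would be plugged in.

```python
from typing import Callable, Iterable, List, Sequence

# One fixed-size chunk of audio samples, assumed to be floats in [-1.0, 1.0].
Frame = Sequence[float]


def frame_energy(frame: Frame) -> float:
    """Mean squared amplitude of a frame (0.0 for an empty frame)."""
    if not frame:
        return 0.0
    return sum(s * s for s in frame) / len(frame)


def segment_speech(
    frames: Iterable[Frame],
    threshold: float = 0.01,
    max_silence_frames: int = 2,
) -> List[List[Frame]]:
    """Group frames into utterances with an energy threshold (toy VAD).

    An utterance starts at the first frame whose energy reaches `threshold`
    and ends once more than `max_silence_frames` quiet frames follow in a
    row. Shorter pauses stay inside the utterance; trailing silence is
    dropped.
    """
    utterances: List[List[Frame]] = []
    current: List[Frame] = []
    pending_silence: List[Frame] = []
    for frame in frames:
        if frame_energy(frame) >= threshold:
            # The pause (if any) was short, so keep it inside the utterance.
            current.extend(pending_silence)
            pending_silence = []
            current.append(frame)
        elif current:
            pending_silence.append(frame)
            if len(pending_silence) > max_silence_frames:
                # Long pause: close the utterance without the silence.
                utterances.append(current)
                current, pending_silence = [], []
    if current:
        utterances.append(current)
    return utterances


def transcribe_stream(
    frames: Iterable[Frame],
    transcribe: Callable[[List[Frame]], str],
    **vad_options: float,
) -> List[str]:
    """Run the placeholder `transcribe` callback once per detected utterance."""
    return [transcribe(u) for u in segment_speech(frames, **vad_options)]


if __name__ == "__main__":
    loud, quiet = [0.5] * 4, [0.0] * 4
    stream = [quiet, loud, loud, quiet, loud, quiet, quiet, quiet, loud, quiet]
    # Stand-in "ASR" that only reports how many frames it was given.
    print(transcribe_stream(stream, lambda u: f"{len(u)} frames"))
```

In this sketch the model is only invoked on segments that contain speech, which is the general reason a VAD stage helps keep latency and compute low. A real pipeline would use a trained detector and stream audio from a microphone rather than from a list.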
