alesaccoia/VoiceStreamAI
A Python/JavaScript server enabling near-realtime speech-to-text transcription using OpenAI's Whisper model via WebSocket.

Velocity · 7d
+1.1
★ / day
Trend
→steady
star history
VoiceStreamAI provides a real-time audio streaming and transcription solution built with Python and JavaScript. It leverages Huggingface’s Voice Activity Detection and OpenAI’s Whisper model (via faster-whisper) to perform accurate speech recognition. The system transmits audio chunks over WebSocket for processing, supporting multilingual transcription with a modular, factory-pattern architecture.