← all repositories

alesaccoia/VoiceStreamAI

A Python/JavaScript server enabling near-realtime speech-to-text transcription using OpenAI's Whisper model via WebSocket.

VoiceStreamAI
Velocity · 7d
+1.1
★ / day
Trend
steady
star history

VoiceStreamAI provides a real-time audio streaming and transcription solution built with Python and JavaScript. It leverages Huggingface’s Voice Activity Detection and OpenAI’s Whisper model (via faster-whisper) to perform accurate speech recognition. The system transmits audio chunks over WebSocket for processing, supporting multilingual transcription with a modular, factory-pattern architecture.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.