← all repositories

reriiasu/speech-to-text

Real-time speech-to-text tool powered by faster-whisper and Silero VAD with an HTML-based GUI.

615 stars HTML Image · Video · Audio
speech-to-text
Velocity · 7d
+0.5
★ / day
Trend
steady
star history

This project provides real-time transcription by capturing audio from a microphone, detecting voice activity using Silero VAD to segment speech, and converting audio to text using Faster-Whisper. It includes an HTML-based GUI for configuring model settings, adjusting VAD parameters, and viewing transcription results, with optional OpenAI API integration for proofreading.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.