reriiasu/speech-to-text
Real-time speech-to-text tool powered by faster-whisper and Silero VAD with an HTML-based GUI.

Velocity · 7d
+0.5
★ / day
Trend
→steady
star history
This project provides real-time transcription by capturing audio from a microphone, detecting voice activity using Silero VAD to segment speech, and converting audio to text using Faster-Whisper. It includes an HTML-based GUI for configuring model settings, adjusting VAD parameters, and viewing transcription results, with optional OpenAI API integration for proofreading.