CheshireCC/faster-whisper-GUI
A PySide6 desktop GUI that runs faster-whisper and whisperX models for automatic speech recognition and transcription.

Velocity · 7d
+2.8
★ / day
Trend
→steady
star history
This repository provides a graphical interface for faster-whisper, an optimized implementation of OpenAI’s Whisper model, and optionally whisperX for improved transcription. Users can transcribe audio or video files and export results to common subtitle and text formats (SRT, VTT, TXT, SMI, LRC). The application exposes model parameters including VAD configuration and supports additional audio preprocessing with Demucs for source separation.