Pikurrot/whisper-gui
A Gradio-based desktop GUI application for transcribing audio and video files using OpenAI's Whisper and WhisperX speech recognition models.

This project provides a user-friendly interface built with Gradio for interacting with Whisper and WhisperX models. It allows users to transcribe local audio and video files, supports automatic or manual language detection, and outputs results in multiple formats (SRT, JSON, TXT) with word and sentence-level timestamps. The application offers advanced transcription options and optimizations for running state-of-the-art speech-to-text models.