← all repositories

Pikurrot/whisper-gui

A Gradio-based desktop GUI application for transcribing audio and video files using OpenAI's Whisper and WhisperX speech recognition models.

whisper-gui
Velocity · 7d
+0.4
★ / day
Trend
steady
star history

This project provides a user-friendly interface built with Gradio for interacting with Whisper and WhisperX models. It allows users to transcribe local audio and video files, supports automatic or manual language detection, and outputs results in multiple formats (SRT, JSON, TXT) with word and sentence-level timestamps. The application offers advanced transcription options and optimizations for running state-of-the-art speech-to-text models.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.