chidiwilliams/buzz
A desktop application for offline audio and video transcription powered by OpenAI's Whisper model.

Buzz is an offline transcription and translation tool that processes audio and video files using OpenAI's Whisper speech recognition model. It supports real-time microphone transcription and speaker identification, and offers several backends: CUDA acceleration for Nvidia GPUs, optimization for Apple Silicon, and Vulkan. Transcripts can be exported in TXT, SRT, and VTT formats, and folder watching allows automated batch processing of new files.
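
To illustrate how the three export formats differ, here is a minimal sketch that renders the same transcript as TXT, SRT, and VTT. It is not Buzz's own exporter: the `Segment` class and the `format_timestamp`, `to_txt`, `to_srt`, and `to_vtt` helpers are hypothetical names introduced for this example, and only the format conventions themselves (numbered SRT cues with a comma before the milliseconds, a `WEBVTT` header and a period before the milliseconds in VTT) come from the formats.

```python
from dataclasses import dataclass


@dataclass
class Segment:
    """Hypothetical transcript segment: times in milliseconds plus text."""
    start_ms: int
    end_ms: int
    text: str


def format_timestamp(ms: int, separator: str) -> str:
    """Render milliseconds as HH:MM:SS<separator>mmm."""
    hours, rest = divmod(ms, 3_600_000)
    minutes, rest = divmod(rest, 60_000)
    seconds, millis = divmod(rest, 1_000)
    return f"{hours:02d}:{minutes:02d}:{seconds:02d}{separator}{millis:03d}"


def to_txt(segments: list[Segment]) -> str:
    """Plain text: segment texts joined by spaces, no timing information."""
    return " ".join(s.text.strip() for s in segments) + "\n"


def to_srt(segments: list[Segment]) -> str:
    """SubRip: numbered cues, comma before milliseconds, blank line between cues."""
    blocks = []
    for index, s in enumerate(segments, start=1):
        start = format_timestamp(s.start_ms, ",")
        end = format_timestamp(s.end_ms, ",")
        blocks.append(f"{index}\n{start} --> {end}\n{s.text.strip()}\n")
    return "\n".join(blocks)


def to_vtt(segments: list[Segment]) -> str:
    """WebVTT: 'WEBVTT' header, period before milliseconds, cue numbers optional."""
    blocks = ["WEBVTT\n"]
    for s in segments:
        start = format_timestamp(s.start_ms, ".")
        end = format_timestamp(s.end_ms, ".")
        blocks.append(f"{start} --> {end}\n{s.text.strip()}\n")
    return "\n".join(blocks)


if __name__ == "__main__":
    # Made-up sample data, purely for illustration.
    demo = [
        Segment(0, 2_500, "Hello there."),
        Segment(2_500, 5_000, "This is a test."),
    ]
    print(to_srt(demo))
    print(to_vtt(demo))
```

TXT drops the timing entirely, so it suits reading or searching, while SRT and VTT keep per-segment timestamps for use as subtitles.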