collabora/WhisperLive
A real-time speech-to-text transcription server built on OpenAI Whisper with support for multiple hardware backends.

Velocity · 7d
+3.6
★ / day
Trend
→steady
star history
WhisperLive is a real-time transcription application that runs the OpenAI Whisper model to convert live audio or pre-recorded files into text. It provides both server and client components, supporting hardware-accelerated inference via OpenVINO and TensorRT backends. The server exposes a REST API for clients to stream audio for transcription, with features including speaker diarization, word-level timestamps, and hotword support.