Is TheWhisper open source?

Yes — TheStageAI/TheWhisper is open source, released under the MIT license.

What language is TheWhisper written in?

TheStageAI/TheWhisper is primarily written in Python.

How popular is TheWhisper?

TheStageAI/TheWhisper has 892 stars on GitHub.

Where can I find TheWhisper?

TheStageAI/TheWhisper is on GitHub at https://github.com/TheStageAI/TheWhisper.

← all repositories

TheStageAI/TheWhisper

Whisper models that stream without the 30-second handcuffs

Fine-tuned Whisper weights that accept 10–30 second audio chunks, paired with optimized inference engines for both NVIDIA data centers and Apple Silicon laptops.

★892 stars Python Inference · Serving Image · Video · Audio

View on GitHub ↗ Homepage ↗

Not currently ranked — collecting fresh signals.

star history

What it does TheWhisper ships fine-tuned variants of OpenAI’s Whisper that accept audio chunks from 10 to 30 seconds without padding shorter clips with silence. It provides a Python inference kit with both open Hugging Face Transformers and proprietary TheStage AI optimized engines for NVIDIA GPUs, plus CoreML and MLX backends for Apple Silicon. The repository also includes a local REST API and an Electron/ReactJS frontend example for building macOS desktop transcribers.

The interesting bit Standard Whisper implementations force every input into a rigid 30-second window; TheWhisper’s fine-tuned weights remove that constraint, which directly helps streaming and low-latency scenarios. The project offers a split personality: fully open PyTorch and CoreML stacks for casual use, and closed optimized engines that claim 220 tok/s on an L40s and roughly 2W power draw on macOS.

Key highlights

Fine-tuned open weights support 10s, 15s, 20s, and 30s chunk modes without silence padding
Streaming transcription on both NVIDIA GPUs and Apple Silicon
Benchmarked on the multilingual Open ASR Leaderboard for quality comparisons
CoreML/MLX path claims ~2W power consumption and ~2GB RAM on macOS
Includes an Electron demo app and a certified macOS note-taking app, TheNotes
TheStage AI optimized NVIDIA engines are free for small organizations using ≤4 GPUs/year

Caveats

Several roadmap items are still unchecked: NVIDIA streaming containers, OpenAI-compatible API containers, and speaker diarization
TheStage AI’s optimized NVIDIA engines require a proprietary package and platform token, with commercial licensing for larger deployments
The “lowest in the world” macOS power claim is stated but not independently substantiated in the repository

Verdict Good for developers building real-time captioning, local meeting transcription, or edge voice interfaces who want Whisper-level accuracy without cloud dependencies. Skip it for now if you need turnkey speaker diarization or drop-in OpenAI-compatible inference containers.

Frequently asked

What is TheStageAI/TheWhisper?: Fine-tuned Whisper weights that accept 10–30 second audio chunks, paired with optimized inference engines for both NVIDIA data centers and Apple Silicon laptops.
Is TheWhisper open source?: Yes — TheStageAI/TheWhisper is open source, released under the MIT license.
What language is TheWhisper written in?: TheStageAI/TheWhisper is primarily written in Python.
How popular is TheWhisper?: TheStageAI/TheWhisper has 892 stars on GitHub.
Where can I find TheWhisper?: TheStageAI/TheWhisper is on GitHub at https://github.com/TheStageAI/TheWhisper.