peteonrails/voxtype
A Linux voice-to-text app that runs ASR models like Whisper, Cohere Transcribe, and Parakeet locally on CPU or GPU.

Voxtype is a push-to-talk voice-to-text application for Wayland compositors and desktops (Hyprland, Sway, GNOME, KDE). It supports seven speech recognition engines, including whisper.cpp, Cohere Transcribe, and Parakeet, and transcribes at 9-11x real-time speed on standard CPUs. It includes built-in text processing for punctuation and capitalization, supports per-user replacement tables, and can pipe output through an LLM or a shell script for post-processing. Models load dynamically and unload when idle to minimize memory usage.
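
To make the replacement-table and post-processing steps concrete, here is a minimal Python sketch of the general idea. It is not Voxtype's implementation or configuration format: the function names `apply_replacements` and `post_process`, the whole-word matching rule, and the fall-back-on-failure behavior are all assumptions chosen for illustration.

```python
from __future__ import annotations

import re
import subprocess


def apply_replacements(text: str, table: dict[str, str]) -> str:
    """Replace whole-word, case-insensitive matches of each table key.

    Hypothetical helper: the matching rules are an assumption, not
    taken from Voxtype.
    """
    if not table:
        return text
    # Try longer keys first so multi-word phrases win over their substrings.
    keys = sorted(table, key=len, reverse=True)
    pattern = re.compile(
        r"\b(" + "|".join(re.escape(k) for k in keys) + r")\b",
        re.IGNORECASE,
    )
    lookup = {k.lower(): v for k, v in table.items()}
    return pattern.sub(lambda m: lookup[m.group(1).lower()], text)


def post_process(text: str, command: list[str] | None = None,
                 timeout: float = 10.0) -> str:
    """Pipe text through an external command (stdin -> stdout).

    Hypothetical helper: returns the input unchanged if no command is
    given, or if the command is missing, fails, or times out.
    """
    if not command:
        return text
    try:
        result = subprocess.run(
            command, input=text, capture_output=True,
            text=True, timeout=timeout, check=True,
        )
    except (OSError, subprocess.SubprocessError):
        return text
    # Keep the original text if the command produced no output.
    return result.stdout.rstrip("\n") or text


if __name__ == "__main__":
    table = {"hyper land": "Hyprland", "vox type": "Voxtype"}
    raw = "i use vox type on hyper land"
    print(post_process(apply_replacements(raw, table)))
```

Falling back to the unprocessed text when the external command fails is one possible design choice: a dictation tool arguably should still produce output when a post-processing script is broken or slow.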