petewarden/spchcat
A local speech-to-text command-line tool for Linux and Raspberry Pi that transcribes audio from files or microphone without network access.

Velocity · 7d
+0.3
★ / day
Trend
→steady
star history
spchcat uses Coqui’s speech-to-text models and TensorFlow to run speech recognition entirely offline on the user’s machine. It reads audio input from WAV files, microphones, or system audio sources and outputs text transcripts. The tool relies on KenLM for language modeling and models trained on Mozilla’s Common Voice dataset, supporting multiple languages through Coqui’s model library.