speaches-ai/speaches
An OpenAI API-compatible inference server for speech-to-text and text-to-speech models.

Velocity · 7d
+4.5
★ / day
Trend
→steady
star history
Speaches provides an API-compatible wrapper for running faster-whisper (speech-to-text/transcription) and Kokoro/piper (text-to-speech) models locally or in Docker. It handles dynamic model loading and offloading, streaming transcription via SSE, and exposes endpoints compatible with OpenAI SDKs and tools. The project aims to provide Ollama-like convenience for speech AI models.