canopyai/Orpheus-TTS
An open-source TTS system leveraging an LLM backbone to generate natural human-like speech with emotions, voice cloning, and low-latency streaming.

Velocity · 7d
+13
★ / day
Trend
→steady
star history
Orpheus TTS is a text-to-speech system built on a Llama-3b backbone that generates natural-sounding speech with human-like intonation, emotion, and rhythm. It supports zero-shot voice cloning without fine-tuning and allows guided control over speech characteristics through simple tags. The system offers ~200ms streaming latency for real-time applications and provides multilingual model variants.