coqui-ai/TTS
A PyTorch-based deep learning library for text-to-speech synthesis, voice cloning, and speaker encoding.

Velocity · 7d
+21
★ / day
Trend
→steady
star history
TTS is a research and production-grade text-to-speech toolkit supporting multiple neural architectures including Glow-TTS, HiFiGAN, MelGAN, Tacotron, Bark, and XTTS. It provides pretrained models in over 1100 languages, tools for training new models, fine-tuning capabilities, and utilities for dataset analysis and curation. The library supports voice cloning, speaker encoding, and streaming inference with low latency.