mozilla/TTS
A PyTorch/TensorFlow deep learning library for advanced text-to-speech synthesis with neural vocoders and pretrained models.

Velocity · 7d
+3.3
★ / day
Trend
→steady
star history
Mozilla TTS provides state-of-the-art neural text-to-speech models including Tacotron, Tacotron2, Glow-TTS, and neural vocoders like MelGAN and Multi-band-MelGAN. The library implements the full TTS pipeline from text analysis through mel-spectrogram generation to waveform synthesis. It includes pretrained models across 20+ languages, dataset quality tools, and training recipes for custom voice model development.