huggingface/parler-tts
An open-source library for running and training Parler-TTS, a text-to-speech model that generates natural speech from text.

Parler-TTS is an inference and training library for a high-quality text-to-speech model. It generates speech with controllable speaker characteristics (such as gender, pitch, and speaking style) and supports inference optimizations including SDPA (scaled dot-product attention), Flash Attention 2, and model compilation. Model sizes range from 880M to 2.3B parameters, and all model weights, datasets, and training code are publicly released.
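As a rough illustration of the inference side, the sketch below generates a WAV file from a text prompt and a voice description. It assumes the `parler_tts`, `transformers`, `torch`, and `soundfile` packages are installed, and that speaker style is given as a plain-English description, as in the project's published examples. The checkpoint name `parler-tts/parler-tts-mini-v1`, the helper names `build_description` and `synthesize`, and the description wording are illustrative assumptions, not something stated in this summary.

```python
def build_description(gender: str, pitch: str, style: str) -> str:
    """Compose a plain-English voice description (hypothetical helper)."""
    return (
        f"A {gender} speaker with a {pitch} voice speaks in a {style} style, "
        "with very clear audio."
    )


def synthesize(prompt, description, out_path="parler_tts_out.wav",
               checkpoint="parler-tts/parler-tts-mini-v1",  # assumed checkpoint name
               attn_implementation=None):
    """Generate speech for `prompt` in the voice described by `description`."""
    # Heavy dependencies are imported lazily so the helper above can be
    # used without a model or GPU.
    import soundfile as sf
    import torch
    from parler_tts import ParlerTTSForConditionalGeneration
    from transformers import AutoTokenizer

    device = "cuda:0" if torch.cuda.is_available() else "cpu"

    # Optionally request an attention backend, e.g. "sdpa" or
    # "flash_attention_2"; by default the library picks one.
    kwargs = {"attn_implementation": attn_implementation} if attn_implementation else {}
    model = ParlerTTSForConditionalGeneration.from_pretrained(checkpoint, **kwargs).to(device)
    tokenizer = AutoTokenizer.from_pretrained(checkpoint)

    # The description conditions the voice; the prompt is the text to speak.
    input_ids = tokenizer(description, return_tensors="pt").input_ids.to(device)
    prompt_input_ids = tokenizer(prompt, return_tensors="pt").input_ids.to(device)

    generation = model.generate(input_ids=input_ids, prompt_input_ids=prompt_input_ids)
    audio = generation.cpu().numpy().squeeze()
    sf.write(out_path, audio, model.config.sampling_rate)
    return out_path
```

Calling `synthesize("Hey, how are you doing today?", build_description("female", "low-pitched", "calm"))` would download the checkpoint on first use and write the audio to `parler_tts_out.wav`. Keeping the description builder separate from the model call makes it easy to vary gender, pitch, and speaking style without reloading the model code.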