snakers4/silero-models
Pre-trained text-to-speech models for synthesizing natural speech in multiple languages.

Velocity · 7d
+2.8
★ / day
Trend
→steady
star history
Silero provides pre-trained TTS (text-to-speech) models that convert text into natural-sounding speech. The models are fully end-to-end, built on PyTorch, and support numerous languages including Russian, Armenian, Georgian, and other Cyrillic and Indic languages. They are designed for simple one-line usage and run efficiently on both CPU and GPU.