netease-youdao/EmotiVoice
Multi-voice emotional text-to-speech engine supporting English and Chinese with over 2000 voices.

Velocity · 7d
+9.0
★ / day
Trend
→steady
star history
EmotiVoice is an open-source neural text-to-speech system built with PyTorch that generates speech in English and Chinese from text prompts. It offers emotional synthesis capabilities, allowing generation of speech with various emotional expressions like happy, sad, angry, and excited. The system provides both a web interface for interactive use and a scripting interface for batch processing.