pnnbao97/VieNeu-TTS
Vietnamese text-to-speech system with deep learning-based voice cloning and on-device inference capabilities.

Velocity · 7d
+7.5
★ / day
Trend
→steady
star history
VieNeu-TTS is a bilingual Vietnamese-English text-to-speech system built on deep learning. It supports instant voice cloning from 3-5 seconds of reference audio and includes a dedicated Podcast/Conversation mode for multi-speaker dialogue. The system is optimized for both GPU inference via LMDeploy and CPU/GGUF/ONNX on-device deployment, delivering 24kHz audio output.