lenML/Speech-AI-Forge
A text-to-speech generation framework supporting multiple TTS/STT models, with an API server and a Gradio WebUI.

Velocity (7d): +1.9 ★/day · Trend: steady
Speech-AI-Forge is a project built around TTS generation models, providing an API server and a Gradio-based WebUI. It integrates popular speech models, including ChatTTS, CosyVoice, and Fish-Speech, and supports speech-to-text via Whisper. Users can deploy it locally, run it in Docker containers, use a Windows integration package, or run it in Google Colab.