zhao-kun/VibeVoiceFusion
A full-stack web application for multi-speaker voice generation and cloning built on Microsoft's VibeVoice model.

Velocity · 7d
+1.9
★ / day
Trend
→steady
star history
VibeVoiceFusion is a multi-speaker synthetic speech generation system combining autoregressive and diffusion architectures. It provides a complete web interface for voice synthesis, supports LoRA fine-tuning for custom voice adaptation, and includes batch generation with VRAM optimization features. The system enables voice cloning with distinct speaker characteristics without requiring coding knowledge.