zhao-kun/VibeVoiceFusion

A full-stack web application for multi-speaker voice generation and cloning built on Microsoft's VibeVoice model.

Velocity (7d): +1.9 ★ / day · Trend: steady

VibeVoiceFusion is a multi-speaker synthetic speech generation system combining autoregressive and diffusion architectures. It provides a complete web interface for voice synthesis, supports LoRA fine-tuning for custom voice adaptation, and includes batch generation with VRAM optimization features. The system enables voice cloning with distinct speaker characteristics without requiring coding knowledge.
