← all repositories

fishaudio/fish-diffusion

A PyTorch-based framework for training Text-to-Speech, Singing Voice Synthesis, and Singing Voice Conversion models using diffusion.

fish-diffusion
Velocity · 7d
+0.6
★ / day
Trend
steady
star history

Fish Diffusion is a training framework for audio generation tasks including Text-to-Speech (TTS), Singing Voice Synthesis (SVS), and Singing Voice Conversion (SVC). It leverages diffusion models and PyTorch to generate high-quality audio, featuring support for HiFiSinger models and integration with HuggingFace Spaces for model hosting.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.