prophesier/diff-svc
A diffusion-based singing voice conversion system that transforms input singing voices into a target timbre with pitch correction support.

This repository implements singing voice conversion using diffusion models. It converts input singing voices into a target timbre while supporting basic pitch correction. The system uses deep learning components including HuBERT and ContentVec for content feature extraction, along with neural vocoders for audio synthesis. The project supports training on custom datasets and provides GPU-accelerated inference for voice conversion tasks.