
andabi/deep-voice-conversion

A TensorFlow project that converts one person's voice to sound like a target speaker using deep neural networks.

3.9k stars Python Image · Video · Audio
Star velocity (7d): +1.2 ★ / day · Trend: steady

Voice style transfer system that converts a source voice to match a target speaker’s voice without requiring parallel training data. It uses two neural networks: one for speaker-independent phoneme classification and another for speech synthesis. The model trains on waveforms of the target speaker, so paired recordings of both speakers saying the same sentences are not needed.
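
The two-network data flow described above can be sketched roughly as follows. This is a toy illustration, not the repository's code: each network is replaced by a single random linear layer, and every name and size here (`classify_phonemes`, `synthesize`, `convert`, `N_FEATS`, `N_PHONES`, `N_SPEC`) is a hypothetical stand-in. It only shows how speaker-independent phoneme posteriors might sit between the source audio features and the target-speaker output.

```python
import math
import random

# Hypothetical sizes, chosen only to keep the example small.
N_FEATS = 4   # acoustic features per frame of source speech
N_PHONES = 3  # size of the phoneme inventory
N_SPEC = 5    # spectrogram bins per output frame


def random_matrix(rows, cols, rng):
    """Random weights standing in for a trained network."""
    return [[rng.uniform(-1.0, 1.0) for _ in range(cols)] for _ in range(rows)]


def matvec(matrix, vector):
    """Multiply a matrix (list of rows) by a vector."""
    return [sum(w * x for w, x in zip(row, vector)) for row in matrix]


def softmax(logits):
    """Turn scores into a probability distribution (numerically stable)."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def classify_phonemes(frames, weights):
    """Stage 1 stand-in: acoustic frames -> per-frame phoneme posteriors."""
    return [softmax(matvec(weights, frame)) for frame in frames]


def synthesize(posteriors, weights):
    """Stage 2 stand-in: phoneme posteriors -> target-speaker spectrogram frames."""
    return [matvec(weights, p) for p in posteriors]


def convert(source_frames, net1_weights, net2_weights):
    """Chain both stages: source speech features in, target-style frames out."""
    return synthesize(classify_phonemes(source_frames, net1_weights), net2_weights)


rng = random.Random(0)
net1 = random_matrix(N_PHONES, N_FEATS, rng)  # would be trained on phoneme-labelled speech
net2 = random_matrix(N_SPEC, N_PHONES, rng)   # would be trained on the target speaker's audio
source = random_matrix(6, N_FEATS, rng)       # six frames of made-up source features
posteriors = classify_phonemes(source, net1)
converted = convert(source, net1, net2)
```

In this sketch the second stage sees only phoneme posteriors, never the source speaker's raw features, which is one way to read why the approach can avoid parallel data: what was said passes through the intermediate representation, while the voice comes from whatever the second stage was trained on.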
