andabi/deep-voice-conversion
A TensorFlow project that converts one person's voice to sound like a target speaker using deep neural networks.

Velocity (7d): +1.2 ★/day · Trend: steady
A voice style transfer system that converts a source voice to sound like a target speaker without requiring parallel training data. It uses two neural networks to achieve non-parallel voice conversion: one for speaker-independent phoneme classification and another for speech synthesis. The model is trained on waveforms of the target speaker, which enables voice-cloning-style transfer.
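
The two-network flow described above can be illustrated with a minimal sketch. This is not the project's code: the repository uses TensorFlow, while this example uses plain Python with random, untrained weights so that only the data flow is visible. All names (`phoneme_posteriors`, `synthesize`, `convert`) and all dimensions are hypothetical, chosen for illustration.

```python
import math
import random

# Assumed toy dimensions; the real project's feature sizes are not given here.
N_FEATS = 4      # acoustic features per frame
N_PHONEMES = 5   # phoneme classes
N_SPEC = 6       # spectrogram bins per output frame


def random_matrix(rows, cols, rng):
    """Random weights standing in for a trained network (hypothetical)."""
    return [[rng.gauss(0.0, 0.1) for _ in range(cols)] for _ in range(rows)]


def matvec(frame, weights):
    """Multiply one frame (length = rows) by a rows x cols weight matrix."""
    n_cols = len(weights[0])
    return [sum(f * row[j] for f, row in zip(frame, weights)) for j in range(n_cols)]


def softmax(logits):
    """Numerically stable softmax over one frame's logits."""
    peak = max(logits)
    exps = [math.exp(v - peak) for v in logits]
    total = sum(exps)
    return [e / total for e in exps]


def phoneme_posteriors(frames, w1):
    """Network 1 stand-in: per-frame phoneme probabilities, independent of speaker."""
    return [softmax(matvec(frame, w1)) for frame in frames]


def synthesize(posteriors, w2):
    """Network 2 stand-in: map phoneme probabilities to non-negative spectral magnitudes."""
    return [[abs(v) for v in matvec(p, w2)] for p in posteriors]


def convert(frames, w1, w2):
    """Source frames -> phoneme posteriors -> target-speaker spectrogram frames."""
    return synthesize(phoneme_posteriors(frames, w1), w2)


rng = random.Random(0)
w1 = random_matrix(N_FEATS, N_PHONEMES, rng)   # would be trained on phoneme-labelled speech
w2 = random_matrix(N_PHONEMES, N_SPEC, rng)    # would be trained on the target speaker only
source_frames = [[rng.gauss(0.0, 1.0) for _ in range(N_FEATS)] for _ in range(3)]

posteriors = phoneme_posteriors(source_frames, w1)
converted = convert(source_frames, w1, w2)
print(len(converted), len(converted[0]))
```

The point of the split is that the phoneme posteriors act as a speaker-independent intermediate representation. Under that assumption, the second network only needs recordings of the target speaker, which is why no parallel source/target utterances are required.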