ivanvovk/WaveGrad
PyTorch implementation of Google Brain's WaveGrad diffusion vocoder for high-fidelity text-to-speech synthesis.

This repository provides an implementation of WaveGrad, a vocoder that uses a denoising diffusion probabilistic model to generate high-quality speech waveforms conditioned on mel-spectrograms. It supports inference with a configurable number of refinement iterations (from 6 to 1000), mixed-precision and distributed training, a parallel grid search for the best noise schedule at a given iteration count, and pretrained checkpoints for the LJSpeech dataset.
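To make the role of the noise schedule more concrete, here is a minimal, standard-library-only sketch of how a diffusion schedule maps an iteration count to per-step noise levels. It is not taken from this repository: the function names (`linear_beta_schedule`, `noise_levels`, `diffuse`) and the schedule endpoints are hypothetical choices for illustration, and a linear schedule is only one of the shapes a grid search might consider.

```python
import math


def linear_beta_schedule(n_iter, beta_start=1e-4, beta_end=0.05):
    """Return n_iter noise variances spaced linearly between the endpoints.

    The endpoint defaults are illustrative assumptions, not values
    taken from the repository.
    """
    if n_iter == 1:
        return [beta_start]
    step = (beta_end - beta_start) / (n_iter - 1)
    return [beta_start + i * step for i in range(n_iter)]


def noise_levels(betas):
    """Return sqrt(alpha_bar_t), where alpha_bar_t = prod_{s<=t} (1 - beta_s)."""
    levels = []
    alpha_bar = 1.0
    for beta in betas:
        alpha_bar *= 1.0 - beta
        levels.append(math.sqrt(alpha_bar))
    return levels


def diffuse(sample, noise, level):
    """Forward diffusion of a clean signal to a given noise level:
    y_t = level * y_0 + sqrt(1 - level^2) * eps
    """
    scale = math.sqrt(1.0 - level * level)
    return [level * s + scale * e for s, e in zip(sample, noise)]


if __name__ == "__main__":
    betas = linear_beta_schedule(6)
    for t, level in enumerate(noise_levels(betas), start=1):
        print(f"iteration {t}: signal scale {level:.4f}")
```

Under these assumptions, fewer iterations mean fewer, coarser steps between pure noise and clean audio, which is why the schedule matters more at 6 iterations than at 1000.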