← all repositories

ivanvovk/WaveGrad

PyTorch implementation of Google Brain's WaveGrad diffusion vocoder for high-fidelity text-to-speech synthesis.

409 stars Jupyter Notebook Image · Video · Audio
WaveGrad
Velocity · 7d
+0.2
★ / day
Trend
steady
star history

This repository provides a full implementation of WaveGrad, a probabilistic vocoder that uses a diffusion probabilistic model to generate high-quality speech waveforms from mel-spectrograms. The implementation includes support for multi-iteration inference (6 to 1000 iterations), mixed-precision and distributed training, parallel grid search for optimal noise schedules, and pretrained checkpoints for the LJSpeech dataset.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.