fatchord/WaveRNN
A PyTorch implementation of Deepmind's WaveRNN neural vocoder paired with Tacotron for neural text-to-speech synthesis.

Velocity · 7d
+0.7
★ / day
Trend
→steady
star history
This repository provides a complete TTS pipeline using two deep learning models: Tacotron converts text into mel-spectrograms, and WaveRNN generates raw audio waveforms from those spectrograms using efficient neural synthesis. Users can train custom models on datasets like LJSpeech, generate speech via a quick-start script, or integrate the vocoder separately for inference.