fatchord/WaveRNN

A PyTorch implementation of DeepMind's WaveRNN neural vocoder, paired with Tacotron for neural text-to-speech synthesis.

2.2k stars Python Image · Video · Audio
Velocity (7d): +0.7 ★/day · Trend: steady

This repository provides a complete text-to-speech (TTS) pipeline built from two deep learning models: Tacotron converts text into mel-spectrograms, and WaveRNN, a recurrent neural vocoder designed for efficient synthesis, generates raw audio waveforms from those spectrograms. Users can train custom models on datasets such as LJSpeech, generate speech with a quick-start script, or use the vocoder on its own for inference.
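The two-stage data flow described above can be sketched roughly as follows. This is a shape-level illustration only, not the repository's code: the function names `text_to_mel`, `mel_to_waveform`, and `synthesize` are hypothetical, the stubs emit zeros instead of running any model, and the constants (80 mel bands, 256 samples per frame, 5 frames per character) are assumed values chosen for the example.

```python
from typing import List

# All three constants are assumptions for illustration, not the project's settings.
N_MELS = 80          # mel bands per spectrogram frame
HOP_LENGTH = 256     # audio samples produced per mel frame
FRAMES_PER_CHAR = 5  # fixed ratio; a real Tacotron learns alignment via attention


def text_to_mel(text: str) -> List[List[float]]:
    """Stage 1 stand-in (Tacotron's role): text -> mel-spectrogram frames."""
    n_frames = len(text) * FRAMES_PER_CHAR
    return [[0.0] * N_MELS for _ in range(n_frames)]


def mel_to_waveform(mel: List[List[float]]) -> List[float]:
    """Stage 2 stand-in (WaveRNN's role): mel frames -> raw audio samples."""
    return [0.0] * (len(mel) * HOP_LENGTH)


def synthesize(text: str) -> List[float]:
    """Chain the two stages; the mel-spectrogram is the only interface between them."""
    mel = text_to_mel(text)
    return mel_to_waveform(mel)


if __name__ == "__main__":
    audio = synthesize("hello")
    print(len(audio), "samples")
```

Because the stages communicate only through the mel-spectrogram, a vocoder in this arrangement can be driven by spectrograms from any source, which is what makes using it separately for inference possible.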
