← all repositories

Rayhane-mamah/Tacotron-2

A TensorFlow implementation of DeepMind's Tacotron-2 neural network for text-to-speech synthesis.

2.3k stars Python Image · Video · Audio
Tacotron-2
Velocity · 7d
+0.8
★ / day
Trend
steady
star history

This repository implements the Tacotron-2 architecture combining a seq2seq encoder-decoder that predicts mel spectrograms from text with a Wavenet vocoder that converts those spectrograms into audio waveforms. It provides hyperparameters to reproduce the original paper results along with additional improvements that improve output quality in most cases. The project includes pre-trained checkpoints, training logs, and utilities for processing standard TTS datasets.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.