
NVIDIA/tacotron2

A PyTorch implementation of the Tacotron 2 neural network for text-to-speech synthesis.

5.3k stars · Jupyter Notebook · Image · Video · Audio
tacotron2
Velocity (7d): +1.8 ★/day · Trend: steady

A neural network model for text-to-speech synthesis, based on the Tacotron 2 architecture from the paper "Natural TTS Synthesis by Conditioning WaveNet on Mel Spectrogram Predictions". The model converts input text into mel spectrograms, which are then used to generate speech audio. The repository includes distributed training support, automatic mixed precision via NVIDIA Apex, and faster-than-real-time inference.
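To give a sense of what the "mel" in mel spectrogram refers to, here is a minimal sketch of the mel frequency scale in plain Python. It uses the common HTK-style formula, which is an illustrative choice and not necessarily the variant this repository uses. The helper names (`hz_to_mel`, `mel_to_hz`, `mel_band_centers`) and the example values (80 bands spanning 0 to 8000 Hz) are assumptions made for the example, not taken from the description above.

```python
import math


def hz_to_mel(freq_hz: float) -> float:
    """Convert a frequency in Hz to mels (HTK-style formula)."""
    return 2595.0 * math.log10(1.0 + freq_hz / 700.0)


def mel_to_hz(mel: float) -> float:
    """Inverse of hz_to_mel: convert mels back to Hz."""
    return 700.0 * (10.0 ** (mel / 2595.0) - 1.0)


def mel_band_centers(n_mels: int, fmin_hz: float, fmax_hz: float) -> list[float]:
    """Return the center frequency (Hz) of each mel band.

    The centers are equally spaced on the mel scale between fmin_hz and
    fmax_hz, so they sit close together at low frequencies and spread
    out at high frequencies.
    """
    lo, hi = hz_to_mel(fmin_hz), hz_to_mel(fmax_hz)
    step = (hi - lo) / (n_mels + 1)
    return [mel_to_hz(lo + step * i) for i in range(1, n_mels + 1)]


if __name__ == "__main__":
    # Hypothetical settings chosen for illustration only.
    centers = mel_band_centers(n_mels=80, fmin_hz=0.0, fmax_hz=8000.0)
    print(f"first band centers (Hz): {[round(c, 1) for c in centers[:3]]}")
    print(f"last band centers (Hz):  {[round(c, 1) for c in centers[-3:]]}")
```

A mel spectrogram sums the energy of an ordinary spectrogram into bands like these. The output has far fewer channels than a raw spectrogram while keeping more resolution at the low frequencies, where human hearing distinguishes pitch most finely.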
