r9y9/deepvoice3_pytorch
A PyTorch implementation of convolutional neural network text-to-speech synthesis models based on the Deep Voice 3 architecture.

This repository implements two convolutional text-to-speech models in PyTorch: Deep Voice 3 and the guided-attention model from "Efficiently Trainable Text-to-Speech System Based on Deep Convolutional Networks with Guided Attention". Both are convolutional sequence-to-sequence architectures with attention, and the code supports single-speaker and multi-speaker TTS. The project includes preprocessors for standard datasets such as LJSpeech, JSUT, and VCTK, along with pre-trained models and audio samples.
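To make the "guided attention" idea mentioned above concrete, here is a minimal sketch of the penalty it is usually described with: attention mass far from the text/audio diagonal is penalised by a weight W[n][t] = 1 - exp(-(n/N - t/T)^2 / (2 g^2)). This is an illustration written in plain Python, not code from this repository; the function names `guided_attention_weights` and `guided_attention_loss` and the default `g=0.2` are assumptions made for the example.

```python
import math


def guided_attention_weights(n_text, n_frames, g=0.2):
    """Build an n_text x n_frames penalty matrix (hypothetical helper).

    W[n][t] = 1 - exp(-((n / N) - (t / T))**2 / (2 * g**2)),
    which is 0 on the diagonal and grows towards 1 away from it.
    """
    weights = []
    for n in range(n_text):
        row = []
        for t in range(n_frames):
            diff = n / n_text - t / n_frames
            row.append(1.0 - math.exp(-(diff ** 2) / (2.0 * g ** 2)))
        weights.append(row)
    return weights


def guided_attention_loss(attention, g=0.2):
    """Mean of attention[n][t] * W[n][t] over all positions (hypothetical helper).

    `attention` is a list of rows, one per text position, each holding
    one attention value per audio frame.
    """
    n_text = len(attention)
    n_frames = len(attention[0])
    w = guided_attention_weights(n_text, n_frames, g)
    total = sum(
        attention[n][t] * w[n][t]
        for n in range(n_text)
        for t in range(n_frames)
    )
    return total / (n_text * n_frames)


# Attention that follows the diagonal is not penalised;
# attention that runs against it is.
diagonal = [[1.0 if n == t else 0.0 for t in range(4)] for n in range(4)]
anti_diagonal = [[1.0 if n + t == 3 else 0.0 for t in range(4)] for n in range(4)]
```

In a training setup this term would typically be added to the spectrogram reconstruction loss so that the attention settles into a roughly monotonic alignment early on; a tensor-based version would compute the same matrix with batched operations.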