← all repositories

r9y9/deepvoice3_pytorch

A PyTorch implementation of convolutional neural network text-to-speech synthesis models based on the Deep Voice 3 architecture.

2k stars Python Image · Video · Audio
deepvoice3_pytorch
Velocity · 7d
+0.6
★ / day
Trend
steady
star history

This repository provides a PyTorch implementation of deep learning-based text-to-speech synthesis systems, specifically implementing Deep Voice 3 and guided attention models. It supports both single-speaker and multi-speaker TTS with convolutional sequence-to-sequence architectures using attention mechanisms. The project includes preprocessors for standard datasets like LJSpeech, JSUT, and VCTK, along with pre-trained models and audio samples.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.