← all repositories

PaddlePaddle/PaddleSpeech

Speech processing toolkit with pretrained models for automatic speech recognition, text-to-speech synthesis, voice cloning, and speech translation.

12.6k stars Python Image · Video · Audio
PaddleSpeech
Velocity · 7d
+4.0
★ / day
Trend
steady
star history

PaddleSpeech is an open-source speech processing library that provides pretrained models and tools for speech recognition, speech synthesis, speaker verification, and end-to-end speech translation. It supports streaming and non-streaming inference modes, includes self-supervised learning models like wav2vec2, and implements modern architectures such as Conformer and Transformer. The toolkit won the NAACL2022 Best Demo Award.

heatdrop uses Google Analytics to see which pages get read — nothing else. Your call. How we handle data.