PaddlePaddle/PaddleSpeech
Speech processing toolkit with pretrained models for automatic speech recognition, text-to-speech synthesis, voice cloning, and speech translation.

PaddleSpeech is an open-source speech processing library that provides pretrained models and tools for speech recognition, speech synthesis, speaker verification, and end-to-end speech translation. It supports streaming and non-streaming inference modes, includes self-supervised learning models like wav2vec2, and implements modern architectures such as Conformer and Transformer. The toolkit won the NAACL2022 Best Demo Award.