
PaddlePaddle/PaddleSpeech

A speech Swiss Army knife that won the NAACL 2022 Best Demo Award

It exists because most speech toolkits treat Chinese text normalization and code-switching as afterthoughts.

★ 12.7k stars · Python · Image · Video · Audio


What it does

PaddleSpeech is a speech and audio toolkit built atop the PaddlePaddle framework. It covers the usual suspects: automatic speech recognition (ASR), text-to-speech (TTS), speaker verification, keyword spotting, audio classification, and speech translation, plus oddities like punctuation restoration and singing voice synthesis. The project also ships streaming ASR and TTS pipelines, C++ runtimes, and mobile demos for Android and ARM Linux.

The interesting bit

Most speech libraries bolt on multilingual support later; this one treats Chinese text normalization, grapheme-to-phoneme conversion, and tone sandhi as first-class citizens. It also handles code-switching—Mandarin-English and Cantonese mixes—without pretending the world is monolingual. The NAACL 2022 Best Demo Award suggests the judges noticed.
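
To make "tone sandhi" concrete, here is a minimal sketch of two textbook Mandarin rules applied to pinyin syllables with tone numbers. It is not PaddleSpeech's frontend code; the function names and the input format are assumptions made for illustration, and real frontends also account for word boundaries and prosody.

```python
# Toy tone sandhi rules on pinyin syllables such as "ni3" (tone number last).
# Function names and input format are hypothetical, not PaddleSpeech's API.

def apply_third_tone_sandhi(syllables):
    """A third tone directly before another third tone is read as second tone."""
    result = list(syllables)
    for i in range(len(syllables) - 1):
        # Compare against the original tones, not the already-changed ones.
        if syllables[i].endswith("3") and syllables[i + 1].endswith("3"):
            result[i] = syllables[i][:-1] + "2"
    return result


def apply_bu_sandhi(chars, syllables):
    """The character 不 (bu4) is read bu2 before a fourth-tone syllable."""
    result = list(syllables)
    for i in range(len(chars) - 1):
        if chars[i] == "不" and syllables[i] == "bu4" and syllables[i + 1].endswith("4"):
            result[i] = "bu2"
    return result


print(apply_third_tone_sandhi(["ni3", "hao3"]))        # 你好
print(apply_bu_sandhi(list("不是"), ["bu4", "shi4"]))   # 不是
```

Runs of three or more third tones depend on how the phrase is grouped, which this left-to-right sketch ignores.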

Key highlights

Self-supervised ASR backbones including Wav2vec 2.0, WavLM, and HuBERT, plus Whisper large v3 and turbo integration
Production-oriented streaming ASR and TTS with C++ deployment paths and server interfaces
Rule-based Chinese frontend handling polyphones, tone sandhi, and text normalization
Code-switching recipes for mixed-language speech recognition
Cascaded pipelines that glue speech tasks to NLP and computer vision workflows
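
As a rough illustration of what rule-based text normalization involves, the sketch below rewrites Arabic numerals as Chinese characters before synthesis. It is a simplified stand-in, not the project's implementation: the helper names are hypothetical, and it only covers years read digit by digit and cardinals up to 9999.

```python
import re

# Toy Chinese number normalization; all names here are illustrative assumptions.
DIGITS = "零一二三四五六七八九"
UNITS = ["", "十", "百", "千"]


def read_digits(number_text):
    """Read each digit separately, as is usual for years."""
    return "".join(DIGITS[int(ch)] for ch in number_text)


def read_cardinal(value):
    """Read an integer from 0 to 9999 as a cardinal number."""
    if not 0 <= value <= 9999:
        raise ValueError("this sketch only handles 0..9999")
    if value == 0:
        return "零"
    digits = str(value)
    parts = []
    pending_zero = False
    for position, ch in enumerate(digits):
        digit = int(ch)
        if digit == 0:
            pending_zero = True  # emit a single 零 only if a nonzero digit follows
            continue
        if pending_zero:
            parts.append("零")
            pending_zero = False
        parts.append(DIGITS[digit] + UNITS[len(digits) - 1 - position])
    text = "".join(parts)
    # 10..19 are read 十, 十一, ... rather than 一十, 一十一, ...
    return text[1:] if text.startswith("一十") else text


def normalize(text):
    """Replace four-digit years and other numbers with their spoken form."""
    text = re.sub(r"(\d{4})年", lambda m: read_digits(m.group(1)) + "年", text)

    def spoken(match):
        number = match.group()
        # Longer numbers fall back to digit-by-digit reading in this sketch.
        return read_cardinal(int(number)) if len(number) <= 4 else read_digits(number)

    return re.sub(r"\d+", spoken, text)


print(normalize("2022年有105个演示"))
```

A production frontend needs many more rules (dates, currency, units, phone numbers), which is why the order in which patterns are applied matters.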

Caveats

Tied tightly to the PaddlePaddle ecosystem; if you’re not already in that orbit, adopting PaddleSpeech means taking on the framework as a dependency
The README is heavy on demo links and quick-start tables, light on architectural detail or training cost estimates

Verdict

Worth a look if you need Mandarin-aware speech pipelines or want a single toolkit that spans research models and edge deployment. Skip it if you’re allergic to the PaddlePaddle stack or only need a narrow, framework-agnostic component.

Frequently asked

What is PaddlePaddle/PaddleSpeech?: A speech and audio toolkit built atop the PaddlePaddle framework. It exists because most speech toolkits treat Chinese text normalization and code-switching as afterthoughts.
Is PaddleSpeech open source?: Yes — PaddlePaddle/PaddleSpeech is open source, released under the Apache-2.0 license.
What language is PaddleSpeech written in?: PaddlePaddle/PaddleSpeech is primarily written in Python.
How popular is PaddleSpeech?: PaddlePaddle/PaddleSpeech has 12.7k stars on GitHub.
Where can I find PaddleSpeech?: PaddlePaddle/PaddleSpeech is on GitHub at https://github.com/PaddlePaddle/PaddleSpeech.