Is wespeaker open source?

Yes — wenet-e2e/wespeaker is open source, released under the Apache-2.0 license.

What language is wespeaker written in?

wenet-e2e/wespeaker is primarily written in Python.

How popular is wespeaker?

wenet-e2e/wespeaker has 1.4k stars on GitHub.

Where can I find wespeaker?

wenet-e2e/wespeaker is on GitHub at https://github.com/wenet-e2e/wespeaker.

← all repositories

wenet-e2e/wespeaker

A Swiss Army knife for telling voices apart

WeSpeaker bundles speaker verification, recognition, and diarization into one PyTorch toolkit that actually ships to production.

★1.4k stars Python Domain Apps Computer Vision ML Frameworks

View on GitHub ↗

Not currently ranked — collecting fresh signals.

star history

What it does WeSpeaker learns speaker embeddings — compact fingerprints of a voice — and uses them for verification (is this the same person?), recognition, and diarization (who spoke when). It handles feature extraction online or slurps pre-extracted Kaldi-format features if you’re migrating from older speech pipelines.

The interesting bit The project straddles research and production without falling into the usual trap of being too academic to deploy or too brittle to experiment with. It bundles a CLI, a Python API, and a separate runtime with MNN inference support. The maintainers also track competitive benchmarks obsessively — VoxCeleb, CNCeleb, NIST SRE16 — and publish the recipes to reproduce them.

Key highlights

Pretrained models available for English and Chinese, including DAMO models like eres2net-cn-common-200k and campplus-cn-common-200k
Self-supervised learning support (WavLM frontend, DINO, w2v-bert2) for label-scarce scenarios
Diarization via UMAP + HDBSCAN clustering on the VoxConverse recipe
Kaldi-compatible PLDA with unsupervised adaptation for domain shift
Score calibration and large-margin fine-tuning baked into standard recipes

Caveats

The README is essentially a changelog with installation steps; architectural overview lives in external docs and papers
Runtime and training code are split across repos, so “production-ready” requires some assembly
WeChat QR codes for community support may not suit all users

Verdict Speech researchers and engineers building voice biometrics or meeting transcription pipelines should grab this. Casual tinkerers may find the Kaldi heritage and recipe-heavy workflow steeper than end-to-end alternatives.

Frequently asked

What is wenet-e2e/wespeaker?: WeSpeaker bundles speaker verification, recognition, and diarization into one PyTorch toolkit that actually ships to production.
Is wespeaker open source?: Yes — wenet-e2e/wespeaker is open source, released under the Apache-2.0 license.
What language is wespeaker written in?: wenet-e2e/wespeaker is primarily written in Python.
How popular is wespeaker?: wenet-e2e/wespeaker has 1.4k stars on GitHub.
Where can I find wespeaker?: wenet-e2e/wespeaker is on GitHub at https://github.com/wenet-e2e/wespeaker.