
wq2012/awesome-diarization

A curated list of papers, libraries, datasets, and tools for speaker diarization using deep learning and machine learning techniques.

1.9k stars · Learning
Velocity (7d): +0.7 ★ / day · Trend: steady

This repository collects resources for speaker diarization, the speech processing task of determining who spoke when in an audio recording. The curated list covers publications, including review and survey papers on deep learning approaches; software frameworks for clustering and speaker embedding; evaluation tools; datasets for training and augmentation; and other learning materials such as courses, books, and tutorials. It serves as a centralized reference for practitioners and researchers in the speech and audio processing community.
