yinruiqing/pyannote-whisper

A Python tool that runs OpenAI Whisper for speech-to-text and pyannote.audio for speaker diarization, outputting transcripts with speaker labels and timestamps.

The project combines Whisper, an automatic speech recognition model, with pyannote.audio, a speaker diarization pipeline, into a single transcription workflow. It can be run from the command line or imported as a Python module. Each transcribed sentence in the output is paired with its identified speaker and time segment, which suits meeting summarization and other multi-speaker transcription tasks.
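
The pairing of transcribed text with speakers can be illustrated with a minimal sketch. It is not the project's own code: the function names (`overlap`, `assign_speakers`), the tuple layouts, and the `UNKNOWN` fallback label are all assumptions made for illustration. It assumes the speech recognizer yields `(start, end, text)` segments and the diarizer yields `(start, end, speaker)` turns, and it labels each segment with the speaker whose turns overlap it the longest.

```python
from typing import Dict, List, Tuple

# Hypothetical data shapes, chosen for this sketch only.
Segment = Tuple[float, float, str]        # (start_sec, end_sec, text)
Turn = Tuple[float, float, str]           # (start_sec, end_sec, speaker)
Labeled = Tuple[float, float, str, str]   # (start_sec, end_sec, speaker, text)


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Return the duration shared by two time intervals, or 0.0 if disjoint."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(
    segments: List[Segment],
    turns: List[Turn],
    unknown: str = "UNKNOWN",
) -> List[Labeled]:
    """Label each transcript segment with the speaker that overlaps it most."""
    labeled: List[Labeled] = []
    for start, end, text in segments:
        # Accumulate overlap per speaker, since one speaker may have several turns.
        totals: Dict[str, float] = {}
        for t_start, t_end, speaker in turns:
            shared = overlap(start, end, t_start, t_end)
            if shared > 0:
                totals[speaker] = totals.get(speaker, 0.0) + shared
        # Fall back to a placeholder label when no turn overlaps the segment.
        speaker = max(totals, key=totals.get) if totals else unknown
        labeled.append((start, end, speaker, text))
    return labeled


if __name__ == "__main__":
    segments = [(0.0, 2.0, "Hello."), (2.0, 5.0, "Hi there."), (10.0, 11.0, "Bye.")]
    turns = [(0.0, 2.2, "SPEAKER_00"), (2.2, 6.0, "SPEAKER_01")]
    for start, end, speaker, text in assign_speakers(segments, turns):
        print(f"{start:.2f} {end:.2f} {speaker} {text}")
```

Choosing the speaker by total overlap keeps the sketch simple; a segment that spans a speaker change is still given a single label, so a real pipeline may need to split such segments first.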