yinruiqing/pyannote-whisper
A Python tool that runs OpenAI Whisper for speech-to-text and pyannote.audio for speaker diarization, outputting transcripts with speaker labels and timestamps.

The project combines Whisper, an automatic speech recognition model, with pyannote.audio, a speaker diarization pipeline, into a single transcription workflow. It can be run from the command line or imported as a Python module. The output pairs each transcribed sentence with its identified speaker and time span, which makes it usable as input for meeting summarization and other multi-speaker transcription tasks.
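
To illustrate how transcribed text can be paired with speakers, here is a minimal sketch that labels each transcript segment with the speaker whose turn overlaps it the most in time. It is not taken from the project's code: the names `TextSegment`, `SpeakerTurn`, `assign_speakers`, and `format_transcript` are hypothetical, and maximum temporal overlap is an assumed matching rule chosen for illustration. The sketch uses plain Python data in place of real Whisper and pyannote.audio outputs.

```python
from dataclasses import dataclass
from typing import List, Optional, Tuple


@dataclass
class TextSegment:
    """A transcribed span (hypothetical stand-in for an ASR segment)."""
    start: float  # seconds
    end: float    # seconds
    text: str


@dataclass
class SpeakerTurn:
    """A diarized span (hypothetical stand-in for a diarization turn)."""
    start: float  # seconds
    end: float    # seconds
    speaker: str


def overlap(a_start: float, a_end: float, b_start: float, b_end: float) -> float:
    """Return the length in seconds of the intersection of two intervals."""
    return max(0.0, min(a_end, b_end) - max(a_start, b_start))


def assign_speakers(
    segments: List[TextSegment], turns: List[SpeakerTurn]
) -> List[Tuple[TextSegment, Optional[str]]]:
    """Label each segment with the speaker whose turn overlaps it the most.

    Assumption: maximum overlap is the matching rule. A segment that
    overlaps no turn gets None as its speaker.
    """
    labeled = []
    for seg in segments:
        best_speaker: Optional[str] = None
        best_overlap = 0.0
        for turn in turns:
            ov = overlap(seg.start, seg.end, turn.start, turn.end)
            if ov > best_overlap:
                best_overlap = ov
                best_speaker = turn.speaker
        labeled.append((seg, best_speaker))
    return labeled


def format_transcript(labeled: List[Tuple[TextSegment, Optional[str]]]) -> str:
    """Render one line per segment: start, end, speaker, text."""
    lines = []
    for seg, speaker in labeled:
        name = speaker or "UNKNOWN"
        lines.append(f"{seg.start:.2f} {seg.end:.2f} {name} {seg.text}")
    return "\n".join(lines)
```

In a real workflow, the segments would come from the speech recognizer and the turns from the diarization pipeline. The quadratic loop is kept for readability; for long recordings, sorting both lists by start time and sweeping through them once would avoid comparing every pair.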