noamgot / README.md
Last active August 11, 2025 10:29
Generate Speaker Diarization Ground Truth

Speaker Diarization GT Generator

Generate ground-truth diarization data using Voice Activity Detection (VAD).

This script processes audio files using pyannote's VAD pipeline to create ground-truth diarization data in RTTM format. For each input audio file, it:

  1. Applies Voice Activity Detection to identify speech segments
  2. Labels the segments with the audio file's name
  3. Saves individual RTTM files for each input
  4. Creates a combined RTTM file mixing all inputs
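
Steps 2 to 4 can be sketched roughly as follows. This is a minimal illustration, not the script itself. It assumes the VAD step has already produced `(start, end)` speech segments in seconds for each file, so the segments here are hard-coded and pyannote is not called. It also assumes the combined RTTM uses a single shared recording ID. The helper names `rttm_line` and `build_rttm`, the `mix` ID, and the sample file names are all hypothetical.

```python
from pathlib import Path


def rttm_line(file_id, start, end, speaker):
    """Format one speech segment as a standard RTTM SPEAKER record."""
    duration = end - start
    return (
        f"SPEAKER {file_id} 1 {start:.3f} {duration:.3f} "
        f"<NA> <NA> {speaker} <NA> <NA>"
    )


def build_rttm(segments_by_file, mix_id="mix"):
    """Return (per-file RTTM lines, combined RTTM lines).

    segments_by_file maps an audio path to a list of (start, end) tuples.
    Each segment is labeled with the audio file's name (without extension).
    mix_id is an assumed recording ID for the combined file.
    """
    per_file = {}
    combined = []
    for audio_path, segments in segments_by_file.items():
        name = Path(audio_path).stem
        # Individual RTTM: the file name serves as recording ID and speaker label.
        per_file[name] = [rttm_line(name, s, e, name) for s, e in segments]
        # Combined RTTM: shared recording ID, speaker label stays the file name.
        combined.extend((s, rttm_line(mix_id, s, e, name)) for s, e in segments)
    # Order the combined records by onset time.
    combined.sort(key=lambda item: item[0])
    return per_file, [line for _, line in combined]


# Hard-coded segments standing in for VAD output (assumption for illustration).
segments_by_file = {
    "alice.wav": [(0.5, 2.0), (3.25, 4.0)],
    "bob.wav": [(1.0, 1.75)],
}
per_file, combined = build_rttm(segments_by_file)

for line in combined:
    print(line)
```

To save the results, each list in `per_file` could be written to its own `.rttm` file and `combined` to a single file, one record per line.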