noamgot

Please visit my LinkedIn page for more info about me: https://www.linkedin.com/in/noamgot/

1 follower · 1 following

in/noamgot

View GitHub Profile

Recently created

Least recently created

Recently updated

Least recently updated

2 files
0 forks
0 comments
0 stars

noamgot / README.md

Last active August 11, 2025 10:29

Generate Speaker Diarization Ground Truth

Speaker Diarization GT Generator

Generate ground-truth diarization data using Voice Activity Detection (VAD).

This script processes audio files using pyannote's VAD pipeline to create ground-truth diarization data in RTTM format. For each input audio file, it:

Applies Voice Activity Detection to identify speech segments
Labels the segments with the audio file's name
Saves individual RTTM files for each input
Creates a combined RTTM file mixing all inputs