Zheng Rui ZhengRui

How NeMo Diarizer Works

This article details NeMo’s diarization pipelines—ClusteringDiarizer and NeuralDiarizer (MSDD)—covering the algorithmic flow, scale definitions, fusion strategies, speaker counting, long‑form handling, and the exposed configuration surfaces, with practical examples.

Two pipelines, one foundation

ClusteringDiarizer (unsupervised):
- VAD → cut speech into multi-scale windows ("scales") → extract speaker embeddings → fuse information across scales → estimate how many speakers → spectral clustering → RTTM.
- Output has one dominant speaker at each time (no learned overlap model).
NeuralDiarizer (MSDD) (learned overlap model on top):
Reuses ClusteringDiarizer’s products (speakers discovered, multiscale embeddings) and predicts, at each time step, which of the discovered speakers are active. Multiple can be active → overlap-aware RTTM.

	countloc() {
	local exclude_ext="webp\|ttf\|json\|png\|lock\|lockb\|svg\|jpg\|jpeg\|gif\|ico\|pdf\|zip\|tar\|gz\|mp3\|mp4\|woff\|woff2\|eot"
	local exclude_dirs=".venv\|node_modules\|.git\|dist\|build\|coverage\|__pycache__\|.next\|.cache"
	local force=0
	local breakdown=0
	local help=0
	local languages=()
	local selected_extensions=()
	local dir="." # Default directory is current directory

	# This file has been auto-generated by i3-config-wizard(1).
	# It will not be overwritten, so edit it as you like.
	#
	# Should you change your keyboard layout somewhen, delete
	# this file and re-run i3-config-wizard(1).
	#

	# i3 config file (v4)
	#
	# Please see http://i3wm.org/docs/userguide.html for a complete reference!

	" add for vundle
	set nocompatible
	filetype off

	" set the runtime path to include Vundle and initialize
	set rtp+=~/.vim/bundle/Vundle.vim
	call vundle#begin()

	Plugin 'gmarik/Vundle.vim'
	Plugin 'Valloric/YouCompleteMe'

	from PyQt4 import QtGui, QtCore
	import sys

	class QStrokeRect(QtGui.QGraphicsRectItem):
	def __init__(self, parent=None):
	super(QStrokeRect, self).__init__(parent)
	self.strokeWidth = 4
	self.setPen(QtGui.QPen(QtGui.QColor(255, 0, 0), 4, QtCore.Qt.SolidLine))
	self.setFlags(QtGui.QGraphicsItem.ItemIsSelectable)

	#! /usr/bin/env python

	import matplotlib
	import matplotlib.pyplot as plt
	matplotlib.style.use("ggplot")

	import numpy as np
	import time

	def newfig(l):

	{
	"enabled_plugins": [
	"SimpleReloadPlugin",
	"SimpleRefresh"
	]
	}