Task | Model version | Comments |
---|---|---|
Voice Activity Detection | Multilingual Marblenet | Other versions exist trained on telephonic conversation or only on english data |
Speaker Embeddings | Titanet Large | Smaller version of the model exists. |
Multiscale Clustering | Diarization MSDD Telephonic | Specifically trained on telephonic conversations which makes it suitable for similar use cases. |
Created
July 12, 2023 13:15
-
-
Save ljnmedium/25e1b7435084e90d679394f9d7d60dd1 to your computer and use it in GitHub Desktop.
pipeline.md
Sign up for free
to join this conversation on GitHub.
Already have an account?
Sign in to comment