The Newsbridge-Telecom SudParis VoxCeleb Speaker Recognition Challenge 2022 System Description

2023-01-17 15:52:39

Yannis Tevissen (ARMEDIA-SAMOVAR), Jérôme Boudy (ARMEDIA-SAMOVAR), Frédéric Petitpont

arXiv_CL Segmentation Recognition Detection Embedding Activity

Abstract

We describe the system used by our team for the VoxCeleb Speaker Recognition Challenge 2022 (VoxSRC 2022) in the speaker diarization track. Our solution is designed around a new combination of voice activity detection algorithms that draws on the strengths of several systems. We introduce a novel multi-stream approach with a decision protocol based on classifier entropy. We call this method multi-stream voice activity detection and use it with standard baseline diarization embeddings, clustering and resegmentation. With this work, we demonstrate that by using a strong baseline and working only on voice activity detection, one can achieve close to state-of-the-art results.
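
The abstract does not spell out the entropy-based decision protocol, so the following is only a minimal sketch of one plausible reading: each voice activity detector emits a per-frame speech probability, and for every frame the decision is taken from the stream whose prediction has the lowest binary entropy, that is, the most confident one. The function names, the 0.5 threshold and the sample probabilities are illustrative assumptions, not details taken from the paper.

```python
import math


def binary_entropy(p):
    """Shannon entropy (in bits) of a Bernoulli speech probability p."""
    if p <= 0.0 or p >= 1.0:
        return 0.0  # a fully certain prediction carries no entropy
    return -(p * math.log2(p) + (1.0 - p) * math.log2(1.0 - p))


def fuse_streams(streams, threshold=0.5):
    """Fuse several VAD streams frame by frame (hypothetical protocol).

    streams: list of equal-length lists of speech probabilities,
             one list per voice activity detector.
    Returns a list of booleans (True = speech), taking for each frame
    the decision of the stream with the lowest entropy.
    """
    n_frames = len(streams[0])
    if any(len(s) != n_frames for s in streams):
        raise ValueError("all streams must cover the same frames")
    decisions = []
    for frame in zip(*streams):
        # Assumption: the lowest-entropy stream is the most trustworthy one.
        most_confident = min(frame, key=binary_entropy)
        decisions.append(most_confident >= threshold)
    return decisions


# Made-up probabilities from two hypothetical detectors.
vad_a = [0.95, 0.60, 0.10, 0.45]
vad_b = [0.70, 0.05, 0.40, 0.90]
print(fuse_streams([vad_a, vad_b]))  # [True, False, False, True]
```

In this toy run the second frame follows detector B (0.05 is more certain than 0.60) and the last frame also follows detector B (0.90 against 0.45), which illustrates how an entropy criterion lets each stream win where it is most confident. The actual system may weight, smooth or combine the streams differently.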

URL

https://arxiv.org/abs/2301.07491

PDF

https://arxiv.org/pdf/2301.07491.pdf
