Online Speaker Diarization with Graph-based Label Generation

2021-11-27 03:34:34

Yucong Zhang, Qinjian Lin, Weiqing Wang, Lin Yang, Xuyang Wang, Junjie Wang, Ming Li

arXiv_SD

Abstract
Abstract (translated)
URL
PDF

Abstract

This paper introduces an online speaker diarization system that can handle long-time audio with low latency. First, a new variant of agglomerative hierarchy clustering is built to cluster the speakers in an online fashion. Then, a speaker embedding graph is proposed. We use this graph to exploit a graph-based reclustering method to further improve the performance. Finally, a label matching algorithm is introduced to generate consistent speaker labels, and we evaluate our system on both DIHARD3 and VoxConverse datasets, which contain long audios with various kinds of scenarios. The experimental results show that our online diarization system outperforms the baseline offline system and has comparable performance to our offline system.

Abstract (translated)

URL

https://arxiv.org/abs/2111.13803

PDF

https://arxiv.org/pdf/2111.13803.pdf