Model Compression for DNN-Based Text-Independent Speaker Verification Using Weight Quantization

2022-10-31 13:46:47

Jingyu Li, Zhaoyang Zhang, Jiong Wang, Tan Lee

arXiv_SD

Abstract
Abstract (translated)
URL
PDF

Abstract

DNN-based models achieve high performance in the speaker verification (SV) task with substantial computation costs. The model size is an essential concern in applying models on resource-constrained devices, while model compression for SV models has not been studied extensively in previous works. Weight quantization is exploited to compress DNN-based speaker embedding extraction models in this paper. Uniform and Powers-of-Two quantization are utilized in the experiments. The results on VoxCeleb show that the weight quantization can decrease the size of ECAPA-TDNN and ResNet by 4 times with insignificant performance decline. The quantized 4-bit ResNet achieves similar performance to the original model with an 8 times smaller size. We empirically show that the performance of ECAPA-TDNN is more sensitive than ResNet to quantization due to the difference in weight distribution. The experiments on CN-Celeb also demonstrate that quantized models are robust for SV in the language mismatch scenario.

Abstract (translated)

URL

https://arxiv.org/abs/2210.17326

PDF

https://arxiv.org/pdf/2210.17326.pdf

Model Compression for DNN-Based Text-Independent Speaker Verification Using Weight Quantization

Abstract

Abstract (translated)

URL

PDF Copy

PDF