Controllable Multichannel Speech Dereverberation based on Deep Neural Networks

2021-10-16 01:41:25

Ziteng Wang, Yueyue Na, Biao Tian, Qiang Fu

arXiv_SD

arXiv_SD Pose Speech

Abstract
Abstract (translated)
URL
PDF

Abstract

Neural network based speech dereverberation has achieved promising results in recent studies. Nevertheless, many are focused on recovery of only the direct path sound and early reflections, which could be beneficial to speech perception, are discarded. The performance of a model trained to recover clean speech degrades when evaluated on early reverberation targets, and vice versa. This paper proposes a novel deep neural network based multichannel speech dereverberation algorithm, in which the dereverberation level is controllable. This is realized by adding a simple floating-point number as target controller of the model. Experiments are conducted using spatially distributed microphones, and the efficacy of the proposed algorithm is confirmed in various simulated conditions.

Abstract (translated)

URL

https://arxiv.org/abs/2110.08439

PDF

https://arxiv.org/pdf/2110.08439.pdf