Domain-aware Neural Language Models for Speech Recognition

2021-01-05 00:08:32

Linda Liu, Yile Gu, Aditya Gourav, Ankur Gandhe, Shashank Kalmane, Denis Filimonov, Ariya Rastrow, Ivan Bulyko

arXiv_CL

arXiv_CL Speech_Recognition RNN Recognition Classification Language_Model Speech

Abstract
Abstract (translated)
URL
PDF

Abstract

As voice assistants become more ubiquitous, they are increasingly expected to support and perform well on a wide variety of use-cases across different domains. We present a domain-aware rescoring framework suitable for achieving domain-adaptation during second-pass rescoring in production settings. In our framework, we fine-tune a domain-general neural language model on several domains, and use an LSTM-based domain classification model to select the appropriate domain-adapted model to use for second-pass rescoring. This domain-aware rescoring improves the word error rate by up to 2.4% and slot word error rate by up to 4.1% on three individual domains -- shopping, navigation, and music -- compared to domain general rescoring. These improvements are obtained while maintaining accuracy for the general use case.

Abstract (translated)

URL

https://arxiv.org/abs/2101.03229

PDF

https://arxiv.org/pdf/2101.03229.pdf