AutoNLU: Detecting, root-causing, and fixing NLU model errors

2021-10-12 22:12:26

Pooja Sethi, Denis Savenkov, Forough Arabshahi, Jack Goetz, Micaela Tolliver, Nicolas Scheffer, Ilknur Kabul, Yue Liu, Ahmed Aly

arXiv_CL

Abstract

Improving the quality of Natural Language Understanding (NLU) models, and more specifically, task-oriented semantic parsing models, in production is a cumbersome task. In this work, we present a system called AutoNLU, which we designed to scale the NLU quality improvement process. It adds automation to three key steps: detection, attribution, and correction of model errors, i.e., bugs. We detected four times more failed tasks than with random sampling, finding that even a simple active learning sampling method on an uncalibrated model is surprisingly effective for this purpose. The AutoNLU tool empowered linguists to fix ten times more semantic parsing bugs than with prior manual processes, auto-correcting 65% of all identified bugs.
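The abstract does not say which active learning sampling method was used. As a minimal sketch, assuming a least-confidence (uncertainty) sampling strategy, the detection step could look like the following. The function name `least_confidence_sample`, the utterances, and the confidence scores are all hypothetical and chosen only for illustration; they are not taken from the paper.

```python
import heapq
from typing import Dict, List, Tuple


def least_confidence_sample(
    confidences: Dict[str, float], k: int
) -> List[Tuple[str, float]]:
    """Return the k utterances whose top-prediction confidence is lowest.

    `confidences` maps each utterance to the (possibly uncalibrated)
    probability the model assigned to its top-ranked parse.
    """
    # nsmallest returns items sorted by ascending confidence.
    return heapq.nsmallest(k, confidences.items(), key=lambda item: item[1])


# Hypothetical model outputs; in practice these would come from the NLU model.
model_confidence = {
    "play some jazz": 0.97,
    "remind me to call mom at 5": 0.91,
    "text the thing to the guy": 0.42,
    "set alarm for half past never": 0.35,
    "what's the weather": 0.99,
}

# The least confident utterances are the ones sent for human review.
candidates = least_confidence_sample(model_confidence, k=2)
for utterance, score in candidates:
    print(f"{score:.2f}  {utterance}")
```

Ranking by raw confidence needs only a relative ordering of utterances, which may be why such a method can still surface likely failures when the model's probabilities are uncalibrated.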

URL

https://arxiv.org/abs/2110.06384

PDF

https://arxiv.org/pdf/2110.06384.pdf