AutoNLU: Detecting, root-causing, and fixing NLU model errors

2021-10-12Unverified0· sign in to hype

Pooja Sethi, Denis Savenkov, Forough Arabshahi, Jack Goetz, Micaela Tolliver, Nicolas Scheffer, Ilknur Kabul, Yue Liu, Ahmed Aly

arXiv PDF

Unverified — Be the first to reproduce this paper.

Reproduce

Abstract

Improving the quality of Natural Language Understanding (NLU) models, and more specifically, task-oriented semantic parsing models, in production is a cumbersome task. In this work, we present a system called AutoNLU, which we designed to scale the NLU quality improvement process. It adds automation to three key steps: detection, attribution, and correction of model errors, i.e., bugs. We detected four times more failed tasks than with random sampling, finding that even a simple active learning sampling method on an uncalibrated model is surprisingly effective for this purpose. The AutoNLU tool empowered linguists to fix ten times more semantic parsing bugs than with prior manual processes, auto-correcting 65% of all identified bugs.

Tasks

Active Learning Natural Language Understanding Semantic Parsing

AutoNLU: Detecting, root-causing, and fixing NLU model errors

Abstract

Tasks

Reproductions