Active Learning for New Domains in Natural Language Understanding

2018-10-03NAACL 2019Unverified0· sign in to hype

Stanislav Peshterliev, John Kearney, Abhyuday Jagannatha, Imre Kiss, Spyros Matsoukas

Unverified — Be the first to reproduce this paper.

Abstract

We explore active learning (AL) for improving the accuracy of new domains in a natural language understanding (NLU) system. We propose an algorithm called Majority-CRF that uses an ensemble of classification models to guide the selection of relevant utterances, as well as a sequence labeling model to help prioritize informative examples. Experiments with three domains show that Majority-CRF achieves 6.6%-9% relative error rate reduction compared to random sampling with the same annotation budget, and statistically significant improvements compared to other AL approaches. Additionally, case studies with human-in-the-loop AL on six new domains show 4.6%-9% improvement on an existing NLU system.

Tasks

Active Learning General Classification Natural Language Understanding

Active Learning for New Domains in Natural Language Understanding

Abstract

Tasks

Reproductions