Closing the Loop: Fast, Interactive Semi-Supervised Annotation With Queries on Features and Instances
2011-07-01Proceedings of the 2011 Conference on Empirical Methods in Natural Language Processing 2011Code Available1· sign in to hype
Burr Settles
Code Available — Be the first to reproduce this paper.
ReproduceCode
- github.com/burrsettles/dualistnone★ 86
Abstract
This paper describes DUALIST, an active learning annotation paradigm which solicits and learns from labels on both features (e.g., words) and instances (e.g., documents). We present a novel semi-supervised training algorithm developed for this setting, which is (1) fast enough to support real-time interactive speeds, and (2) at least as accurate as preexisting methods for learning with mixed feature and instance labels. Human annotators in user studies were able to produce near-state-of-the-art classifiers—on several corpora in a variety of application domains—with only a few minutes of effort.