A Competitive Algorithm for Agnostic Active Learning

2023-10-28NeurIPS 2023Unverified0· sign in to hype

Eric Price, Yihan Zhou

Unverified — Be the first to reproduce this paper.

Abstract

For some hypothesis classes and input distributions, active agnostic learning needs exponentially fewer samples than passive learning; for other classes and distributions, it offers little to no improvement. The most popular algorithms for agnostic active learning express their performance in terms of a parameter called the disagreement coefficient, but it is known that these algorithms are inefficient on some inputs. We take a different approach to agnostic active learning, getting an algorithm that is competitive with the optimal algorithm for any binary hypothesis class H and distribution D_X over X. In particular, if any algorithm can use m^* queries to get O() error, then our algorithm uses O(m^* |H|) queries to get O() error. Our algorithm lies in the vein of the splitting-based approach of Dasgupta [2004], which gets a similar result for the realizable ( = 0) setting. We also show that it is NP-hard to do better than our algorithm's O( |H|) overhead in general.

Tasks

Active Learning

A Competitive Algorithm for Agnostic Active Learning

Abstract

Tasks

Reproductions