Low-Regret Active learning

04/06/2021
by   Cenk Baykal, et al.
11

We develop an online learning algorithm for identifying unlabeled data points that are most informative for training (i.e., active learning). By formulating the active learning problem as the prediction with sleeping experts problem, we provide a framework for identifying informative data with respect to any given definition of informativeness. At the core of our work is an efficient algorithm for sleeping experts that is tailored to achieve low regret on predictable (easy) instances while remaining resilient to adversarial ones. This stands in contrast to state-of-the-art active learning methods that are overwhelmingly based on greedy selection, and hence cannot ensure good performance across varying problem instances. We present empirical results demonstrating that our method (i) instantiated with an informativeness measure consistently outperforms its greedy counterpart and (ii) reliably outperforms uniform sampling on real-world data sets and models.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/23/2021

One-Round Active Learning

Active learning has been a main solution for reducing data labeling cost...
research
04/16/2021

Data Shapley Valuation for Efficient Batch Active Learning

Annotating the right set of data amongst all available data points is a ...
research
02/27/2017

Diameter-Based Active Learning

To date, the tightest upper and lower-bounds for the active learning of ...
research
06/20/2012

On Discarding, Caching, and Recalling Samples in Active Learning

We address challenges of active learning under scarce informational reso...
research
12/04/2019

Active Learning of SVDD Hyperparameter Values

Support Vector Data Description is a popular method for outlier detectio...
research
06/23/2017

A Variance Maximization Criterion for Active Learning

Active learning aims to train a classifier as fast as possible with as f...
research
10/14/2020

Identifying Wrongly Predicted Samples: A Method for Active Learning

State-of-the-art machine learning models require access to significant a...

Please sign up or login with your details

Forgot password? Click here to reset