Active Learning Helps Pretrained Models Learn the Intended Task

04/18/2022
by Alex Tamkin, et al.

Models can fail in unpredictable ways during deployment because of task ambiguity, which arises when multiple behaviors are consistent with the provided training data. An example is an object classifier trained on red squares and blue circles: when it encounters blue squares, the intended behavior is undefined. We investigate whether pretrained models are better active learners, capable of disambiguating between the possible tasks a user may be trying to specify. Intriguingly, we find that better active learning is an emergent property of the pretraining process: pretrained models require up to 5 times fewer labels when using uncertainty-based active learning, while non-pretrained models see no benefit or even a negative one. We find that these gains come from an ability to select examples with attributes that disambiguate the intended behavior, such as rare product categories or atypical backgrounds. These attributes are far more linearly separable in the representation spaces of pretrained models than in those of non-pretrained models, suggesting a possible mechanism for this behavior.
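To make the phrase "uncertainty-based active learning" concrete, here is a minimal sketch of one common form of it: score each unlabeled example by the entropy of the model's predicted class distribution and request labels for the highest-scoring ones. The abstract does not say which acquisition function the authors use, so the entropy score, the function names, and the toy probabilities below are all assumptions made for illustration.

```python
import numpy as np


def predictive_entropy(probs):
    """Entropy of each row of an (n_examples, n_classes) probability array."""
    # Clip to avoid log(0); an assumption for numerical safety.
    probs = np.clip(probs, 1e-12, 1.0)
    return -(probs * np.log(probs)).sum(axis=1)


def select_most_uncertain(probs, k):
    """Return indices of the k pool examples with the highest entropy."""
    scores = predictive_entropy(probs)
    # Sort by descending uncertainty and keep the first k indices.
    return np.argsort(-scores, kind="stable")[:k]


# Hypothetical model outputs on a four-example unlabeled pool (two classes).
pool_probs = np.array([
    [0.98, 0.02],  # confident prediction
    [0.55, 0.45],  # near the decision boundary
    [0.90, 0.10],
    [0.50, 0.50],  # maximally uncertain
])

query_indices = select_most_uncertain(pool_probs, k=2)
print(query_indices)
```

In a full loop, the selected examples would be labeled, added to the training set, and the model retrained before the next round of selection. In the red-square and blue-circle example, a blue square is the kind of input one would hope receives a high uncertainty score.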


