ALdataset: a benchmark for pool-based active learning

10/16/2020
by   Xueying Zhan, et al.
0

Active learning (AL) is a subfield of machine learning (ML) in which a learning algorithm could achieve good accuracy with less training samples by interactively querying a user/oracle to label new data points. Pool-based AL is well-motivated in many ML tasks, where unlabeled data is abundant, but their labels are hard to obtain. Although many pool-based AL methods have been developed, the lack of a comparative benchmarking and integration of techniques makes it difficult to: 1) determine the current state-of-the-art technique; 2) evaluate the relative benefit of new methods for various properties of the dataset; 3) understand what specific problems merit greater attention; and 4) measure the progress of the field over time. To conduct easier comparative evaluation among AL methods, we present a benchmark task for pool-based active learning, which consists of benchmarking datasets and quantitative metrics that summarize overall performance. We present experiment results for various active learning strategies, both recently proposed and classic highly-cited methods, and draw insights from the results.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/04/2022

Pareto Optimization for Active Learning under Out-of-Distribution Data Scenarios

Pool-based Active Learning (AL) has achieved great success in minimizing...
research
02/22/2021

Interpret-able feedback for AutoML systems

Automated machine learning (AutoML) systems aim to enable training machi...
research
05/23/2022

PyRelationAL: A Library for Active Learning Research and Development

In constrained real-world scenarios where it is challenging or costly to...
research
09/11/2023

Learning Objective-Specific Active Learning Strategies with Attentive Neural Processes

Pool-based active learning (AL) is a promising technology for increasing...
research
06/08/2021

A critical look at the current train/test split in machine learning

The randomized or cross-validated split of training and testing sets has...
research
07/08/2023

Active Learning in Physics: From 101, to Progress, and Perspective

Active Learning (AL) is a family of machine learning (ML) algorithms tha...
research
09/11/2023

Stream-based Active Learning by Exploiting Temporal Properties in Perception with Temporal Predicted Loss

Active learning (AL) reduces the amount of labeled data needed to train ...

Please sign up or login with your details

Forgot password? Click here to reset