On the Robustness of Active Learning

06/18/2020
by   Lukas Hahn, et al.
0

Active Learning is concerned with the question of how to identify the most useful samples for a Machine Learning algorithm to be trained with. When applied correctly, it can be a very powerful tool to counteract the immense data requirements of Artificial Neural Networks. However, we find that it is often applied with not enough care and domain knowledge. As a consequence, unrealistic hopes are raised and transfer of the experimental results from one dataset to another becomes unnecessarily hard. In this work we analyse the robustness of different Active Learning methods with respect to classifier capacity, exchangeability and type, as well as hyperparameters and falsely labelled data. Experiments reveal possible biases towards the architecture used for sample selection, resulting in suboptimal performance for other classifiers. We further propose the new "Sum of Squared Logits" method based on the Simpson diversity index and investigate the effect of using the confusion matrix for balancing in sample selection.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/20/2021

Influence Selection for Active Learning

The existing active learning methods select the samples by evaluating th...
research
11/10/2009

Active Learning for Mention Detection: A Comparison of Sentence Selection Strategies

We propose and compare various sentence selection strategies for active ...
research
10/26/2020

Inspecting Sample Reusability for Active Learning

Active Learning (AL) exploits a learning algorithm to selectively sample...
research
12/21/2021

Practical Active Learning with Model Selection for Small Data

Active learning is of great interest for many practical applications, es...
research
06/13/2022

On the reusability of samples in active learning

An interesting but not extensively studied question in active learning i...
research
11/19/2020

Finding the Homology of Decision Boundaries with Active Learning

Accurately and efficiently characterizing the decision boundary of class...
research
06/14/2023

Towards Balanced Active Learning for Multimodal Classification

Training multimodal networks requires a vast amount of data due to their...

Please sign up or login with your details

Forgot password? Click here to reset