Minority Class Oriented Active Learning for Imbalanced Datasets

02/01/2022
by   Umang Aggarwal, et al.
1

Active learning aims to optimize the dataset annotation process when resources are constrained. Most existing methods are designed for balanced datasets. Their practical applicability is limited by the fact that a majority of real-life datasets are actually imbalanced. Here, we introduce a new active learning method which is designed for imbalanced datasets. It favors samples likely to be in minority classes so as to reduce the imbalance of the labeled subset and create a better representation for these classes. We also compare two training schemes for active learning: (1) the one commonly deployed in deep active learning using model fine tuning for each iteration and (2) a scheme which is inspired by transfer learning and exploits generic pre-trained models and train shallow classifiers for each iteration. Evaluation is run with three imbalanced datasets. Results show that the proposed active learning method outperforms competitive baselines. Equally interesting, they also indicate that the transfer learning training scheme outperforms model fine tuning if features are transferable from the generic dataset to the unlabeled one. This last result is surprising and should encourage the community to explore the design of deep active learning methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/09/2021

Class-Balanced Active Learning for Image Classification

Active learning aims to reduce the labeling effort that is required to t...
research
01/18/2022

Optimizing Active Learning for Low Annotation Budgets

When we can not assume a large amount of annotated data , active learnin...
research
02/03/2018

AFT*: Integrating Active Learning and Transfer Learning to Reduce Annotation Efforts

The splendid success of convolutional neural networks (CNNs) in computer...
research
08/25/2020

Active Class Incremental Learning for Imbalanced Datasets

Incremental Learning (IL) allows AI systems to adapt to streamed data. M...
research
10/10/2020

On the Importance of Adaptive Data Collection for Extremely Imbalanced Pairwise Tasks

Many pairwise classification tasks, such as paraphrase detection and ope...
research
08/14/2018

An Overview and a Benchmark of Active Learning for One-Class Classification

Active learning stands for methods which increase classification quality...
research
01/26/2021

Adversarial Vulnerability of Active Transfer Learning

Two widely used techniques for training supervised machine learning mode...

Please sign up or login with your details

Forgot password? Click here to reset