Deep Active Learning for Biased Datasets via Fisher Kernel Self-Supervision

03/01/2020
by   Denis Gudovskiy, et al.
0

Active learning (AL) aims to minimize labeling efforts for data-demanding deep neural networks (DNNs) by selecting the most representative data points for annotation. However, currently used methods are ill-equipped to deal with biased data. The main motivation of this paper is to consider a realistic setting for pool-based semi-supervised AL, where the unlabeled collection of train data is biased. We theoretically derive an optimal acquisition function for AL in this setting. It can be formulated as distribution shift minimization between unlabeled train data and weakly-labeled validation dataset. To implement such acquisition function, we propose a low-complexity method for feature density matching using self-supervised Fisher kernel (FK) as well as several novel pseudo-label estimators. Our FK-based method outperforms state-of-the-art methods on MNIST, SVHN, and ImageNet classification while requiring only 1/10th of processing. The conducted experiments show at least 40 existing methods.

READ FULL TEXT
research
10/19/2020

Semi-supervised Batch Active Learning via Bilevel Optimization

Active learning is an effective technique for reducing the labeling cost...
research
06/22/2021

Active Learning under Pool Set Distribution Shift and Noisy Data

Active Learning is essential for more label-efficient deep learning. Bay...
research
11/16/2020

On the Marginal Benefit of Active Learning: Does Self-Supervision Eat Its Cake?

Active learning is the set of techniques for intelligently labeling larg...
research
01/19/2022

Using Self-Supervised Pretext Tasks for Active Learning

Labeling a large set of data is expensive. Active learning aims to tackl...
research
01/25/2023

Toward Realistic Evaluation of Deep Active Learning Algorithms in Image Classification

Active Learning (AL) aims to reduce the labeling burden by interactively...
research
11/30/2020

On Initial Pools for Deep Active Learning

Active Learning (AL) techniques aim to minimize the training data requir...
research
06/20/2022

Actively Learning Deep Neural Networks with Uncertainty Sampling Based on Sum-Product Networks

Active learning is popular approach for reducing the amount of data in t...

Please sign up or login with your details

Forgot password? Click here to reset