Open-Sampling: Exploring Out-of-Distribution data for Re-balancing Long-tailed datasets

06/17/2022
by   Hongxin Wei, et al.
0

Deep neural networks usually perform poorly when the training dataset suffers from extreme class imbalance. Recent studies found that directly training with out-of-distribution data (i.e., open-set samples) in a semi-supervised manner would harm the generalization performance. In this work, we theoretically show that out-of-distribution data can still be leveraged to augment the minority classes from a Bayesian perspective. Based on this motivation, we propose a novel method called Open-sampling, which utilizes open-set noisy labels to re-balance the class priors of the training dataset. For each open-set instance, the label is sampled from our pre-defined distribution that is complementary to the distribution of original class priors. We empirically show that Open-sampling not only re-balances the class priors but also encourages the neural network to learn separable representations. Extensive experiments demonstrate that our proposed method significantly outperforms existing data re-balancing methods and can boost the performance of existing state-of-the-art methods.

READ FULL TEXT

page 7

page 8

research
09/22/2020

Gamma distribution-based sampling for imbalanced data

Imbalanced class distribution is a common problem in a number of fields ...
research
11/20/2022

Learning from Long-Tailed Noisy Data with Sample Selection and Balanced Loss

The success of deep learning depends on large-scale and well-curated tra...
research
06/19/2022

Out-of-distribution Detection by Cross-class Vicinity Distribution of In-distribution Data

Deep neural networks only learn to map in-distribution inputs to their c...
research
01/25/2023

Integrating Local Real Data with Global Gradient Prototypes for Classifier Re-Balancing in Federated Long-Tailed Learning

Federated Learning (FL) has become a popular distributed learning paradi...
research
07/10/2022

One-shot Neural Backdoor Erasing via Adversarial Weight Masking

Recent studies show that despite achieving high accuracy on a number of ...
research
12/05/2019

BBN: Bilateral-Branch Network with Cumulative Learning for Long-Tailed Visual Recognition

Our work focuses on tackling the challenging but natural visual recognit...
research
03/01/2022

Addressing Randomness in Evaluation Protocols for Out-of-Distribution Detection

Deep Neural Networks for classification behave unpredictably when confro...

Please sign up or login with your details

Forgot password? Click here to reset