CReST: A Class-Rebalancing Self-Training Framework for Imbalanced Semi-Supervised Learning

02/18/2021
by   Chen Wei, et al.
0

Semi-supervised learning on class-imbalanced data, although a realistic problem, has been under studied. While existing semi-supervised learning (SSL) methods are known to perform poorly on minority classes, we find that they still generate high precision pseudo-labels on minority classes. By exploiting this property, in this work, we propose Class-Rebalancing Self-Training (CReST), a simple yet effective framework to improve existing SSL methods on class-imbalanced data. CReST iteratively retrains a baseline SSL model with a labeled set expanded by adding pseudo-labeled samples from an unlabeled set, where pseudo-labeled samples from minority classes are selected more frequently according to an estimated class distribution. We also propose a progressive distribution alignment to adaptively adjust the rebalancing strength dubbed CReST+. We show that CReST and CReST+ improve state-of-the-art SSL algorithms on various class-imbalanced datasets and consistently outperform other popular rebalancing methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/14/2021

BiSTF: Bilateral-Branch Self-Training Framework for Semi-Supervised Large-scale Fine-Grained Recognition

Semi-supervised Fine-Grained Recognition is a challenge task due to the ...
research
12/22/2021

Barely-Supervised Learning: Semi-Supervised Learning with very few labeled images

This paper tackles the problem of semi-supervised learning when the set ...
research
05/26/2022

Transfer and Share: Semi-Supervised Learning from Long-Tailed Data

Long-Tailed Semi-Supervised Learning (LTSSL) aims to learn from class-im...
research
11/20/2022

An interpretable imbalanced semi-supervised deep learning framework for improving differential diagnosis of skin diseases

Dermatological diseases are among the most common disorders worldwide. T...
research
03/24/2019

Exploiting Synthetically Generated Data with Semi-Supervised Learning for Small and Imbalanced Datasets

Data augmentation is rapidly gaining attention in machine learning. Synt...
research
08/29/2023

Prototype Fission: Closing Set for Robust Open-set Semi-supervised Learning

Semi-supervised Learning (SSL) has been proven vulnerable to out-of-dist...
research
07/17/2018

Pseudo-Feature Generation for Imbalanced Data Analysis in Deep Learning

We generate pseudo-features by multivariate probability distributions ob...

Please sign up or login with your details

Forgot password? Click here to reset