When does Privileged Information Explain Away Label Noise?

03/03/2023
by   Guillermo Ortiz-Jiménez, et al.
0

Leveraging privileged information (PI), or features available during training but not at test time, has recently been shown to be an effective method for addressing label noise. However, the reasons for its effectiveness are not well understood. In this study, we investigate the role played by different properties of the PI in explaining away label noise. Through experiments on multiple datasets with real PI (CIFAR-N/H) and a new large-scale benchmark ImageNet-PI, we find that PI is most helpful when it allows networks to easily distinguish clean from noisy data, while enabling a learning shortcut to memorize the noisy examples. Interestingly, when PI becomes too predictive of the target label, PI methods often perform worse than their no-PI baselines. Based on these findings, we propose several enhancements to the state-of-the-art PI methods and demonstrate the potential of PI as a means of tackling label noise. Finally, we show how we can easily combine the resulting PI approaches with existing no-PI techniques designed to deal with label noise.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/18/2022

Transfer and Marginalize: Explaining Away Label Noise with Privileged Information

Supervised learning datasets often have privileged information, in the f...
research
09/07/2021

Instance-dependent Label-noise Learning under a Structural Causal Model

Label noise will degenerate the performance of deep learning algorithms ...
research
12/01/2022

Noisy Label Classification using Label Noise Selection with Test-Time Augmentation Cross-Entropy and NoiseMix Learning

As the size of the dataset used in deep learning tasks increases, the no...
research
10/18/2022

CNT (Conditioning on Noisy Targets): A new Algorithm for Leveraging Top-Down Feedback

We propose a novel regularizer for supervised learning called Conditioni...
research
06/09/2014

Training Convolutional Networks with Noisy Labels

The availability of large labeled datasets has allowed Convolutional Net...
research
04/06/2023

Logistic-Normal Likelihoods for Heteroscedastic Label Noise in Classification

A natural way of estimating heteroscedastic label noise in regression is...
research
06/24/2023

Cross-Validation Is All You Need: A Statistical Approach To Label Noise Estimation

Label noise is prevalent in machine learning datasets. It is crucial to ...

Please sign up or login with your details

Forgot password? Click here to reset