Learning with Inadequate and Incorrect Supervision

02/20/2019
by Chen Gong, et al.

In practice, we often face the dilemma that the labeled data at hand are too few to train a reliable classifier and, more seriously, that some of these data may be mislabeled due to various human factors. This paper therefore proposes a novel semi-supervised learning paradigm that handles both label insufficiency and label inaccuracy. To address label insufficiency, we use a graph to connect the data points so that label information can be propagated from the scarce labeled examples to the unlabeled examples along the graph edges. To address label inaccuracy, Graph Trend Filtering (GTF) and Smooth Eigenbase Pursuit (SEP) are adopted to filter out the initial noisy labels. GTF penalizes the l_0 norm of the label difference between connected examples in the graph and exhibits better local adaptivity than the traditional l_2 norm-based Laplacian smoother. SEP reconstructs the correct labels by emphasizing the leading eigenvectors of the Laplacian matrix, that is, those associated with small eigenvalues, as these eigenvectors reflect real label smoothness and carry rich class separation cues. We term our algorithm "Semi-supervised learning under Inadequate and Incorrect Supervision" (SIIS). Thorough experimental results on image classification, text categorization, and speech recognition demonstrate that SIIS is effective in label error correction, leading to performance superior to that of state-of-the-art methods in the presence of label noise and label scarcity.
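To make the graph-based ideas in the abstract concrete, here is a minimal NumPy sketch of the two classical building blocks it refers to: the l_2 norm-based Laplacian smoother and a projection of the labels onto the Laplacian eigenvectors with the smallest eigenvalues. This is not the authors' SIIS, GTF, or SEP formulation (in particular, it contains no l_0 penalty). The toy data, the Gaussian affinity, and all function names and parameters (`sigma`, `lam`, `k`) are assumptions chosen for illustration.

```python
import numpy as np

def gaussian_affinity(X, sigma=1.0):
    """Dense affinity matrix with Gaussian weights and a zero diagonal (assumed graph)."""
    sq_dists = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
    W = np.exp(-sq_dists / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)
    return W

def graph_laplacian(W):
    """Unnormalized graph Laplacian L = D - W."""
    return np.diag(W.sum(axis=1)) - W

def laplacian_smoother(L, y, lam=10.0):
    """l_2 smoother: minimize ||f - y||^2 + lam * f^T L f, so f = (I + lam L)^{-1} y."""
    n = L.shape[0]
    return np.linalg.solve(np.eye(n) + lam * L, y)

def smooth_eigenbasis_projection(L, y, k=2):
    """Project y onto the k eigenvectors of L with the smallest eigenvalues."""
    _, vecs = np.linalg.eigh(L)  # eigh returns eigenvalues in ascending order
    U = vecs[:, :k]
    return U @ (U.T @ y)

# Toy data (hypothetical): two well-separated 1-D clusters of 10 points each.
X = np.concatenate([np.linspace(0.0, 1.0, 10),
                    np.linspace(5.0, 6.0, 10)]).reshape(-1, 1)
y_true = np.concatenate([np.ones(10), -np.ones(10)])

# Scarce and partly wrong supervision: 0 means "unlabeled".
y_noisy = np.zeros(20)
y_noisy[[0, 3, 6]] = 1.0        # correct labels in the first cluster
y_noisy[8] = -1.0               # a mislabeled example in the first cluster
y_noisy[[10, 14, 18]] = -1.0    # correct labels in the second cluster

L = graph_laplacian(gaussian_affinity(X))
pred_smooth = np.sign(laplacian_smoother(L, y_noisy))
pred_eig = np.sign(smooth_eigenbasis_projection(L, y_noisy))

print("Laplacian smoother agrees with ground truth:", np.array_equal(pred_smooth, y_true))
print("Eigenbasis projection agrees with ground truth:", np.array_equal(pred_eig, y_true))
```

On this toy graph both estimates are close to the per-cluster average of the given labels, so the single wrong label is outvoted by its correctly labeled neighbors. With heavier noise or less clearly separated clusters, such plain averaging would be expected to degrade, which is the regime the abstract's GTF and SEP components target.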

research
01/12/2023

Graph Laplacian for Semi-Supervised Learning

Semi-supervised learning is highly useful in common scenarios where labe...
research
06/04/2020

Rates of Convergence for Laplacian Semi-Supervised Learning with Low Labeling Rates

We study graph-based Laplacian semi-supervised learning at low labeling ...
research
09/27/2020

Analysis of label noise in graph-based semi-supervised learning

In machine learning, one must acquire labels to help supervise a model t...
research
09/14/2020

One-bit Supervision for Image Classification

This paper presents one-bit supervision, a novel setting of learning fro...
research
01/03/2022

An analysis of over-sampling labeled data in semi-supervised learning with FixMatch

Most semi-supervised learning methods over-sample labeled data when cons...
research
10/13/2020

Making Every Label Count: Handling Semantic Imprecision by Integrating Domain Knowledge

Noisy data, crawled from the web or supplied by volunteers such as Mecha...
research
03/17/2020

The Value of Nullspace Tuning Using Partial Label Information

In semi-supervised learning, information from unlabeled examples is used...
