DeepAI AI Chat
Log In Sign Up

Conformal Credal Self-Supervised Learning

by   Julian Lienen, et al.
Universität München
Universität Paderborn

In semi-supervised learning, the paradigm of self-training refers to the idea of learning from pseudo-labels suggested by the learner itself. Across various domains, corresponding methods have proven effective and achieve state-of-the-art performance. However, pseudo-labels typically stem from ad-hoc heuristics, relying on the quality of the predictions though without guaranteeing their validity. One such method, so-called credal self-supervised learning, maintains pseudo-supervision in the form of sets of (instead of single) probability distributions over labels, thereby allowing for a flexible yet uncertainty-aware labeling. Again, however, there is no justification beyond empirical effectiveness. To address this deficiency, we make use of conformal prediction, an approach that comes with guarantees on the validity of set-valued predictions. As a result, the construction of credal sets of labels is supported by a rigorous theoretical foundation, leading to better calibrated and less error-prone supervision for unlabeled data. Along with this, we present effective algorithms for learning from credal self-supervision. An empirical study demonstrates excellent calibration properties of the pseudo-supervision, as well as the competitiveness of our method on several benchmark datasets.


page 1

page 2

page 3

page 4


Credal Self-Supervised Learning

Self-training is an effective approach to semi-supervised learning. The ...

Revisiting Self-Training with Regularized Pseudo-Labeling for Tabular Data

Recent progress in semi- and self-supervised learning has caused a rift ...

Self Meta Pseudo Labels: Meta Pseudo Labels Without The Teacher

We present Self Meta Pseudo Labels, a novel semi-supervised learning met...

Repetitive Reprediction Deep Decipher for Semi-Supervised Learning

Most recent semi-supervised deep learning (deep SSL) methods used a simi...

A Benchmark Generative Probabilistic Model for Weak Supervised Learning

Finding relevant and high-quality datasets to train machine learning mod...

Conformal Prediction with Partially Labeled Data

While the predictions produced by conformal prediction are set-valued, t...

Statistical and Algorithmic Insights for Semi-supervised Learning with Self-training

Self-training is a classical approach in semi-supervised learning which ...