Don't fear the unlabelled: safe deep semi-supervised learning via simple debiasing

03/14/2022
by Hugo Schmutz, et al.

Semi-supervised learning (SSL) provides an effective means of leveraging unlabelled data to improve a model's performance. Although the field has received considerable attention in recent years, most methods share the drawback of being unsafe. By safeness we mean that including unlabelled data does not degrade a fully supervised model. Our starting point is the observation that the risk estimate that most discriminative SSL methods minimise is biased, even asymptotically. This bias makes these techniques untrustworthy without a proper validation set, but we propose a simple way of removing it. Our debiasing approach is straightforward to implement and applicable to most deep SSL methods. We provide simple theoretical guarantees on the safeness of these modified methods, without relying on the strong assumptions on the data distribution that SSL theory usually requires. We evaluate debiased versions of several existing SSL methods and show that debiasing can compete with classic deep SSL techniques in various classic settings and even performs well when traditional SSL fails.
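
The abstract does not give the form of the correction, so the following is only an illustrative sketch of what debiasing a risk estimate could look like. It assumes entropy minimisation as the SSL surrogate and assumes the correction subtracts the same surrogate evaluated on the labelled inputs. The function names, the weight `lam`, and the choice of surrogate are all hypothetical and are not taken from the text above.

```python
import numpy as np

def softmax(logits):
    # Numerically stable softmax over the class axis.
    z = logits - logits.max(axis=1, keepdims=True)
    e = np.exp(z)
    return e / e.sum(axis=1, keepdims=True)

def cross_entropy(logits, labels):
    # Mean supervised loss on the labelled batch.
    p = softmax(logits)
    return -np.log(p[np.arange(len(labels)), labels] + 1e-12).mean()

def entropy(logits):
    # Mean prediction entropy; the assumed SSL surrogate (needs no labels).
    p = softmax(logits)
    return -(p * np.log(p + 1e-12)).sum(axis=1).mean()

def biased_ssl_risk(logits_l, labels, logits_u, lam):
    # Typical discriminative SSL objective: supervised loss plus a
    # weighted surrogate computed on unlabelled inputs.
    return cross_entropy(logits_l, labels) + lam * entropy(logits_u)

def debiased_ssl_risk(logits_l, labels, logits_u, lam):
    # Assumed correction: subtract the same surrogate computed on the
    # labelled inputs, so the two surrogate terms cancel in expectation
    # when labelled and unlabelled inputs share one distribution.
    return (cross_entropy(logits_l, labels)
            + lam * entropy(logits_u)
            - lam * entropy(logits_l))
```

Under these assumptions, the added and subtracted terms have the same expectation whenever labelled and unlabelled inputs come from the same distribution, which leaves the expected supervised risk unchanged. The biased variant carries an extra `lam * entropy` term that does not vanish as the sample grows.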
