Informative Pseudo-Labeling for Graph Neural Networks with Few Labels

01/20/2022
by   Yayong Li, et al.
0

Graph Neural Networks (GNNs) have achieved state-of-the-art results for semi-supervised node classification on graphs. Nevertheless, the challenge of how to effectively learn GNNs with very few labels is still under-explored. As one of the prevalent semi-supervised methods, pseudo-labeling has been proposed to explicitly address the label scarcity problem. It aims to augment the training set with pseudo-labeled unlabeled nodes with high confidence so as to re-train a supervised model in a self-training cycle. However, the existing pseudo-labeling approaches often suffer from two major drawbacks. First, they tend to conservatively expand the label set by selecting only high-confidence unlabeled nodes without assessing their informativeness. Unfortunately, those high-confidence nodes often convey overlapping information with given labels, leading to minor improvements for model re-training. Second, these methods incorporate pseudo-labels to the same loss function with genuine labels, ignoring their distinct contributions to the classification task. In this paper, we propose a novel informative pseudo-labeling framework, called InfoGNN, to facilitate learning of GNNs with extremely few labels. Our key idea is to pseudo label the most informative nodes that can maximally represent the local neighborhoods via mutual information maximization. To mitigate the potential label noise and class-imbalance problem arising from pseudo labeling, we also carefully devise a generalized cross entropy loss with a class-balanced regularization to incorporate generated pseudo labels into model re-training. Extensive experiments on six real-world graph datasets demonstrate that our proposed approach significantly outperforms state-of-the-art baselines and strong self-supervised methods on graphs.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/31/2022

Label-Enhanced Graph Neural Network for Semi-supervised Node Classification

Graph Neural Networks (GNNs) have been widely applied in the semi-superv...
research
02/19/2023

Pseudo Contrastive Learning for Graph-based Semi-supervised Learning

Pseudo Labeling is a technique used to improve the performance of semi-s...
research
01/27/2022

Confidence May Cheat: Self-Training on Graph Neural Networks under Distribution Shift

Graph Convolutional Networks (GCNs) have recently attracted vast interes...
research
02/08/2019

Partial Label Learning with Self-Guided Retraining

Partial label learning deals with the problem where each training instan...
research
03/05/2021

Unified Robust Training for Graph NeuralNetworks against Label Noise

Graph neural networks (GNNs) have achieved state-of-the-art performance ...
research
02/16/2023

Self-supervised Guided Hypergraph Feature Propagation for Semi-supervised Classification with Missing Node Features

Graph neural networks (GNNs) with missing node features have recently re...
research
05/20/2023

Semi-Supervised Graph Imbalanced Regression

Data imbalance is easily found in annotated data when the observations o...

Please sign up or login with your details

Forgot password? Click here to reset