Online Semi-Supervised Learning with Bandit Feedback

10/23/2020
by   Sohini Upadhyay, et al.
5

We formulate a new problem at the intersectionof semi-supervised learning and contextual bandits,motivated by several applications including clini-cal trials and ad recommendations. We demonstratehow Graph Convolutional Network (GCN), a semi-supervised learning approach, can be adjusted tothe new problem formulation. We also propose avariant of the linear contextual bandit with semi-supervised missing rewards imputation. We thentake the best of both approaches to develop multi-GCN embedded contextual bandit. Our algorithmsare verified on several real world datasets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/17/2020

Online Semi-Supervised Learning in Contextual Bandits with Episodic Reward

We considered a novel practical problem of online learning with episodic...
research
09/26/2018

Graph Laplacian Regularized Graph Convolutional Networks for Semi-supervised Learning

Recently, graph convolutional network (GCN) has been widely used for sem...
research
02/24/2018

N-GCN: Multi-scale Graph Convolution for Semi-supervised Node Classification

Graph Convolutional Networks (GCNs) have shown significant improvements ...
research
04/27/2021

Semi-Supervised Joint Estimation of Word and Document Readability

Readability or difficulty estimation of words and documents has been inv...
research
05/28/2021

Detecting the hosts of bacteriophages using GCN-based semi-supervised learning

Motivation: Bacteriophages (aka phages) are viruses that infect bacteria...
research
06/08/2020

Speaker Diarization as a Fully Online Learning Problem in MiniVox

We proposed a novel AI framework to conduct real-time multi-speaker diar...
research
03/16/2021

Graph Convolutional Network for Swahili News Classification

This work empirically demonstrates the ability of Text Graph Convolution...

Please sign up or login with your details

Forgot password? Click here to reset