Two-Stage Neural Contextual Bandits for Personalised News Recommendation

06/26/2022
by   Mengyan Zhang, et al.
1

We consider the problem of personalised news recommendation where each user consumes news in a sequential fashion. Existing personalised news recommendation methods focus on exploiting user interests and ignores exploration in recommendation, which leads to biased feedback loops and hurt recommendation quality in the long term. We build on contextual bandits recommendation strategies which naturally address the exploitation-exploration trade-off. The main challenges are the computational efficiency for exploring the large-scale item space and utilising the deep representations with uncertainty. We propose a two-stage hierarchical topic-news deep contextual bandits framework to efficiently learn user preferences when there are many news items. We use deep learning representations for users and news, and generalise the neural upper confidence bound (UCB) policies to generalised additive UCB and bilinear UCB. Empirical results on a large-scale news recommendation dataset show that our proposed policies are efficient and outperform the baseline bandit policies.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/31/2020

Graph Enhanced Representation Learning for News Recommendation

With the explosion of online news, personalized news recommendation beco...
research
09/26/2021

Deep Exploration for Recommendation Systems

We investigate the design of recommendation systems that can efficiently...
research
06/27/2012

Hierarchical Exploration for Accelerating Contextual Bandits

Contextual bandit learning is an increasingly popular approach to optimi...
research
06/25/2021

Knowledge Infused Policy Gradients with Upper Confidence Bound for Relational Bandits

Contextual Bandits find important use cases in various real-life scenari...
research
08/03/2020

Deep Bayesian Bandits: Exploring in Online Personalized Recommendations

Recommender systems trained in a continuous learning fashion are plagued...
research
06/30/2017

Towards Bursting Filter Bubble via Contextual Risks and Uncertainties

A rising topic in computational journalism is how to enhance the diversi...

Please sign up or login with your details

Forgot password? Click here to reset