Neural Collaborative Filtering Bandits via Meta Learning

01/31/2022
by   Yikun Ban, et al.
0

Contextual multi-armed bandits provide powerful tools to solve the exploitation-exploration dilemma in decision making, with direct applications in the personalized recommendation. In fact, collaborative effects among users carry the significant potential to improve the recommendation. In this paper, we introduce and study the problem by exploring `Neural Collaborative Filtering Bandits', where the rewards can be non-linear functions and groups are formed dynamically given different specific contents. To solve this problem, inspired by meta-learning, we propose Meta-Ban (meta-bandits), where a meta-learner is designed to represent and rapidly adapt to dynamic groups, along with a UCB-based exploration strategy. Furthermore, we analyze that Meta-Ban can achieve the regret bound of 𝒪(√(T log T)), improving a multiplicative factor √(log T) over state-of-the-art related works. In the end, we conduct extensive experiments showing that Meta-Ban significantly outperforms six strong baselines.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/11/2015

Collaborative Filtering Bandits

Classical collaborative filtering, and content-based filtering methods t...
research
09/07/2019

AutoML for Contextual Bandits

Contextual Bandits is one of the widely popular techniques used in appli...
research
07/04/2020

Neural Interactive Collaborative Filtering

In this paper, we study collaborative filtering in an interactive settin...
research
02/11/2021

Meta-Thompson Sampling

Efficient exploration in multi-armed bandits is a fundamental online lea...
research
09/04/2023

Interactive Graph Convolutional Filtering

Interactive Recommender Systems (IRS) have been increasingly used in var...
research
03/09/2021

u-cf2vec: Representation Learning for Personalized Algorithm Selection in Recommender Systems

Collaborative Filtering (CF) has become the standard approach to solve r...
research
05/26/2022

Collaborative Distillation Meta Learning for Simulation Intensive Hardware Design

This paper proposes a novel collaborative distillation meta learning (CD...

Please sign up or login with your details

Forgot password? Click here to reset