Best Arm Identification in Graphical Bilinear Bandits

12/14/2020
by   Geovani Rizk, et al.
0

We introduce a new graphical bilinear bandit problem where a learner (or a central entity) allocates arms to the nodes of a graph and observes for each edge a noisy bilinear reward representing the interaction between the two end nodes. We study the best arm identification problem in which the learner wants to find the graph allocation maximizing the sum of the bilinear rewards. By efficiently exploiting the geometry of this bandit problem, we propose a somehow decentralized allocation strategy based on random sampling with theoretical guarantees. In particular, we characterize the influence of the graph structure (e.g. star, complete or circle) on the convergence rate and propose empirical experiments that confirm this dependency.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/01/2022

An α-No-Regret Algorithm For Graphical Bilinear Bandits

We propose the first regret-based approach to the Graphical Bilinear Ban...
research
06/05/2023

Covariance Adaptive Best Arm Identification

We consider the problem of best arm identification in the multi-armed ba...
research
01/08/2019

Bilinear Bandits with Low-rank Structure

We introduce the bilinear bandit problem with low-rank structure where a...
research
05/20/2020

Best Arm Identification in Spectral Bandits

We study best-arm identification with fixed confidence in bandit models ...
research
05/24/2022

Optimality Conditions and Algorithms for Top-K Arm Identification

We consider the top-k arm identification problem for multi-armed bandits...
research
11/17/2016

Unimodal Thompson Sampling for Graph-Structured Arms

We study, to the best of our knowledge, the first Bayesian algorithm for...

Please sign up or login with your details

Forgot password? Click here to reset