Contextual Information-Directed Sampling

05/22/2022
by   Botao Hao, et al.
0

Information-directed sampling (IDS) has recently demonstrated its potential as a data-efficient reinforcement learning algorithm. However, it is still unclear what is the right form of information ratio to optimize when contextual information is available. We investigate the IDS design through two contextual bandit problems: contextual bandits with graph feedback and sparse linear contextual bandits. We provably demonstrate the advantage of contextual IDS over conditional IDS and emphasize the importance of considering the context distribution. The main message is that an intelligent agent should invest more on the actions that are beneficial for the future unseen contexts while the conditional IDS can be myopic. We further propose a computationally-efficient version of contextual IDS based on Actor-Critic and evaluate it empirically on a neural network contextual bandit.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/14/2022

Efficient Contextual Bandits with Knapsacks via Regression

We consider contextual bandits with knapsacks (CBwK), a variant of the c...
research
04/10/2019

Charging control of electric vehicles using contextual bandits considering the electrical distribution grid

With the proliferation of electric vehicles, the electrical distribution...
research
06/16/2022

A Contextual Combinatorial Semi-Bandit Approach to Network Bottleneck Identification

Bottleneck identification is a challenging task in network analysis, esp...
research
02/21/2020

Online Learning in Contextual Bandits using Gated Linear Networks

We introduce a new and completely online contextual bandit algorithm cal...
research
01/02/2019

Warm-starting Contextual Bandits: Robustly Combining Supervised and Bandit Feedback

We investigate the feasibility of learning from both fully-labeled super...
research
02/27/2018

Robust Actor-Critic Contextual Bandit for Mobile Health (mHealth) Interventions

We consider the actor-critic contextual bandit for the mobile health (mH...
research
10/15/2020

Blending Search and Discovery: Tag-Based Query Refinement with Contextual Reinforcement Learning

We tackle tag-based query refinement as a mobile-friendly alternative to...

Please sign up or login with your details

Forgot password? Click here to reset