Playing with and against Hedge

11/20/2018
by   Miltiades E. Anagnostou, et al.
0

Hedge has been proposed as an adaptive scheme, which guides an agent's decision in resource selection and distribution problems that can be modeled as a multi-armed bandit full information game. Such problems are encountered in the areas of computer and communication networks, e.g. network path selection, load distribution, network interdiction, and also in problems in the area of transportation. We study Hedge under the assumption that the total loss that can be suffered by the player in each round is upper bounded. In this paper, we study the worst performance of Hedge.

READ FULL TEXT
research
07/17/2017

Online Multi-Armed Bandit

We introduce a novel variant of the multi-armed bandit problem, in which...
research
03/13/2015

Interactive Restless Multi-armed Bandit Game and Swarm Intelligence Effect

We obtain the conditions for the emergence of the swarm intelligence eff...
research
10/02/2015

A Survey of Online Experiment Design with the Stochastic Multi-Armed Bandit

Adaptive and sequential experiment design is a well-studied area in nume...
research
02/05/2019

Reinforcement Learning for Optimal Load Distribution Sequencing in Resource-Sharing System

Divisible Load Theory (DLT) is a powerful tool for modeling divisible lo...
research
06/10/2021

A Central Limit Theorem, Loss Aversion and Multi-Armed Bandits

This paper establishes a central limit theorem under the assumption that...
research
05/03/2021

Mean Field Equilibrium in Multi-Armed Bandit Game with Continuous Reward

Mean field game facilitates analyzing multi-armed bandit (MAB) for a lar...
research
09/13/2021

Machine Learning for Online Algorithm Selection under Censored Feedback

In online algorithm selection (OAS), instances of an algorithmic problem...

Please sign up or login with your details

Forgot password? Click here to reset