A Problem-Adaptive Algorithm for Resource Allocation

02/12/2019
by   Xavier Fontaine, et al.
0

We consider a sequential stochastic resource allocation problem under the gradient feedback, where the reward of each resource is concave. We construct a generic algorithm that is adaptive to the complexity of the problem, which is measured using the exponent in Łojasiewicz inequality. Our algorithm interpolates between the non-strongly concave and the strongly-concave rates without depending on the strong-concavity parameter and recover the fast rate of classical multi-armed bandit (corresponding roughly to linear reward functions).

READ FULL TEXT
research
05/10/2021

Combinatorial Multi-armed Bandits for Resource Allocation

We study the sequential resource allocation problem where a decision mak...
research
04/16/2018

Distributed, Private, and Derandomized Allocation Algorithm for EV Charging

Efficient resource allocation is challenging when privacy of users is im...
research
12/13/2020

Adaptive Algorithms for Multi-armed Bandit with Composite and Anonymous Feedback

We study the multi-armed bandit (MAB) problem with composite and anonymo...
research
12/09/2022

Networked Restless Bandits with Positive Externalities

Restless multi-armed bandits are often used to model budget-constrained ...
research
09/22/2019

Distributed Conjugate Gradient Tracking for Resource Allocation in Unbalanced Networks

This paper proposes a distributed conjugate gradient tracking algorithm ...
research
01/06/2016

On Bayesian index policies for sequential resource allocation

This paper is about index policies for minimizing (frequentist) regret i...
research
03/25/2023

Hierarchical Multi-Agent Multi-Armed Bandit for Resource Allocation in Multi-LEO Satellite Constellation Networks

Low Earth orbit (LEO) satellite constellation is capable of providing gl...

Please sign up or login with your details

Forgot password? Click here to reset