The Impact of the Geometric Properties of the Constraint Set in Safe Optimization with Bandit Feedback

05/01/2023
by   Spencer Hutchinson, et al.
0

We consider a safe optimization problem with bandit feedback in which an agent sequentially chooses actions and observes responses from the environment, with the goal of maximizing an arbitrary function of the response while respecting stage-wise constraints. We propose an algorithm for this problem, and study how the geometric properties of the constraint set impact the regret of the algorithm. In order to do so, we introduce the notion of the sharpness of a particular constraint set, which characterizes the difficulty of performing learning within the constraint set in an uncertain setting. This concept of sharpness allows us to identify the class of constraint sets for which the proposed algorithm is guaranteed to enjoy sublinear regret. Simulation results for this algorithm support the sublinear regret bound and provide empirical evidence that the sharpness of the constraint set impacts the performance of the algorithm.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/18/2022

Safe Online Bid Optimization with Return-On-Investment and Budget Constraints subject to Uncertainty

In online marketing, the advertisers' goal is usually a tradeoff between...
research
10/28/2020

Provably Efficient Online Agnostic Learning in Markov Games

We study online agnostic learning, a problem that arises in episodic mul...
research
08/29/2023

Exploiting Problem Geometry in Safe Linear Bandits

The safe linear bandit problem is a version of the classic linear bandit...
research
10/27/2022

Lifelong Bandit Optimization: No Prior and No Regret

In practical applications, machine learning algorithms are often repeate...
research
11/06/2019

Safe Linear Thompson Sampling

The design and performance analysis of bandit algorithms in the presence...
research
08/12/2020

Non-Stochastic Control with Bandit Feedback

We study the problem of controlling a linear dynamical system with adver...
research
11/14/2018

Incentivizing Exploration with Unbiased Histories

In a social learning setting, there is a set of actions, each of which h...

Please sign up or login with your details

Forgot password? Click here to reset