Mixed Strategies for Robust Optimization of Unknown Objectives

02/28/2020
by   Pier Giuseppe Sessa, et al.
8

We consider robust optimization problems, where the goal is to optimize an unknown objective function against the worst-case realization of an uncertain parameter. For this setting, we design a novel sample-efficient algorithm GP-MRO, which sequentially learns about the unknown objective from noisy point evaluations. GP-MRO seeks to discover a robust and randomized mixed strategy, that maximizes the worst-case expected objective value. To achieve this, it combines techniques from online learning with nonparametric confidence bounds from Gaussian processes. Our theoretical results characterize the number of samples required by GP-MRO to discover a robust near-optimal mixed strategy for different GP kernels of interest. We experimentally demonstrate the performance of our algorithm on synthetic datasets and on human-assisted trajectory planning tasks for autonomous vehicles. In our simulations, we show that robust deterministic strategies can be overly conservative, while the mixed strategies found by GP-MRO significantly improve the overall performance.

READ FULL TEXT

page 7

page 8

research
10/21/2015

Optimization as Estimation with Gaussian Processes in Bandit Settings

Recently, there has been rising interest in Bayesian optimization -- the...
research
10/25/2018

Adversarially Robust Optimization with Gaussian Processes

In this paper, we consider the problem of Gaussian process (GP) optimiza...
research
07/04/2017

Robust Optimization for Non-Convex Objectives

We consider robust optimization problems, where the goal is to optimize ...
research
08/07/2023

Learning-based Near-optimal Motion Planning for Intelligent Vehicles with Uncertain Dynamics

Motion planning has been an important research topic in achieving safe a...
research
03/16/2023

Learning-Based Modeling of Human-Autonomous Vehicle Interaction for Enhancing Safety in Mixed-Vehicle Platooning Control

As autonomous vehicles (AVs) become more prevalent on public roads, they...
research
03/17/2020

The value of randomized strategies in distributionally robust risk averse network interdiction games

Conditional Value at Risk (CVaR) is widely used to account for the prefe...
research
03/09/2023

Robust Social Welfare Maximization via Information Design in Linear-Quadratic-Gaussian Games

Information design in an incomplete information game includes a designer...

Please sign up or login with your details

Forgot password? Click here to reset