Strategizing against No-regret Learners

09/30/2019
by   Yuan Deng, et al.
0

How should a player who repeatedly plays a game against a no-regret learner strategize to maximize his utility? We study this question and show that under some mild assumptions, the player can always guarantee himself a utility of at least what he would get in a Stackelberg equilibrium of the game. When the no-regret learner has only two actions, we show that the player cannot get any higher utility than the Stackelberg equilibrium utility. But when the no-regret learner has more than two actions and plays a mean-based no-regret strategy, we show that the player can get strictly higher than the Stackelberg equilibrium utility. We provide a characterization of the optimal game-play for the player against a mean-based no-regret learner as a solution to a control problem. When the no-regret learner's strategy also guarantees him a no-swap regret, we show that the player cannot get anything higher than a Stackelberg equilibrium utility.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/31/2023

Is Learning in Games Good for the Learners?

We consider a number of questions related to tradeoffs between reward an...
research
05/17/2022

Strategizing against Learners in Bayesian Games

We study repeated two-player games where one of the players, the learner...
research
08/16/2021

Near-Optimal No-Regret Learning in General Games

We show that Optimistic Hedge – a common variant of multiplicative-weigh...
research
07/22/2020

Exploiting No-Regret Algorithms in System Design

We investigate a repeated two-player zero-sum game setting where the col...
research
08/18/2019

Geometrical Regret Matching of Mixed Strategies

We argue that the existing regret matchings for equilibrium approximatio...
research
02/12/2018

Equilibrium solutions of three player Kuhn poker with N>3 cards: A new numerical method using regularization and arc-length continuation

We study the equilibrium solutions of three player Kuhn poker with N>3 c...
research
09/11/2019

Combinatorial Bandits for Sequential Learning in Colonel Blotto Games

The Colonel Blotto game is a renowned resource allocation problem with a...

Please sign up or login with your details

Forgot password? Click here to reset