Differentiable Arbitrating in Zero-sum Markov Games

02/20/2023
by   Jing Wang, et al.
0

We initiate the study of how to perturb the reward in a zero-sum Markov game with two players to induce a desirable Nash equilibrium, namely arbitrating. Such a problem admits a bi-level optimization formulation. The lower level requires solving the Nash equilibrium under a given reward function, which makes the overall problem challenging to optimize in an end-to-end way. We propose a backpropagation scheme that differentiates through the Nash equilibrium, which provides the gradient feedback for the upper level. In particular, our method only requires a black-box solver for the (regularized) Nash equilibrium (NE). We develop the convergence analysis for the proposed framework with proper black-box NE solvers and demonstrate the empirical successes in two multi-agent reinforcement learning (MARL) environments.

READ FULL TEXT
research
07/18/2019

Optimal Bi-level Lottery Design for Multi-agent Systems

We consider a bi-level lottery where a social planner at the high level ...
research
06/13/2023

On Faking a Nash Equilibrium

We characterize offline data poisoning attacks on Multi-Agent Reinforcem...
research
11/08/2019

Bridging Bayesian and Minimax Mean Square Error Estimation via Wasserstein Distributionally Robust Optimization

We introduce a distributionally robust minimium mean square error estima...
research
11/10/2022

Slightly Altruistic Nash Equilibrium for Multi-agent Pursuit-Evasion Games With Input Constraints

This is an initial manuscript that presents the basic idea of "slightly ...
research
04/11/2023

Cooperative Coevolution for Non-Separable Large-Scale Black-Box Optimization: Convergence Analyses and Distributed Accelerations

Given the ubiquity of non-separable optimization problems in real worlds...
research
02/08/2023

Non-zero-sum Game Control for Multi-vehicle Driving via Reinforcement Learning

When a vehicle drives on the road, its behaviors will be affected by sur...
research
08/12/2021

On Liquidity Mining for Uniswap v3

The recently proposed Uniswap v3 replaces the fungible liquidity provide...

Please sign up or login with your details

Forgot password? Click here to reset