Independent Learning in Stochastic Games

11/23/2021
by Asuman Ozdaglar, et al.

Reinforcement learning (RL) has recently achieved tremendous success in many artificial intelligence applications. Many of the most prominent applications of RL involve multiple agents, e.g., playing chess and Go, autonomous driving, and robotics. Unfortunately, the framework upon which classical RL builds is inappropriate for multi-agent learning, as it assumes that an agent's environment is stationary and does not take into account the adaptivity of other agents. In this review paper, we present the model of stochastic games for multi-agent learning in dynamic environments. We focus on the development of simple and independent learning dynamics for stochastic games: each agent is myopic and chooses best-response type actions to the other agents' strategies without any coordination with her opponents. There has been limited progress on developing convergent best-response type independent learning dynamics for stochastic games. We present our recently proposed simple and independent learning dynamics that guarantee convergence in zero-sum stochastic games, together with a review of other contemporaneous algorithms for dynamic multi-agent learning in this setting. Along the way, we also reexamine some classical results from both the game theory and RL literature, to situate both the conceptual contributions of our independent learning dynamics and the mathematical novelties of our analysis. We hope this review paper serves as an impetus for a resurgence in the study of independent and natural learning dynamics in game theory, in the more challenging setting of a dynamic environment.
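
To make "best-response type" dynamics concrete, the following is a minimal sketch of classical fictitious play in a zero-sum matrix game, which can be viewed as a stochastic game with a single state. It is not the learning dynamics proposed in the paper; the function name `fictitious_play`, the payoff convention, and the tie-breaking rule are assumptions made for illustration only. Each player tracks how often the opponent has played each action and myopically best-responds to that empirical frequency, with no coordination between the players.

```python
def fictitious_play(payoff, steps):
    """Simultaneous fictitious play in a two-player zero-sum matrix game.

    payoff[i][j] is the row player's payoff when row plays i and column
    plays j; the column player receives -payoff[i][j] (assumed convention).
    Returns the empirical action frequencies of both players.
    """
    n_rows, n_cols = len(payoff), len(payoff[0])
    row_counts = [0] * n_rows
    col_counts = [0] * n_cols

    # Arbitrary initial actions (an assumption; any start works).
    row_counts[0] += 1
    col_counts[0] += 1

    for _ in range(steps - 1):
        # Value of each row action against the column player's past play.
        row_values = [
            sum(payoff[i][j] * col_counts[j] for j in range(n_cols))
            for i in range(n_rows)
        ]
        # Value of each column action against the row player's past play.
        col_values = [
            -sum(payoff[i][j] * row_counts[i] for i in range(n_rows))
            for j in range(n_cols)
        ]
        # Myopic best responses; ties go to the lowest-indexed action.
        a = max(range(n_rows), key=row_values.__getitem__)
        b = max(range(n_cols), key=col_values.__getitem__)
        row_counts[a] += 1
        col_counts[b] += 1

    row_freq = [c / steps for c in row_counts]
    col_freq = [c / steps for c in col_counts]
    return row_freq, col_freq


# Matching pennies: the unique equilibrium mixes both actions equally.
matching_pennies = [[1, -1], [-1, 1]]
row_freq, col_freq = fictitious_play(matching_pennies, 20000)
print(row_freq, col_freq)
```

In matching pennies the actions themselves keep cycling, while the empirical frequencies approach the equilibrium mix. Each player uses only her own payoffs and the opponent's observed actions, which is the sense in which such dynamics are independent.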


