Continuous Strategy Replicator Dynamics for Multi-Agent Learning

04/29/2009
by Aram Galstyan, et al.

The problem of multi-agent learning and adaptation has attracted a great deal of attention in recent years. It has been suggested that the dynamics of multi-agent learning can be studied using replicator equations from population biology. Most existing studies have been limited to discrete strategy spaces with a small number of available actions. In many cases, however, the choices available to agents are better characterized by continuous spectra. This paper suggests a generalization of the replicator framework that makes it possible to study the adaptive dynamics of Q-learning agents with continuous strategy spaces. Instead of probability vectors, agents' strategies are now characterized by probability measures over continuous variables. As a result, the ordinary differential equations for the discrete case are replaced by a system of coupled integro-differential replicator equations that describe the mutual evolution of individual agent strategies. We derive a set of functional equations describing the steady state of the replicator dynamics, examine their solutions for several two-player games, and confirm our analytical results using simulations.
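The abstract does not reproduce the equations themselves, so the following is only a rough sketch of the kind of dynamics it describes, not the paper's own formulation. It assumes the replicator form with a Boltzmann exploration term that is familiar from the discrete-strategy literature, and approximates the integrals over a continuous strategy interval by Riemann sums on a grid. The function name `replicator_step`, the `temperature` parameter, the grid, and the example payoff are all hypothetical choices made for illustration.

```python
import math


def replicator_step(p, q, payoff, dx, temperature, dt):
    """One explicit Euler step for a strategy density p on a uniform grid.

    Assumed (not taken from the paper) growth rate for each grid point i:
        (u_i - <u>) - temperature * (log p_i - <log p>)
    where u_i approximates the integral of payoff(x_i, y) * q(y) dy,
    and <.> denotes the average under p. All integrals are Riemann sums.
    """
    n = len(p)
    m = len(q)
    # Expected payoff of each grid strategy against the opponent density q.
    u = [sum(payoff[i][j] * q[j] for j in range(m)) * dx for i in range(n)]
    mean_u = sum(p[i] * u[i] for i in range(n)) * dx
    # Exploration term: pushes the density back toward higher entropy.
    log_p = [math.log(v) for v in p]
    mean_log_p = sum(p[i] * log_p[i] for i in range(n)) * dx
    new_p = []
    for i in range(n):
        growth = (u[i] - mean_u) - temperature * (log_p[i] - mean_log_p)
        new_p.append(p[i] * (1.0 + dt * growth))
    # Renormalize so the discretized density still integrates to one.
    total = sum(new_p) * dx
    return [v / total for v in new_p]


if __name__ == "__main__":
    # Hypothetical example: strategies on [0, 1], midpoint grid,
    # payoff A(x, y) = x against a fixed uniform opponent density.
    n = 21
    dx = 1.0 / n
    xs = [(i + 0.5) * dx for i in range(n)]
    payoff = [[xs[i] for _ in range(n)] for i in range(n)]
    p = [1.0] * n
    q = [1.0] * n
    for _ in range(2000):
        p = replicator_step(p, q, payoff, dx, temperature=0.5, dt=0.1)
    for x, v in zip(xs, p):
        print(f"{x:.3f}  {v:.4f}")
```

Under these assumptions, a stationary density satisfies u(x) - T log p(x) = const, so p(x) is proportional to exp(u(x)/T). That is one simple instance of a functional steady-state condition; the conditions actually derived in the paper may differ. For a two-player game, the same step would be applied to both densities in turn, each using its own payoff function.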

