Consolidation via Policy Information Regularization in Deep RL for Multi-Agent Games

11/23/2020
by   Tyler Malloy, et al.
0

This paper introduces an information-theoretic constraint on learned policy complexity in the Multi-Agent Deep Deterministic Policy Gradient (MADDPG) reinforcement learning algorithm. Previous research with a related approach in continuous control experiments suggests that this method favors learning policies that are more robust to changing environment dynamics. The multi-agent game setting naturally requires this type of robustness, as other agents' policies change throughout learning, introducing a nonstationary environment. For this reason, recent methods in continual learning are compared to our approach, termed Capacity-Limited MADDPG. Results from experimentation in multi-agent cooperative and competitive tasks demonstrate that the capacity-limited approach is a good candidate for improving learning performance in these environments.

READ FULL TEXT

page 7

page 8

page 9

research
12/17/2020

MAGNet: Multi-agent Graph Network for Deep Multi-agent Reinforcement Learning

Over recent years, deep reinforcement learning has shown strong successe...
research
05/04/2023

Stackelberg Games for Learning Emergent Behaviors During Competitive Autocurricula

Autocurricular training is an important sub-area of multi-agent reinforc...
research
05/01/2020

Smart Containers With Bidding Capacity: A Policy Gradient Algorithm for Semi-Cooperative Learning

Smart modular freight containers – as propagated in the Physical Interne...
research
03/22/2022

Is Vanilla Policy Gradient Overlooked? Analyzing Deep Reinforcement Learning for Hanabi

In pursuit of enhanced multi-agent collaboration, we analyze several on-...
research
02/10/2023

Learning cooperative behaviours in adversarial multi-agent systems

This work extends an existing virtual multi-agent platform called RoboSu...
research
05/07/2023

Multi-agent Continual Coordination via Progressive Task Contextualization

Cooperative Multi-agent Reinforcement Learning (MARL) has attracted sign...
research
08/16/2019

Iterative Update and Unified Representation for Multi-Agent Reinforcement Learning

Multi-agent systems have a wide range of applications in cooperative and...

Please sign up or login with your details

Forgot password? Click here to reset