Reducing Overestimation Bias in Multi-Agent Domains Using Double Centralized Critics

10/03/2019
by   Johannes Ackermann, et al.
0

Many real world tasks require multiple agents to work together. Multi-agent reinforcement learning (RL) methods have been proposed in recent years to solve these tasks, but current methods often fail to efficiently learn policies. We thus investigate the presence of a common weakness in single-agent RL, namely value function overestimation bias, in the multi-agent setting. Based on our findings, we propose an approach that reduces this bias by using double centralized critics. We evaluate it on six mixed cooperative-competitive tasks, showing a significant advantage over current methods. Finally, we investigate the application of multi-agent methods to high-dimensional robotic tasks and show that our approach can be used to learn decentralized policies in this domain.

READ FULL TEXT

page 5

page 7

research
09/13/2018

CM3: Cooperative Multi-goal Multi-stage Multi-agent Reinforcement Learning

We propose CM3, a new deep reinforcement learning method for cooperative...
research
07/14/2021

Centralized Model and Exploration Policy for Multi-Agent RL

Reinforcement learning (RL) in partially observable, fully cooperative m...
research
11/04/2020

Moving Forward in Formation: A Decentralized Hierarchical Learning Approach to Multi-Agent Moving Together

Multi-agent path finding in formation has many potential real-world appl...
research
10/13/2021

Competitive Multi-Agent Load Balancing with Adaptive Policies in Wireless Networks

Using Machine Learning (ML) techniques for the next generation wireless ...
research
10/20/2022

Co-Training an Observer and an Evading Target

Reinforcement learning (RL) is already widely applied to applications su...
research
10/19/2022

Oracles Followers: Stackelberg Equilibria in Deep Multi-Agent Reinforcement Learning

Stackelberg equilibria arise naturally in a range of popular learning pr...
research
12/01/2019

MANELA: A Multi-Agent Algorithm for Learning Network Embeddings

Playing an essential role in data mining, machine learning has a long hi...

Please sign up or login with your details

Forgot password? Click here to reset