Centralized Training with Hybrid Execution in Multi-Agent Reinforcement Learning

10/12/2022
by Pedro P. Santos, et al.

We introduce hybrid execution in multi-agent reinforcement learning (MARL), a new paradigm in which agents aim to complete cooperative tasks under any communication level at execution time by exploiting whatever information is shared among them. Under hybrid execution, the communication level can range from no communication between agents (fully decentralized) to full communication (fully centralized). To formalize this setting, we define a new class of multi-agent partially observable Markov decision processes (POMDPs), which we name hybrid-POMDPs, that explicitly models a communication process between the agents. We contribute MARO, an approach that combines an autoregressive predictive model, used to estimate the missing observations of other agents, with a dropout-based RL training scheme that simulates different communication levels during the centralized training phase. We evaluate MARO on standard scenarios and on extensions of previous benchmarks tailored to emphasize the negative impact of partial observability in MARL. Experimental results show that our method consistently outperforms baselines, allowing agents to act under faulty communication while still exploiting shared information.
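To make the idea of simulating communication levels during training more concrete, the following is a minimal sketch and not MARO's actual implementation. The abstract does not say how the dropout is applied, so the sketch assumes that each agent always keeps its own observation and that every other agent's observation is dropped independently with probability 1 - comm_level. The function name `mask_shared_observations`, the `comm_level` parameter, and the use of `None` for a missing observation are all hypothetical.

```python
import random
from typing import List, Optional, Sequence

Observation = Sequence[float]


def mask_shared_observations(
    observations: List[Observation],
    comm_level: float,
    rng: random.Random,
) -> List[List[Optional[Observation]]]:
    """Return, for each agent, the observations it receives this step.

    Assumption (not taken from the paper): each link i <- j is kept
    independently with probability comm_level, and an agent always
    sees its own observation. Dropped entries are represented as None.
    """
    if not 0.0 <= comm_level <= 1.0:
        raise ValueError("comm_level must lie in [0, 1]")

    n_agents = len(observations)
    views: List[List[Optional[Observation]]] = []
    for i in range(n_agents):
        view: List[Optional[Observation]] = []
        for j in range(n_agents):
            # rng.random() is in [0, 1), so comm_level = 0 drops every
            # other agent and comm_level = 1 keeps every other agent.
            if i == j or rng.random() < comm_level:
                view.append(observations[j])
            else:
                view.append(None)
        views.append(view)
    return views


# Hypothetical usage: draw one communication level, then mask a joint observation.
rng = random.Random(0)
joint_obs = [[0.1, 0.2], [0.3, 0.4], [0.5, 0.6]]
comm_level = rng.random()
views = mask_shared_observations(joint_obs, comm_level, rng)
```

With comm_level set to 0 each agent sees only its own observation (the fully decentralized end of the range), and with comm_level set to 1 each agent sees all observations (the fully centralized end). In an approach like the one described above, the `None` entries are what a predictive model would be asked to estimate.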


research
06/06/2022

Consensus Learning for Cooperative Multi-Agent Reinforcement Learning

Almost all multi-agent reinforcement learning algorithms without communi...
research
02/22/2022

A Decentralized Communication Framework based on Dual-Level Recurrence for Multi-Agent Reinforcement Learning

We propose a model enabling decentralized multiple agents to share their...
research
04/28/2023

From Explicit Communication to Tacit Cooperation: A Novel Paradigm for Cooperative MARL

Centralized training with decentralized execution (CTDE) is a widely-use...
research
08/04/2022

Transferable Multi-Agent Reinforcement Learning with Dynamic Participating Agents

We study multi-agent reinforcement learning (MARL) with centralized trai...
research
10/25/2021

Common Information based Approximate State Representations in Multi-Agent Reinforcement Learning

Due to information asymmetry, finding optimal policies for Decentralized...
research
04/07/2022

Robust Event-Driven Interactions in Cooperative Multi-Agent Learning

We present an approach to reduce the communication required between agen...
research
07/12/2023

Learning Decentralized Partially Observable Mean Field Control for Artificial Collective Behavior

Recent reinforcement learning (RL) methods have achieved success in vari...
