DeepAI AI Chat
Log In Sign Up

R-MADDPG for Partially Observable Environments and Limited Communication

by   Rose E. Wang, et al.

There are several real-world tasks that would ben-efit from applying multiagent reinforcement learn-ing (MARL) algorithms, including the coordina-tion among self-driving cars. The real world haschallenging conditions for multiagent learningsystems, such as its partial observable and nonsta-tionary nature. Moreover, if agents must share alimited resource (e.g. network bandwidth) theymust all learn how to coordinate resource use.(Hochreiter Schmidhuber, 1997) This paper in-troduces a deep recurrent multiagent actor-criticframework (R-MADDPG) for handling multia-gent coordination under partial observable set-tings and limited communication. We investigaterecurrency effects on performance and commu-nication use of a team of agents. We demon-strate that the resulting framework learns time-dependencies for sharing missing observations,handling resource limitations, and developing dif-ferent communication patterns among agents.


page 1

page 2

page 3

page 4


Coordination in Adversarial Sequential Team Games via Multi-Agent Deep Reinforcement Learning

Many real-world applications involve teams of agents that have to coordi...

A Decentralized Communication Framework based on Dual-Level Recurrence for Multi-Agent Reinforcement Learning

We propose a model enabling decentralized multiple agents to share their...

Decentralized Coordination in Partially Observable Queueing Networks

We consider communication in a fully cooperative multi-agent system, whe...

Soft Hierarchical Graph Recurrent Networks for Many-Agent Partially Observable Environments

The recent progress in multi-agent deep reinforcement learning(MADRL) ma...

Vehicle Community Strategies

Interest in emergent communication has recently surged in Machine Learni...

More Like Real World Game Challenge for Partially Observable Multi-Agent Cooperation

Some standardized environments have been designed for partially observab...

Vehicle Communication Strategies for Simulated Highway Driving

Interest in emergent communication has recently surged in Machine Learni...