MASER: Multi-Agent Reinforcement Learning with Subgoals Generated from Experience Replay Buffer

06/20/2022
by   Jeewon Jeon, et al.
0

In this paper, we consider cooperative multi-agent reinforcement learning (MARL) with sparse reward. To tackle this problem, we propose a novel method named MASER: MARL with subgoals generated from experience replay buffer. Under the widely-used assumption of centralized training with decentralized execution and consistent Q-value decomposition for MARL, MASER automatically generates proper subgoals for multiple agents from the experience replay buffer by considering both individual Q-value and total Q-value. Then, MASER designs individual intrinsic reward for each agent based on actionable representation relevant to Q-learning so that the agents reach their subgoals while maximizing the joint action value. Numerical results show that MASER significantly outperforms StarCraft II micromanagement benchmark compared to other state-of-the-art MARL algorithms.

READ FULL TEXT

page 7

page 8

research
06/05/2019

Exploration with Unreliable Intrinsic Reward in Multi-Agent Reinforcement Learning

This paper investigates the use of intrinsic reward to guide exploration...
research
03/16/2023

SVDE: Scalable Value-Decomposition Exploration for Cooperative Multi-Agent Reinforcement Learning

Value-decomposition methods, which reduce the difficulty of a multi-agen...
research
01/25/2023

Discriminative Experience Replay for Efficient Multi-agent Reinforcement Learning

In cooperative multi-agent tasks, parameter sharing among agents is a co...
research
10/02/2020

Correcting Experience Replay for Multi-Agent Communication

We consider the problem of learning to communicate using multi-agent rei...
research
07/30/2022

Reinforcement learning with experience replay and adaptation of action dispersion

Effective reinforcement learning requires a proper balance of exploratio...
research
07/18/2019

Prioritized Guidance for Efficient Multi-Agent Reinforcement Learning Exploration

Exploration efficiency is a challenging problem in multi-agent reinforce...
research
03/24/2022

Remember and Forget Experience Replay for Multi-Agent Reinforcement Learning

We present the extension of the Remember and Forget for Experience Repla...

Please sign up or login with your details

Forgot password? Click here to reset