Agent-Time Attention for Sparse Rewards Multi-Agent Reinforcement Learning

10/31/2022
by   Jennifer She, et al.
0

Sparse and delayed rewards pose a challenge to single agent reinforcement learning. This challenge is amplified in multi-agent reinforcement learning (MARL) where credit assignment of these rewards needs to happen not only across time, but also across agents. We propose Agent-Time Attention (ATA), a neural network model with auxiliary losses for redistributing sparse and delayed rewards in collaborative MARL. We provide a simple example that demonstrates how providing agents with their own local redistributed rewards and shared global redistributed rewards motivate different policies. We extend several MiniGrid environments, specifically MultiRoom and DoorKey, to the multi-agent sparse delayed rewards setting. We demonstrate that ATA outperforms various baselines on many instances of these environments. Source code of the experiments is available at https://github.com/jshe/agent-time-attention.

READ FULL TEXT

page 8

page 9

page 10

research
05/28/2019

Coordinated Exploration via Intrinsic Rewards for Multi-Agent Reinforcement Learning

Sparse rewards are one of the most important challenges in reinforcement...
research
11/23/2022

Contrastive Identity-Aware Learning for Multi-Agent Value Decomposition

Value Decomposition (VD) aims to deduce the contributions of agents for ...
research
04/25/2023

Centralized control for multi-agent RL in a complex Real-Time-Strategy game

Multi-agent Reinforcement learning (MARL) studies the behaviour of multi...
research
06/03/2023

MA2CL:Masked Attentive Contrastive Learning for Multi-Agent Reinforcement Learning

Recent approaches have utilized self-supervised auxiliary tasks as repre...
research
03/03/2023

CoRL: Environment Creation and Management Focused on System Integration

Existing reinforcement learning environment libraries use monolithic env...
research
05/15/2015

Reinforcement Learning applied to Single Neuron

This paper extends the reinforcement learning ideas into the multi-agent...
research
01/01/2020

Long-Term Visitation Value for Deep Exploration in Sparse Reward Reinforcement Learning

Reinforcement learning with sparse rewards is still an open challenge. C...

Please sign up or login with your details

Forgot password? Click here to reset