Learning Reward Machines in Cooperative Multi-Agent Tasks

03/24/2023
by   Leo Ardon, et al.
0

This paper presents a novel approach to Multi-Agent Reinforcement Learning (MARL) that combines cooperative task decomposition with the learning of reward machines (RMs) encoding the structure of the sub-tasks. The proposed method helps deal with the non-Markovian nature of the rewards in partially observable environments and improves the interpretability of the learnt policies required to complete the cooperative task. The RMs associated with each sub-task are learnt in a decentralised manner and then used to guide the behaviour of each agent. By doing so, the complexity of a cooperative multi-agent problem is reduced, allowing for more effective learning. The results suggest that our approach is a promising direction for future research in MARL, especially in complex environments with large state spaces and multiple agents.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/13/2023

Heterogeneous Multi-Agent Reinforcement Learning via Mirror Descent Policy Optimization

This paper presents an extension of the Mirror Descent method to overcom...
research
10/26/2018

TarMAC: Targeted Multi-Agent Communication

We explore a collaborative multi-agent reinforcement learning setting wh...
research
03/05/2020

Reward Design in Cooperative Multi-agent Reinforcement Learning for Packet Routing

In cooperative multi-agent reinforcement learning (MARL), how to design ...
research
08/25/2022

Learning Task Automata for Reinforcement Learning using Hidden Markov Models

Training reinforcement learning (RL) agents using scalar reward signals ...
research
08/27/2014

Definition and properties to assess multi-agent environments as social intelligence tests

Social intelligence in natural and artificial systems is usually measure...
research
08/13/2021

Q-Mixing Network for Multi-Agent Pathfinding in Partially Observable Grid Environments

In this paper, we consider the problem of multi-agent navigation in part...
research
12/06/2022

Curriculum Learning for Relative Overgeneralization

In multi-agent reinforcement learning (MARL), many popular methods, such...

Please sign up or login with your details

Forgot password? Click here to reset