Learning Graph-Enhanced Commander-Executor for Multi-Agent Navigation

02/08/2023
by   Xinyi Yang, et al.
5

This paper investigates the multi-agent navigation problem, which requires multiple agents to reach the target goals in a limited time. Multi-agent reinforcement learning (MARL) has shown promising results for solving this issue. However, it is inefficient for MARL to directly explore the (nearly) optimal policy in the large search space, which is exacerbated as the agent number increases (e.g., 10+ agents) or the environment is more complex (e.g., 3D simulator). Goal-conditioned hierarchical reinforcement learning (HRL) provides a promising direction to tackle this challenge by introducing a hierarchical structure to decompose the search space, where the low-level policy predicts primitive actions in the guidance of the goals derived from the high-level policy. In this paper, we propose Multi-Agent Graph-Enhanced Commander-Executor (MAGE-X), a graph-based goal-conditioned hierarchical method for multi-agent navigation tasks. MAGE-X comprises a high-level Goal Commander and a low-level Action Executor. The Goal Commander predicts the probability distribution of goals and leverages them to assign each agent the most appropriate final target. The Action Executor utilizes graph neural networks (GNN) to construct a subgraph for each agent that only contains crucial partners to improve cooperation. Additionally, the Goal Encoder in the Action Executor captures the relationship between the agent and the designated goal to encourage the agent to reach the final target. The results show that MAGE-X outperforms the state-of-the-art MARL baselines with a 100 only 3 million training steps in multi-agent particle environments (MPE) with 50 agents, and at least a 12 in a more complicated quadrotor 3D navigation task.

READ FULL TEXT
research
06/25/2022

Hierarchical Reinforcement Learning with Opponent Modeling for Distributed Multi-agent Cooperation

Many real-world applications can be formulated as multi-agent cooperatio...
research
06/30/2023

Landmark Guided Active Exploration with Stable Low-level Policy Learning

Goal-conditioned hierarchical reinforcement learning (GCHRL) decomposes ...
research
11/16/2018

On the Complexity of Exploration in Goal-Driven Navigation

Building agents that can explore their environments intelligently is a c...
research
09/22/2022

Environment Optimization for Multi-Agent Navigation

Traditional approaches to the design of multi-agent navigation algorithm...
research
09/05/2021

Hierarchical Object-to-Zone Graph for Object Navigation

The goal of object navigation is to reach the expected objects according...
research
10/17/2022

Learning Control Admissibility Models with Graph Neural Networks for Multi-Agent Navigation

Deep reinforcement learning in continuous domains focuses on learning co...
research
03/22/2019

Deep Hierarchical Reinforcement Learning Based Recommendations via Multi-goals Abstraction

The recommender system is an important form of intelligent application, ...

Please sign up or login with your details

Forgot password? Click here to reset