GALOPP: Multi-Agent Deep Reinforcement Learning For Persistent Monitoring With Localization Constraints

09/14/2021
by Manav Mishra, et al.

Persistently monitoring a region under localization and communication constraints is a challenging problem. In this paper, we consider a heterogeneous robotic system consisting of two types of agents: anchor agents that have accurate localization capability, and auxiliary agents that have low localization accuracy. An auxiliary agent must be within the communication range of an anchor, directly or indirectly, to localize itself. The objective of the robotic team is to minimize the uncertainty in the environment through persistent monitoring. We propose a multi-agent deep reinforcement learning (MADRL) based architecture with graph attention called Graph Localized Proximal Policy Optimization (GALOPP), which incorporates the localization and communication constraints of the agents along with the persistent monitoring objective to determine motion policies for each agent. We evaluate the performance of GALOPP on three different custom-built environments. The results show the agents are able to learn a stable policy and outperform greedy and random search baseline approaches.
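
The localization constraint in the abstract (an auxiliary agent localizes only if it can reach an anchor through one or more communication hops) can be illustrated with a small reachability check. This is a minimal sketch, not the paper's implementation: the function name `localized_agents`, the 2D positions, and the single shared `comm_range` are all assumptions made for illustration.

```python
from collections import deque
from math import dist


def localized_agents(positions, anchors, comm_range):
    """Return the ids of agents that can localize (hypothetical helper).

    positions:  dict mapping agent id -> (x, y) position
    anchors:    set of agent ids with accurate localization
    comm_range: assumed common communication radius for all agents
    """
    # Anchors are localized by definition; search outward from them.
    localized = set(anchors)
    queue = deque(anchors)
    while queue:
        current = queue.popleft()
        for other, pos in positions.items():
            # An agent within range of an already-localized agent is
            # connected to an anchor, directly or through relays.
            if other not in localized and dist(positions[current], pos) <= comm_range:
                localized.add(other)
                queue.append(other)
    return localized


# Example: aux2 reaches the anchor only through aux1; aux3 is cut off.
positions = {"anchor": (0, 0), "aux1": (1, 0), "aux2": (2, 0), "aux3": (5, 0)}
connected = localized_agents(positions, {"anchor"}, comm_range=1.5)
print(sorted(connected))  # ['anchor', 'aux1', 'aux2']
```

Breadth-first search is used here only because it makes the "directly or indirectly" condition explicit; any graph connectivity routine would serve the same purpose.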


research
11/02/2020

Multi-Agent Reinforcement Learning for Persistent Monitoring

The Persistent Monitoring (PM) problem seeks to find a set of trajectori...
research
12/03/2018

Multi-agent Deep Reinforcement Learning with Extremely Noisy Observations

Multi-agent reinforcement learning systems aim to provide interacting ag...
research
05/31/2022

Free-Space Ellipsoid Graphs for Multi-Agent Target Monitoring

We apply a novel framework for decomposing and reasoning about free spac...
research
11/06/2019

Asymptotic Analysis for Greedy Initialization of Threshold-Based Distributed Optimization of Persistent Monitoring on Graphs

We consider the optimal multi-agent persistent monitoring problem define...
research
11/06/2019

Asymptotic Analysis Based Greedy Method for Threshold-Based Distributed Optimization of Persistent Monitoring on Graphs

We consider the optimal multi-agent persistent monitoring problem define...
research
03/16/2023

FindView: Precise Target View Localization Task for Look Around Agents

With the increase in demands for service robots and automated inspection...
research
06/15/2020

ForMIC: Foraging via Multiagent RL with Implicit Communication

Multi-agent foraging (MAF) involves distributing a team of agents to sea...