Distributed Learning of Decentralized Control Policies for Articulated Mobile Robots

01/24/2019
by Guillaume Sartoretti, et al.

State-of-the-art distributed algorithms for reinforcement learning rely on multiple independent agents, which simultaneously learn in parallel environments while asynchronously updating a common, shared policy. Meanwhile, decentralized control architectures (e.g., CPGs) can coordinate spatially distributed portions of an articulated robot to achieve system-level objectives. In this work, we investigate the relationship between distributed learning and decentralized control by learning decentralized control policies for the locomotion of articulated robots in challenging environments. To this end, we present an approach that leverages the structure of the asynchronous advantage actor-critic (A3C) algorithm to provide a natural means of learning decentralized control policies on a single articulated robot. Our primary contribution shows that individual agents in the A3C algorithm can be defined by independently controlled portions of the robot's body, thus enabling distributed learning on a single robot for efficient hardware implementation. We present results of closed-loop locomotion in unstructured terrains on a snake and a hexapod robot, using decentralized controllers learned offline and online, respectively. This is a preprint of the paper submitted to the IEEE Transactions on Robotics (T-RO) journal in October 2018 and conditionally accepted for publication as a regular paper in January 2019.
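
The central idea of the abstract, several workers that each gather their own experience while updating one shared actor-critic model, can be illustrated with a small toy sketch. This is not the authors' implementation: the linear softmax actor, the linear critic, the one-step task in `toy_step`, and all names and constants are assumptions chosen for illustration. Workers here take turns instead of running asynchronously, which keeps the example deterministic.

```python
import math
import random

N_WORKERS = 6    # hypothetical: one worker per leg of a hexapod
N_FEATURES = 3   # hypothetical size of each worker's local observation
N_ACTIONS = 2    # hypothetical number of discrete actions per worker


class SharedModel:
    """Parameters shared by all workers: a linear softmax actor and a linear critic."""

    def __init__(self):
        self.actor = [[0.0] * N_FEATURES for _ in range(N_ACTIONS)]
        self.critic = [0.0] * N_FEATURES

    def policy(self, obs):
        """Return action probabilities via a numerically stable softmax."""
        logits = [sum(w * x for w, x in zip(row, obs)) for row in self.actor]
        top = max(logits)
        exps = [math.exp(v - top) for v in logits]
        total = sum(exps)
        return [e / total for e in exps]

    def value(self, obs):
        return sum(w * x for w, x in zip(self.critic, obs))


def toy_step(obs, action):
    """Toy one-step task (an assumption): action 1 is correct when obs[0] > 0."""
    correct = 1 if obs[0] > 0 else 0
    return 1.0 if action == correct else 0.0


def worker_update(model, rng, lr=0.1):
    """One worker samples its own experience and updates the shared parameters."""
    # Local observation: random features plus a constant bias term.
    obs = [rng.uniform(-1.0, 1.0) for _ in range(N_FEATURES - 1)] + [1.0]
    probs = model.policy(obs)
    action = rng.choices(range(N_ACTIONS), weights=probs)[0]
    reward = toy_step(obs, action)
    # Episodes last one step, so the return equals the reward.
    advantage = reward - model.value(obs)
    # Actor: gradient ascent on advantage * log pi(action | obs).
    for a in range(N_ACTIONS):
        grad_logit = (1.0 if a == action else 0.0) - probs[a]
        for i in range(N_FEATURES):
            model.actor[a][i] += lr * advantage * grad_logit * obs[i]
    # Critic: gradient descent on 0.5 * (reward - value)^2.
    for i in range(N_FEATURES):
        model.critic[i] += lr * advantage * obs[i]
    return reward


def train(n_rounds=2000, seed=0):
    """Train the shared model; each worker has its own random stream."""
    model = SharedModel()
    rngs = [random.Random(seed + k) for k in range(N_WORKERS)]
    rewards = []
    for _ in range(n_rounds):
        for rng in rngs:  # workers take turns; real A3C workers run asynchronously
            rewards.append(worker_update(model, rng))
    return model, rewards
```

Calling `train()` returns the shared model and the reward history across all workers. Because every worker writes to the same parameters, experience gathered by one worker changes the policy that all the others use, which is the property the abstract relies on when it maps workers to independently controlled portions of a robot's body.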


research
02/19/2021

Decentralized Deterministic Multi-Agent Reinforcement Learning

[Zhang, ICML 2018] provided the first decentralized actor-critic algorit...
research
10/28/2021

An Adaptable Approach to Learn Realistic Legged Locomotion without Examples

Learning controllers that reproduce legged locomotion in nature has been...
research
09/18/2017

Guided Deep Reinforcement Learning for Swarm Systems

In this paper, we investigate how to learn to control a group of coopera...
research
12/15/2020

Distributed Data Storage and Fusion for Collective Perception in Resource-Limited Mobile Robot Swarms

In this paper, we propose an approach to the distributed storage and fus...
research
10/11/2021

Learning to Coordinate in Multi-Agent Systems: A Coordinated Actor-Critic Algorithm and Finite-Time Guarantees

Multi-agent reinforcement learning (MARL) has attracted much research at...
research
06/15/2020

ForMIC: Foraging via Multiagent RL with Implicit Communication

Multi-agent foraging (MAF) involves distributing a team of agents to sea...
research
03/02/2023

PuSHR: A Multirobot System for Nonprehensile Rearrangement

We focus on the problem of rearranging a set of objects with a team of c...
