Learning Memory-Based Control for Human-Scale Bipedal Locomotion

06/03/2020
by   Jonah Siekmann, et al.
0

Controlling a non-statically stable biped is a difficult problem largely due to the complex hybrid dynamics involved. Recent work has demonstrated the effectiveness of reinforcement learning (RL) for simulation-based training of neural network controllers that successfully transfer to real bipeds. The existing work, however, has primarily used simple memoryless network architectures, even though more sophisticated architectures, such as those including memory, often yield superior performance in other RL domains. In this work, we consider recurrent neural networks (RNNs) for sim-to-real biped locomotion, allowing for policies that learn to use internal memory to model important physical properties. We show that while RNNs are able to significantly outperform memoryless policies in simulation, they do not exhibit superior behavior on the real biped due to overfitting to the simulation physics unless trained using dynamics randomization to prevent overfitting; this leads to consistently better sim-to-real transfer. We also show that RNNs could use their learned memory states to perform online system identification by encoding parameters of the dynamics into memory.

READ FULL TEXT
research
04/09/2022

Sim-to-Real Learning for Bipedal Locomotion Under Unsensed Dynamic Loads

Recent work on sim-to-real learning for bipedal locomotion has demonstra...
research
11/09/2020

Learning Task Space Actions for Bipedal Locomotion

Recent work has demonstrated the success of reinforcement learning (RL) ...
research
04/20/2021

GLiDE: Generalizable Quadrupedal Locomotion in Diverse Environments with a Centroidal Model

Model-free reinforcement learning (RL) for legged locomotion commonly re...
research
01/29/2019

Emergence of Hierarchy via Reinforcement Learning Using a Multiple Timescale Stochastic RNN

Although recurrent neural networks (RNNs) for reinforcement learning (RL...
research
11/04/2020

Dynamics Randomization Revisited:A Case Study for Quadrupedal Locomotion

Understanding the gap between simulation andreality is critical for rein...
research
06/08/2015

Learning to Transduce with Unbounded Memory

Recently, strong results have been demonstrated by Deep Recurrent Neural...
research
09/26/2022

Learning and Deploying Robust Locomotion Policies with Minimal Dynamics Randomization

Training deep reinforcement learning (DRL) locomotion policies often req...

Please sign up or login with your details

Forgot password? Click here to reset