Solving Challenging Control Problems Using Two-Staged Deep Reinforcement Learning

09/27/2021
by   Nitish Sontakke, et al.
0

We present a two-staged deep reinforcement learning algorithm for solving challenging control problems. Deep reinforcement learning (deep RL) has been an effective tool for solving many high-dimensional continuous control problems, but it cannot effectively solve challenging problems with certain properties, such as sparse reward functions or sensitive dynamics. In this work, we propose an approach that decomposes the given problem into two stages: motion planning and motion imitation. The motion planning stage seeks to compute a feasible motion plan with approximated dynamics by directly sampling the state space rather than exploring random control signals. Once the motion plan is obtained, the motion imitation stage learns a control policy that can imitate the given motion plan with realistic sensors and actuations. We demonstrate that our approach can solve challenging control problems - rocket navigation and quadrupedal locomotion - which cannot be solved by the standard MDP formulation. The supplemental video can be found at: https://youtu.be/FYLo1Ov_8-g

READ FULL TEXT

page 1

page 4

page 6

research
06/01/2019

Harnessing Reinforcement Learning for Neural Motion Planning

Motion planning is an essential component in most of today's robotic app...
research
09/18/2019

DeepGait: Planning and Control of Quadrupedal Gaits using Deep Reinforcement Learning

This paper addresses the problem of legged locomotion in non-flat terrai...
research
01/22/2019

Visual Imitation Learning with Recurrent Siamese Networks

People solve the difficult problem of understanding the salient features...
research
05/23/2023

Solving Stabilize-Avoid Optimal Control via Epigraph Form and Deep Reinforcement Learning

Tasks for autonomous robotic systems commonly require stabilization to a...
research
12/14/2020

The orienteering problem: a hybrid control formulation

In the last years, a growing number of challenging applications in navig...
research
05/08/2022

Learning to Brachiate via Simplified Model Imitation

Brachiation is the primary form of locomotion for gibbons and siamangs, ...
research
11/25/2019

A Deep Reinforcement Learning Architecture for Multi-stage Optimal Control

Deep reinforcement learning for high dimensional, hierarchical control t...

Please sign up or login with your details

Forgot password? Click here to reset