Deep Learning with Experience Ranking Convolutional Neural Network for Robot Manipulator

09/16/2018
by   Hai Nguyen, et al.
0

Supervised learning, more specifically Convolutional Neural Networks (CNN), has surpassed human ability in some visual recognition tasks such as detection of traffic signs, faces and handwritten numbers. On the other hand, even state-of-the-art reinforcement learning (RL) methods have difficulties in environments with sparse and binary rewards. They requires manually shaping reward functions, which might be challenging to come up with. These tasks, however, are trivial to human. One of the reasons that human are better learners in these tasks is that we are embedded with much prior knowledge of the world. These knowledge might be either embedded in our genes or learned from imitation - a type of supervised learning. For that reason, the best way to narrow the gap between machine and human learning ability should be to mimic how we learn so well in various tasks by a combination of RL and supervised learning. Our method, which integrates Deep Deterministic Policy Gradients and Hindsight Experience Replay (RL method specifically dealing with sparse rewards) with an experience ranking CNN, provides a significant speedup over the learning curve on simulated robotics tasks. Experience ranking allows high-reward transitions to be replayed more frequently, and therefore help learn more efficiently. Our proposed approach can also speed up learning in any other tasks that provide additional information for experience ranking.

READ FULL TEXT

page 5

page 6

research
09/28/2017

Overcoming Exploration in Reinforcement Learning with Demonstrations

Exploration in environments with sparse rewards has been a persistent pr...
research
07/05/2017

Hindsight Experience Replay

Dealing with sparse rewards is one of the biggest challenges in Reinforc...
research
02/01/2019

Competitive Experience Replay

Deep learning has achieved remarkable successes in solving challenging r...
research
02/26/2018

Multi-Goal Reinforcement Learning: Challenging Robotics Environments and Request for Research

The purpose of this technical report is two-fold. First of all, it intro...
research
12/01/2016

Generalizing Skills with Semi-Supervised Reinforcement Learning

Deep reinforcement learning (RL) can acquire complex behaviors from low-...
research
06/07/2022

Imitating Past Successes can be Very Suboptimal

Prior work has proposed a simple strategy for reinforcement learning (RL...
research
12/05/2019

Reinforcement Learning Upside Down: Don't Predict Rewards – Just Map Them to Actions

We transform reinforcement learning (RL) into a form of supervised learn...

Please sign up or login with your details

Forgot password? Click here to reset