Unbiased Deep Reinforcement Learning: A General Training Framework for Existing and Future Algorithms

05/12/2020
by Huihui Zhang, et al.

In recent years, deep neural networks have been successfully applied to reinforcement learning <cit.>. Deep reinforcement learning <cit.> is reported to have an advantage over traditional agents in learning effective policies directly from high-dimensional sensory inputs. However, the literature contains no fundamental change or improvement to the existing training framework. Here we propose a novel training framework that is conceptually comprehensible and potentially easy to generalize to all feasible reinforcement learning algorithms. We employ Monte Carlo sampling to obtain raw data inputs, train on them in batches to obtain Markov decision process sequences, and synchronously update the network parameters instead of using experience replay. This training framework is shown to optimize an unbiased approximation of the loss function, whose estimate exactly matches the real probability distribution that the data inputs follow. As a result, it has overwhelming advantages in sample efficiency and convergence rate over existing deep reinforcement learning when evaluated on both discrete action spaces and continuous control problems. In addition, we propose several algorithms built on our new framework to handle typical discrete and continuous scenarios. These algorithms prove to be far more efficient than their original versions under the standard deep reinforcement learning framework, and they provide examples of how existing and future algorithms can be generalized to our new framework.
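
The claim about an unbiased loss estimate can be illustrated with a toy sketch. This is not the paper's algorithm; it only shows, under assumptions made up for the example, why a Monte Carlo average over samples drawn from the distribution the data actually follows converges to the true expected loss, while an average over samples from a different (stale, replay-like) distribution does not. The three-state setup, the loss values, and the names `expected_loss`, `monte_carlo_loss`, `current`, and `stale` are all hypothetical.

```python
import random


def expected_loss(probs, losses):
    """Exact expectation of the loss under a discrete distribution."""
    return sum(p * l for p, l in zip(probs, losses))


def monte_carlo_loss(probs, losses, n_samples, rng):
    """Average loss over samples drawn i.i.d. from `probs`."""
    indices = rng.choices(range(len(losses)), weights=probs, k=n_samples)
    return sum(losses[i] for i in indices) / n_samples


rng = random.Random(0)            # fixed seed so the run is repeatable

# Hypothetical toy problem: three states with fixed per-state losses.
losses = [1.0, 4.0, 9.0]
current = [0.2, 0.3, 0.5]         # assumed distribution of fresh data
stale = [0.6, 0.3, 0.1]           # assumed distribution of older, replayed data

true_value = expected_loss(current, losses)
fresh_estimate = monte_carlo_loss(current, losses, 100_000, rng)
stale_estimate = monte_carlo_loss(stale, losses, 100_000, rng)

print(f"true expected loss:        {true_value:.3f}")
print(f"estimate from fresh data:  {fresh_estimate:.3f}")
print(f"estimate from stale data:  {stale_estimate:.3f}")
```

In this sketch the fresh-data estimate approaches the exact expectation as the sample count grows, whereas the stale-data estimate approaches the expectation under `stale` instead, so the gap between the two does not shrink with more samples.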


research
07/25/2019

Action Guidance with MCTS for Deep Reinforcement Learning

Deep reinforcement learning has achieved great successes in recent years...
research
10/15/2018

Using Deep Reinforcement Learning for the Continuous Control of Robotic Arms

Deep reinforcement learning enables algorithms to learn complex behavior...
research
11/13/2015

Deep Reinforcement Learning in Parameterized Action Space

Recent work has shown that deep neural networks are capable of approxima...
research
06/15/2016

Deep Reinforcement Learning With Macro-Actions

Deep reinforcement learning has been shown to be a powerful framework fo...
research
12/16/2021

Deep Reinforcement Learning Policies Learn Shared Adversarial Features Across MDPs

The use of deep neural networks as function approximators has led to str...
research
09/17/2018

Transparency and Explanation in Deep Reinforcement Learning Neural Networks

Autonomous AI systems will be entering human society in the near future ...
research
09/05/2018

A Robotic Auto-Focus System based on Deep Reinforcement Learning

Considering its advantages in dealing with high-dimensional visual input...
