Random Projection in Neural Episodic Control

04/03/2019
by   Daichi Nishio, et al.
0

End-to-end deep reinforcement learning has enabled agents to learn with little preprocessing by humans. However, it is still difficult to learn stably and efficiently because the learning method usually uses a nonlinear function approximation. Neural Episodic Control (NEC), which has been proposed in order to improve sample efficiency, is able to learn stably by estimating action values using a non-parametric method. In this paper, we propose an architecture that incorporates random projection into NEC to train with more stability. In addition, we verify the effectiveness of our architecture by Atari's five games. The main idea is to reduce the number of parameters that have to learn by replacing neural networks with random projection in order to reduce dimensions while keeping the learning end-to-end.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/26/2018

Control with Distributed Deep Reinforcement Learning: Learn a Better Policy

Distributed approach is a very effective method to improve training effi...
research
03/03/2021

Reinforcement Learning with External Knowledge by using Logical Neural Networks

Conventional deep reinforcement learning methods are sample-inefficient ...
research
11/25/2019

Biologically inspired architectures for sample-efficient deep reinforcement learning

Deep reinforcement learning requires a heavy price in terms of sample ef...
research
03/03/2020

Contention Window Optimization in IEEE 802.11ax Networks with Deep Reinforcement Learning

The proper setting of contention window (CW) values has a significant im...
research
01/30/2017

Expert Level control of Ramp Metering based on Multi-task Deep Reinforcement Learning

This article shows how the recent breakthroughs in Reinforcement Learnin...
research
04/08/2019

End-to-end Projector Photometric Compensation

Projector photometric compensation aims to modify a projector input imag...
research
07/30/2020

End-to-end Full Projector Compensation

Full projector compensation aims to modify a projector input image to co...

Please sign up or login with your details

Forgot password? Click here to reset