Deep Reinforcement Learning With Macro-Actions

06/15/2016
by   Ishan P. Durugkar, et al.
0

Deep reinforcement learning has been shown to be a powerful framework for learning policies from complex high-dimensional sensory inputs to actions in complex tasks, such as the Atari domain. In this paper, we explore output representation modeling in the form of temporal abstraction to improve convergence and reliability of deep reinforcement learning approaches. We concentrate on macro-actions, and evaluate these on different Atari 2600 games, where we show that they yield significant improvements in learning speed. Additionally, we show that they can even achieve better scores than DQN. We offer analysis and explanation for both convergence and final results, revealing a problem deep RL approaches have with sparse reward signals.

READ FULL TEXT
research
11/28/2019

Algorithmic Improvements for Deep Reinforcement Learning applied to Interactive Fiction

Text-based games are a natural challenge domain for deep reinforcement l...
research
08/05/2019

Construction of Macro Actions for Deep Reinforcement Learning

Conventional deep reinforcement learning typically determines an appropr...
research
03/22/2019

Macro Action Reinforcement Learning with Sequence Disentanglement using Variational Autoencoder

One problem in the application of reinforcement learning to real-world p...
research
02/10/2022

Abstraction for Deep Reinforcement Learning

We characterise the problem of abstraction in the context of deep reinfo...
research
05/12/2020

Unbiased Deep Reinforcement Learning: A General Training Framework for Existing and Future Algorithms

In recent years deep neural networks have been successfully applied to t...
research
12/13/2019

Long-Term Planning and Situational Awareness in OpenAI Five

Understanding how knowledge about the world is represented within model-...
research
10/09/2018

Distributed Wildfire Surveillance with Autonomous Aircraft using Deep Reinforcement Learning

Teams of autonomous unmanned aircraft can be used to monitor wildfires, ...

Please sign up or login with your details

Forgot password? Click here to reset