Hierarchical Reinforcement Learning with Hindsight

05/21/2018
by   Andrew Levy, et al.
0

Reinforcement Learning (RL) algorithms can suffer from poor sample efficiency when rewards are delayed and sparse. We introduce a solution that enables agents to learn temporally extended actions at multiple levels of abstraction in a sample efficient and automated fashion. Our approach combines universal value functions and hindsight learning, allowing agents to learn policies belonging to different time scales in parallel. We show that our method significantly accelerates learning in a variety of discrete and continuous tasks.

READ FULL TEXT
research
02/14/2020

Learning Functionally Decomposed Hierarchies for Continuous Control Tasks

Solving long-horizon sequential decision making tasks in environments wi...
research
12/05/2019

Training Agents using Upside-Down Reinforcement Learning

Traditional Reinforcement Learning (RL) algorithms either predict reward...
research
09/16/2022

Optimizing Industrial HVAC Systems with Hierarchical Reinforcement Learning

Reinforcement learning (RL) techniques have been developed to optimize i...
research
06/18/2019

Language as an Abstraction for Hierarchical Deep Reinforcement Learning

Solving complex, temporally-extended tasks is a long-standing problem in...
research
03/27/2020

Modeling 3D Shapes by Reinforcement Learning

We explore how to enable machines to model 3D shapes like human modelers...
research
11/30/2019

IMPACT: Importance Weighted Asynchronous Architectures with Clipped Target Networks

The practical usage of reinforcement learning agents is often bottleneck...
research
09/26/2022

Delayed Geometric Discounts: An Alternative Criterion for Reinforcement Learning

The endeavor of artificial intelligence (AI) is to design autonomous age...

Please sign up or login with your details

Forgot password? Click here to reset