Self-Paced Context Evaluation for Contextual Reinforcement Learning

06/09/2021
by   Theresa Eimer, et al.
0

Reinforcement learning (RL) has made a lot of advances for solving a single problem in a given environment; but learning policies that generalize to unseen variations of a problem remains challenging. To improve sample efficiency for learning on such instances of a problem domain, we present Self-Paced Context Evaluation (SPaCE). Based on self-paced learning, automatically generates curricula online with little computational overhead. To this end, SPaCE leverages information contained in state values during training to accelerate and improve training performance as well as generalization capabilities to new instances from the same problem domain. Nevertheless, SPaCE is independent of the problem domain at hand and can be applied on top of any RL agent with state-value function approximation. We demonstrate SPaCE's ability to speed up learning of different value-based RL agents on two environments, showing better generalization capabilities and up to 10x faster learning compared to naive approaches such as round robin or SPDRL, as the closest state-of-the-art approach.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/12/2018

Will it Blend? Composing Value Functions in Reinforcement Learning

An important property for lifelong-learning agents is the ability to com...
research
10/25/2021

Self-Consistent Models and Values

Learned models of the environment provide reinforcement learning (RL) ag...
research
11/02/2020

Instance based Generalization in Reinforcement Learning

Agents trained via deep reinforcement learning (RL) routinely fail to ge...
research
01/05/2020

Universal Successor Features for Transfer Reinforcement Learning

Transfer in Reinforcement Learning (RL) refers to the idea of applying k...
research
07/01/2020

Group Equivariant Deep Reinforcement Learning

In Reinforcement Learning (RL), Convolutional Neural Networks(CNNs) have...
research
11/19/2019

Attention Privileged Reinforcement Learning For Domain Transfer

Applying reinforcement learning (RL) to physical systems presents notabl...
research
10/07/2022

Scaling Directed Controller Synthesis via Reinforcement Learning

Directed Controller Synthesis technique finds solutions for the non-bloc...

Please sign up or login with your details

Forgot password? Click here to reset