Decoupling feature extraction from policy learning: assessing benefits of state representation learning in goal based robotics

01/24/2019
by Antonin Raffin, et al.

Scaling end-to-end reinforcement learning to control real robots from vision presents a series of challenges, in particular in terms of sample efficiency. In contrast to end-to-end learning, state representation learning can help learn a compact, efficient, and relevant representation of states that speeds up policy learning, reduces the number of samples needed, and is easier to interpret. We evaluate several state representation learning methods on goal-based robotics tasks and propose a new unsupervised model that stacks representations and combines the strengths of several of these approaches. This method encodes all the relevant features, performs on par with or better than end-to-end learning, and is robust to changes in hyperparameters.

research
02/12/2018

State Representation Learning for Control: An Overview

Representation learning algorithms are designed to learn abstract featur...
research
07/29/2020

Low Dimensional State Representation Learning with Reward-shaped Priors

Reinforcement Learning has been able to solve many complicated robotics ...
research
03/10/2021

RMP2: A Structured Composable Policy Class for Robot Learning

We consider the problem of learning motion policies for acceleration-bas...
research
08/10/2023

RLSAC: Reinforcement Learning enhanced Sample Consensus for End-to-End Robust Estimation

Robust estimation is a crucial and still challenging task, which involve...
research
07/12/2022

Learning Bellman Complete Representations for Offline Policy Evaluation

We study representation learning for Offline Reinforcement Learning (RL)...
research
07/04/2021

Low Dimensional State Representation Learning with Robotics Priors in Continuous Action Spaces

Autonomous robots require high degrees of cognitive and motoric intellig...
research
07/08/2022

Learning with Muscles: Benefits for Data-Efficiency and Robustness in Anthropomorphic Tasks

Humans are able to outperform robots in terms of robustness, versatility...
