Unified State Representation Learning under Data Augmentation

09/12/2022
by Taylor Hearn, et al.

The capacity for rapid domain adaptation is important for increasing the applicability of reinforcement learning (RL) to real-world problems. Generalization of RL agents is critical to success in the real world, yet zero-shot policy transfer remains challenging because even minor visual changes can cause a trained agent to fail completely in the new task. We propose USRA: Unified State Representation Learning under Data Augmentation, a representation learning framework that learns a latent unified state representation by performing data augmentations on its observations, improving the agent's ability to generalize to unseen target domains. We showcase the success of our approach on the DeepMind Control Generalization Benchmark for the Walker environment and find that USRA achieves higher sample efficiency and 14.3% better domain adaptation performance than the best baseline results.
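
The abstract does not say which augmentations or training losses USRA uses, so the following is only a minimal sketch of the general idea of encouraging a single latent state for differently augmented views of one observation. The random-shift augmentation, the toy linear encoder, the mean-squared consistency loss, and every name and shape here (`random_shift`, `encode`, `consistency_loss`, 84x84x3 images, 16 latent dimensions) are illustrative assumptions, not the authors' implementation.

```python
import numpy as np


def random_shift(obs, pad, rng):
    """Pad an (H, W, C) image by `pad` pixels per side, then crop a random H x W window."""
    h, w, _ = obs.shape
    padded = np.pad(obs, ((pad, pad), (pad, pad), (0, 0)), mode="edge")
    top = rng.integers(0, 2 * pad + 1)   # vertical offset in [0, 2 * pad]
    left = rng.integers(0, 2 * pad + 1)  # horizontal offset in [0, 2 * pad]
    return padded[top:top + h, left:left + w, :]


def encode(obs, weights):
    """Toy linear encoder (assumption): flatten the image and project to a latent vector."""
    return obs.reshape(-1).astype(np.float64) @ weights


def consistency_loss(z_a, z_b):
    """Mean squared difference between two latent vectors."""
    return float(np.mean((z_a - z_b) ** 2))


rng = np.random.default_rng(0)
obs = rng.random((84, 84, 3))                         # stand-in for a rendered frame
weights = rng.normal(size=(84 * 84 * 3, 16)) * 0.01   # hypothetical encoder parameters

# Two independently augmented views of the same underlying state.
view_a = random_shift(obs, pad=4, rng=rng)
view_b = random_shift(obs, pad=4, rng=rng)

# Minimizing this quantity would push both views toward one shared latent state.
loss = consistency_loss(encode(view_a, weights), encode(view_b, weights))
```

In a real agent the encoder would be a trained network and this term would be combined with the RL objective; the sketch only shows how two augmented views of one observation can be tied to the same latent.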

research
02/10/2021

Domain Adaptation In Reinforcement Learning Via Latent Unified State Representation

Despite the recent success of deep reinforcement learning (RL), domain a...
research
07/26/2017

DARLA: Improving Zero-Shot Transfer in Reinforcement Learning

Domain adaptation is an important open problem in deep reinforcement lea...
research
06/17/2021

SECANT: Self-Expert Cloning for Zero-Shot Generalization of Visual Policies

Generalization has been a long-standing challenge for reinforcement lear...
research
06/04/2021

Cross-Trajectory Representation Learning for Zero-Shot Generalization in RL

A highly desirable property of a reinforcement learning (RL) agent – and...
research
12/03/2020

Intervention Design for Effective Sim2Real Transfer

The goal of this work is to address the recent success of domain randomi...
research
10/10/2022

A Comprehensive Survey of Data Augmentation in Visual Reinforcement Learning

Visual reinforcement learning (RL), which makes decisions directly from ...
research
06/14/2023

VIBR: Learning View-Invariant Value Functions for Robust Visual Control

End-to-end reinforcement learning on images showed significant progress ...
