Expert-Level Atari Imitation Learning from Demonstrations Only

09/09/2019
by   Xin-Qiang Cai, et al.
9

One of the key issues for imitation learning lies in making policy learned from limited samples to generalize well in the whole state-action space. This problem is much more severe in high-dimensional state environments, such as game playing with raw pixel inputs. Under this situation, even state-of-the-art adversary based imitation learning algorithms fail. Through theoretical and empirical studies, we find that the main cause lies in the failure of training a powerful discriminator to generate meaningful rewards in high-dimensional environments. Theoretical results are provided to suggest the necessity of dimensionality reduction. However, since preserving important discriminative information via feature transformation is a non-trivial task, a straightforward application of off-the-shelf methods cannot achieve desirable performance. To address the above issues, we propose HashReward, which is a novel imitation learning algorithm utilizing the idea of supervised hashing to realize effective training of the discriminator. As far as we are aware, HashReward is the first pure imitation learning approach to achieve expert comparable performance in Atari game environments with raw pixel inputs.

READ FULL TEXT

page 9

page 17

page 18

research
06/22/2022

Latent Policies for Adversarial Imitation Learning

This paper considers learning robot locomotion and manipulation tasks fr...
research
07/01/2022

Discriminator-Guided Model-Based Offline Imitation Learning

Offline imitation learning (IL) is a powerful method to solve decision-m...
research
02/16/2020

Correlated Adversarial Imitation Learning

A novel imitation learning algorithm is introduced by applying a game-th...
research
08/17/2023

Regularizing Adversarial Imitation Learning Using Causal Invariance

Imitation learning methods are used to infer a policy in a Markov decisi...
research
11/08/2021

Off-policy Imitation Learning from Visual Inputs

Recently, various successful applications utilizing expert states in imi...
research
10/06/2017

Socially-compliant Navigation through Raw Depth Inputs with Generative Adversarial Imitation Learning

We present an approach for mobile robots to learn to navigate in pedestr...
research
09/07/2019

Mature GAIL: Imitation Learning for Low-level and High-dimensional Input using Global Encoder and Cost Transformation

Recently, GAIL framework and various variants have shown remarkable poss...

Please sign up or login with your details

Forgot password? Click here to reset