Domain-Robust Visual Imitation Learning with Mutual Information Constraints

03/08/2021
by Edoardo Cetin, et al.

Human beings are able to understand objectives and learn by simply observing others perform a task. Imitation learning methods aim to replicate such capabilities; however, they generally depend on access to a full set of optimal states and actions taken with the agent's actuators and from the agent's point of view. In this paper, we introduce a new algorithm, called Disentangling Generative Adversarial Imitation Learning (DisentanGAIL), with the purpose of bypassing such constraints. Our algorithm enables autonomous agents to learn directly from high-dimensional observations of an expert performing a task, by making use of adversarial learning with a latent representation inside the discriminator network. This latent representation is regularized through mutual information constraints to incentivize learning only features that encode information about the completion levels of the task being demonstrated. This yields a shared feature space in which imitation can succeed while disregarding the differences between the expert's and the agent's domains. Empirically, our algorithm is able to efficiently imitate in a diverse range of control problems, including balancing, manipulation, and locomotion tasks, while being robust to various domain differences in terms of both environment appearance and agent embodiment.
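
To make the idea of a mutual information constraint on a latent feature more concrete, the following is a minimal illustrative sketch, not the estimator or loss used in the paper. It assumes a scalar latent feature, a binary domain label (0 for the expert's domain, 1 for the agent's), and a simple histogram-based (plug-in) estimate of mutual information; the function names, the bin count, and the `limit_bits` threshold are all hypothetical choices made for illustration. The penalty is zero while the latent feature reveals little about the domain and grows once the estimated mutual information exceeds the chosen limit.

```python
import numpy as np


def mutual_information_bits(latent, domain, n_bins=8):
    """Plug-in estimate of I(latent; domain) in bits (illustrative only).

    latent: 1-D array of scalar latent features (hypothetical discriminator output).
    domain: 1-D array of 0/1 labels (assumed: 0 = expert domain, 1 = agent domain).
    """
    latent = np.asarray(latent, dtype=float)
    domain = np.asarray(domain, dtype=int)
    # Quantize the latent feature into equal-width bins.
    edges = np.linspace(latent.min(), latent.max(), n_bins + 1)
    codes = np.clip(np.digitize(latent, edges[1:-1]), 0, n_bins - 1)
    # Empirical joint distribution over (latent bin, domain label).
    joint = np.zeros((n_bins, 2))
    np.add.at(joint, (codes, domain), 1.0)
    joint /= joint.sum()
    p_z = joint.sum(axis=1, keepdims=True)  # marginal over latent bins
    p_d = joint.sum(axis=0, keepdims=True)  # marginal over domain labels
    nz = joint > 0  # skip empty cells to avoid log(0)
    return float(np.sum(joint[nz] * np.log2(joint[nz] / (p_z @ p_d)[nz])))


def mi_penalty(latent, domain, limit_bits=0.1, weight=1.0):
    """Hinge-style penalty: active only when the MI estimate exceeds the limit."""
    excess = mutual_information_bits(latent, domain) - limit_bits
    return weight * max(0.0, excess)


if __name__ == "__main__":
    rng = np.random.default_rng(0)
    domain = rng.integers(0, 2, size=2000)
    # A latent feature that leaks the domain versus one that ignores it.
    leaky = domain + 0.1 * rng.standard_normal(2000)
    invariant = rng.standard_normal(2000)
    print("penalty (leaky latent):    ", mi_penalty(leaky, domain))
    print("penalty (invariant latent):", mi_penalty(invariant, domain))
```

In a trainable setting the histogram estimate would have to be replaced by a differentiable estimator, since this sketch only illustrates what the constraint measures.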

research
05/04/2018

Behavioral Cloning from Observation

Humans often learn how to perform tasks via imitation: they observe othe...
research
10/02/2019

Task-Relevant Adversarial Imitation Learning

We show that a critical problem in adversarial imitation from high-dimen...
research
06/22/2022

Latent Policies for Adversarial Imitation Learning

This paper considers learning robot locomotion and manipulation tasks fr...
research
06/19/2023

SeMAIL: Eliminating Distractors in Visual Imitation via Separated Models

Model-based imitation learning (MBIL) is a popular reinforcement learnin...
research
10/01/2018

Variational Discriminator Bottleneck: Improving Imitation Learning, Inverse RL, and GANs by Constraining Information Flow

Adversarial learning methods have been proposed for a wide range of appl...
research
07/01/2019

Active Learning within Constrained Environments through Imitation of an Expert Questioner

Active learning agents typically employ a query selection algorithm whic...
research
05/26/2019

Operation and Imitation under Safety-Aware Shared Control

We describe a shared control methodology that can, without knowledge of ...
