RIDM: Reinforced Inverse Dynamics Modeling for Learning from a Single Observed Demonstration

06/18/2019
by Brahma S. Pavse, et al.

Imitation learning has long been used to alleviate the tractability issues that arise in reinforcement learning. However, most of the literature makes several assumptions, such as access to the expert's actions, availability of many expert demonstrations, and injection of task-specific domain knowledge into the learning process. We propose reinforced inverse dynamics modeling (RIDM), a method that combines reinforcement learning and imitation from observation (IfO) to perform imitation using a single expert demonstration, with no access to the expert's actions, and with little task-specific domain knowledge. Given only a single set of the expert's raw states at each time-step, such as joint angles in a robot control task, we learn an inverse dynamics model that produces the low-level actions, such as torques, needed to transition from one state to the next such that the reward from the environment is maximized. We demonstrate that RIDM outperforms other techniques, when the same constraints are applied to them, on six domains of the MuJoCo simulator and on two different robot soccer tasks for two experts from the RoboCup 3D simulation league on the SimSpark simulator.
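The idea in the abstract can be illustrated with a toy sketch. This is not the authors' implementation: the point-mass dynamics, the PD-style controller used as the inverse dynamics model, the random-search optimizer, and every name below (`step`, `env_reward`, `inverse_dynamics`, `rollout`, `optimize_gains`) are assumptions chosen only to keep the example self-contained. It shows a parameterized inverse dynamics model that maps the current state and the next demonstrated state to an action, with its parameters tuned to maximize environment reward.

```python
import random

DT = 0.1  # simulation time-step (assumed value)


def step(pos, vel, force):
    """Toy unit-mass dynamics with mild damping; a stand-in for a real simulator."""
    vel = vel + DT * (force - 0.5 * vel)
    pos = pos + DT * vel
    return pos, vel


def env_reward(pos, goal=1.0):
    """Hypothetical environment reward: negative squared distance to a goal."""
    diff = pos - goal
    return -diff * diff


def inverse_dynamics(pos, vel, target_pos, gains):
    """PD-style inverse dynamics model (an assumption, not from the abstract):
    returns the force intended to move the agent toward the next demonstrated state."""
    kp, kd = gains
    return kp * (target_pos - pos) - kd * vel


def rollout(gains, demo):
    """Follow the state-only demonstration and return the total environment reward."""
    pos, vel, total = demo[0], 0.0, 0.0
    for target in demo[1:]:
        force = inverse_dynamics(pos, vel, target, gains)
        pos, vel = step(pos, vel, force)
        total += env_reward(pos)
    return total


def optimize_gains(demo, iterations=200, seed=0):
    """Simple random-search hill climbing over the controller gains,
    keeping a candidate only if it increases the environment reward."""
    rng = random.Random(seed)
    best = (0.0, 0.0)
    best_return = rollout(best, demo)
    for _ in range(iterations):
        cand = tuple(max(0.0, g + rng.gauss(0.0, 1.0)) for g in best)
        ret = rollout(cand, demo)
        if ret > best_return:
            best, best_return = cand, ret
    return best, best_return


# A single demonstration consisting of states only (positions ramping from 0 to 1).
demo = [min(1.0, 0.05 * t) for t in range(41)]

if __name__ == "__main__":
    gains, ret = optimize_gains(demo)
    print("gains:", gains, "return:", ret)
```

Note that the expert's actions never appear: the demonstration supplies only target states, and the environment reward alone drives the search over the model's parameters. A real setting would replace the toy dynamics with a simulator and likely use a stronger optimizer.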

research
08/12/2022

Causal Imitation Learning with Unobserved Confounders

One of the common ways children learn is by mimicking adults. Imitation ...
research
10/16/2020

On the Guaranteed Almost Equivalence between Imitation Learning from Observation and Demonstration

Imitation learning from observation (LfO) is more preferable than imitat...
research
02/15/2019

ProLoNets: Neural-encoding Human Experts' Domain Knowledge to Warm Start Reinforcement Learning

Deep reinforcement learning has seen great success across a breadth of t...
research
02/08/2020

Multi-task Reinforcement Learning with a Planning Quasi-Metric

We introduce a new reinforcement learning approach combining a planning ...
research
03/01/2018

Inverse Reinforcement Learning via Nonparametric Spatio-Temporal Subgoal Modeling

Recent advances in the field of inverse reinforcement learning (IRL) hav...
research
04/28/2020

Augmented Behavioral Cloning from Observation

Imitation from observation is a computational technique that teaches an ...
research
10/10/2019

Imitation Learning from Observations by Minimizing Inverse Dynamics Disagreement

This paper studies Learning from Observations (LfO) for imitation learni...
