Curriculum Design for Teaching via Demonstrations: Theory and Applications

06/08/2021
by   Gaurav Yengera, et al.
0

We consider the problem of teaching via demonstrations in sequential decision-making settings. In particular, we study how to design a personalized curriculum over demonstrations to speed up the learner's convergence. We provide a unified curriculum strategy for two popular learner models: Maximum Causal Entropy Inverse Reinforcement Learning (MaxEnt-IRL) and Cross-Entropy Behavioral Cloning (CrossEnt-BC). Our unified strategy induces a ranking over demonstrations based on a notion of difficulty scores computed w.r.t. the teacher's optimal policy and the learner's current policy. Compared to the state of the art, our strategy doesn't require access to the learner's internal dynamics and still enjoys similar convergence guarantees under mild technical conditions. Furthermore, we adapt our curriculum strategy to teach a learner using domain knowledge in the form of task-specific difficulty scores when the teacher's optimal policy is unknown. Experiments on a car driving simulator environment and shortest path problems in a grid-world environment demonstrate the effectiveness of our proposed curriculum strategy.

READ FULL TEXT
research
05/28/2019

Interactive Teaching Algorithms for Inverse Reinforcement Learning

We study the problem of inverse reinforcement learning (IRL) with the ad...
research
06/02/2019

Learner-aware Teaching: Inverse Reinforcement Learning with Preferences and Constraints

Inverse reinforcement learning (IRL) enables an agent to learn complex b...
research
09/16/2023

Interactively Teaching an Inverse Reinforcement Learner with Limited Feedback

We study the problem of teaching via demonstrations in sequential decisi...
research
10/21/2018

Teaching Inverse Reinforcement Learners via Features and Demonstrations

Learning near-optimal behaviour from an expert's demonstrations typicall...
research
10/17/2019

Adaptive Curriculum Generation from Demonstrations for Sim-to-Real Visuomotor Control

We propose Adaptive Curriculum Generation from Demonstrations (ACGD) for...
research
08/14/2020

Mastering Rate based Curriculum Learning

Recent automatic curriculum learning algorithms, and in particular Teach...
research
10/26/2019

ZPD Teaching Strategies for Deep Reinforcement Learning from Demonstrations

Learning from demonstrations is a popular tool for accelerating and redu...

Please sign up or login with your details

Forgot password? Click here to reset