Risk-Aware Active Inverse Reinforcement Learning

01/08/2019
by   Daniel S. Brown, et al.
0

Active learning from demonstration allows a robot to query a human for specific types of input to achieve efficient learning. Existing work has explored a variety of active query strategies; however, to our knowledge, none of these strategies directly minimize the performance risk of the policy the robot is learning. Utilizing recent advances in performance bounds for inverse reinforcement learning, we propose a risk-aware active inverse reinforcement learning algorithm that focuses active queries on areas of the state space with the potential for large generalization error. We show that risk-aware active learning outperforms standard active IRL approaches on gridworld, simulated driving, and table setting tasks, while also providing a performance-based stopping criterion that allows a robot to know when it has received enough demonstrations to safely perform a task.

READ FULL TEXT

page 5

page 7

research
07/03/2017

Efficient Probabilistic Performance Bounds for Inverse Reinforcement Learning

In the field of reinforcement learning there has been recent progress to...
research
09/14/2019

Active Learning for Risk-Sensitive Inverse Reinforcement Learning

One typical assumption in inverse reinforcement learning (IRL) is that h...
research
01/23/2013

Multi-class Generalized Binary Search for Active Inverse Reinforcement Learning

This paper addresses the problem of learning a task from demonstration. ...
research
06/29/2019

Active Learning of Probabilistic Movement Primitives

A Probabilistic Movement Primitive (ProMP) defines a distribution over t...
research
11/18/2019

Bias-Aware Heapified Policy for Active Learning

The data efficiency of learning-based algorithms is more and more import...
research
01/31/2023

Learning Risk-Aware Costmaps via Inverse Reinforcement Learning for Off-Road Navigation

The process of designing costmaps for off-road driving tasks is often a ...
research
03/17/2018

Structural query-by-committee

In this work, we describe a framework that unifies many different intera...

Please sign up or login with your details

Forgot password? Click here to reset