Learning from Richer Human Guidance: Augmenting Comparison-Based Learning with Feature Queries

02/05/2018
by   Chandrayee Basu, et al.

We focus on learning the desired objective function for a robot. Although trajectory demonstrations can be very informative of the desired objective, they can also be difficult for users to provide. Answers to comparison queries, which ask which of two trajectories is preferable, are much easier for users to give and have emerged as an effective alternative. Unfortunately, comparisons are far less informative than demonstrations. We propose that there is much richer information that users can easily provide and that robots ought to leverage. We focus on augmenting comparisons with feature queries, and introduce a unified formalism for treating all answers as observations about the true desired reward. We derive an active query selection algorithm, and test these queries in simulation and with real users. We find that richer, feature-augmented queries can extract more information faster, leading to robots whose behavior better matches user preferences.
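The idea of treating every answer as an observation about the true reward can be illustrated with a small Bayesian update. The sketch below is not the paper's formulation: it assumes a reward that is linear in trajectory features, a softmax (Bradley-Terry) model for comparison answers, and a purely hypothetical model for feature answers in which the user names the feature that contributes most to the reward difference. All function names and the two candidate reward vectors are illustrative.

```python
import math

def comparison_likelihood(w, phi_a, phi_b, answer):
    """P(answer | w) under an assumed softmax choice model; answer is "a" or "b"."""
    r_a = sum(wi * fi for wi, fi in zip(w, phi_a))
    r_b = sum(wi * fi for wi, fi in zip(w, phi_b))
    p_a = 1.0 / (1.0 + math.exp(r_b - r_a))
    return p_a if answer == "a" else 1.0 - p_a

def feature_likelihood(w, phi_a, phi_b, feature):
    """Hypothetical model: the user names feature i with probability
    proportional to exp(|w_i * (phi_a_i - phi_b_i)|)."""
    contrib = [abs(wi * (fa - fb)) for wi, fa, fb in zip(w, phi_a, phi_b)]
    z = sum(math.exp(c) for c in contrib)
    return math.exp(contrib[feature]) / z

def bayes_update(belief, candidates, likelihood):
    """Reweight a discrete belief over candidate reward vectors by one answer."""
    post = [p * likelihood(w) for p, w in zip(belief, candidates)]
    total = sum(post)
    return [p / total for p in post]

# Two illustrative candidate reward vectors and a uniform prior.
candidates = [(1.0, 0.0), (0.0, 1.0)]
prior = [0.5, 0.5]

# Features of the two trajectories shown in the query.
phi_a, phi_b = (1.0, 0.0), (0.0, 1.0)

# The user prefers trajectory A ...
post_cmp = bayes_update(
    prior, candidates, lambda w: comparison_likelihood(w, phi_a, phi_b, "a"))

# ... and additionally names feature 0 as the one that matters.
post_feat = bayes_update(
    post_cmp, candidates, lambda w: feature_likelihood(w, phi_a, phi_b, 0))

print(post_cmp, post_feat)
```

In this toy setting the feature answer sharpens the belief beyond what the comparison alone achieves, which is the qualitative effect the abstract describes. An active query selection step would then pick the next query to maximize the expected reduction in uncertainty over the candidates; that step is omitted here.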

