Unified Learning from Demonstrations, Corrections, and Preferences during Physical Human-Robot Interaction

07/07/2022
by   Shaunak A. Mehta, et al.
0

Humans can leverage physical interaction to teach robot arms. This physical interaction takes multiple forms depending on the task, the user, and what the robot has learned so far. State-of-the-art approaches focus on learning from a single modality, or combine multiple interaction types by assuming that the robot has prior information about the human's intended task. By contrast, in this paper we introduce an algorithmic formalism that unites learning from demonstrations, corrections, and preferences. Our approach makes no assumptions about the tasks the human wants to teach the robot; instead, we learn a reward model from scratch by comparing the human's inputs to nearby alternatives. We first derive a loss function that trains an ensemble of reward models to match the human's demonstrations, corrections, and preferences. The type and order of feedback is up to the human teacher: we enable the robot to collect this feedback passively or actively. We then apply constrained optimization to convert our learned reward into a desired robot trajectory. Through simulations and a user study we demonstrate that our proposed approach more accurately learns manipulation tasks from physical human interaction than existing baselines, particularly when the robot is faced with new or unexpected objectives. Videos of our user study are available at: https://youtu.be/FSUJsTYvEKU

READ FULL TEXT

page 2

page 5

page 16

page 18

research
07/07/2022

Wrapping Haptic Displays Around Robot Arms to Communicate Learning

Humans can leverage physical interaction to teach robot arms. As the hum...
research
08/23/2022

The Effect of Modeling Human Rationality Level on Learning Rewards from Multiple Feedback Types

When inferring reward functions from human behavior (be it demonstration...
research
08/19/2023

StROL: Stabilized and Robust Online Learning from Humans

Today's robots can learn the human's reward function online, during the ...
research
11/09/2020

Joint Estimation of Expertise and Reward Preferences From Human Demonstrations

When a robot learns from human examples, most approaches assume that the...
research
03/09/2022

Learning from Physical Human Feedback: An Object-Centric One-Shot Adaptation Method

For robots to be effectively deployed in novel environments and tasks, t...
research
11/08/2021

Wrapped Haptic Display for Communicating Physical Robot Learning

Physical interaction between humans and robots can help robots learn to ...
research
01/31/2019

Characterizing Input Methods for Human-to-robot Demonstrations

Human demonstrations are important in a range of robotics applications, ...

Please sign up or login with your details

Forgot password? Click here to reset