Learning to Arbitrate Human and Robot Control using Disagreement between Sub-Policies

08/24/2021
by   Yoojin Oh, et al.
0

In the context of teleoperation, arbitration refers to deciding how to blend between human and autonomous robot commands. We present a reinforcement learning solution that learns an optimal arbitration strategy that allocates more control authority to the human when the robot comes across a decision point in the task. A decision point is where the robot encounters multiple options (sub-policies), such as having multiple paths to get around an obstacle or deciding between two candidate goals. By expressing each directional sub-policy as a von Mises distribution, we identify the decision points by observing the modality of the mixture distribution. Our reward function reasons on this modality and prioritizes to match its learned policy to either the user or the robot accordingly. We report teleoperation experiments on reach-and-grasping objects using a robot manipulator arm with different simulated human controllers. Results indicate that our shared control agent outperforms direct control and improves the teleoperation performance among different users. Using our reward term enables flexible blending between human and robot commands while maintaining safe and accurate teleoperation.

READ FULL TEXT
research
06/27/2019

Demonstration-Guided Deep Reinforcement Learning of Control Policies for Dexterous Human-Robot Interaction

In this paper, we propose a method for training control policies for hum...
research
09/11/2023

Effect of Adapting to Human Preferences on Trust in Human-Robot Teaming

We present the effect of adapting to human preferences on trust in a hum...
research
04/14/2018

Intrinsically motivated reinforcement learning for human-robot interaction in the real-world

For a natural social human-robot interaction, it is essential for a robo...
research
03/11/2020

A General Arbitration Model for Robust Human-Robot Shared Control with Multi-Source Uncertainty Modeling

Shared control in teleoperation leverages both human and robot's strengt...
research
09/14/2019

Modeling Collaboration for Robot-assisted Dressing Tasks

We investigated the application of haptic aware feedback control and dee...
research
04/14/2022

Blending Primitive Policies in Shared Control for Assisted Teleoperation

Movement primitives have the property to accommodate changes in the robo...
research
11/30/2020

Learning from Incremental Directional Corrections

This paper proposes a technique which enables a robot to learn a control...

Please sign up or login with your details

Forgot password? Click here to reset