Bayesian Active Learning for Collaborative Task Specification Using Equivalence Regions

01/28/2019
by   Nils Wilde, et al.
0

Specifying complex task behaviours while ensuring good robot performance may be difficult for untrained users. We study a framework for users to specify rules for acceptable behaviour in a shared environment such as industrial facilities. As non-expert users might have little intuition about how their specification impacts the robot's performance, we design a learning system that interacts with the user to find an optimal solution. Using active preference learning, we iteratively show alternative paths that the robot could take on an interface. From the user feedback ranking the alternatives, we learn about the weights that users place on each part of their specification. We extend the user model from our previous work to a discrete Bayesian learning model and introduce a greedy algorithm for proposing alternative that operates on the notion of equivalence regions of user weights. We prove that with this algorithm the revision active learning process converges on the user-optimal path. In simulations on realistic industrial environments, we demonstrate the convergence and robustness of our approach.

READ FULL TEXT

page 1

page 8

research
07/24/2019

Improving User Specifications for Robot Behavior through Active Preference Learning: Framework and Evaluation

An important challenge in human robot interaction (HRI) is enabling non-...
research
07/11/2012

A Bayesian Approach toward Active Learning for Collaborative Filtering

Collaborative filtering is a useful technique for exploiting the prefere...
research
05/08/2020

Active Preference Learning using Maximum Regret

We study active preference learning as a framework for intuitively speci...
research
05/09/2020

Empowering Active Learning to Jointly Optimize System and User Demands

Existing approaches to active learning maximize the system performance b...
research
08/11/2020

Maximizing BCI Human Feedback using Active Learning

Recent advancements in Learning from Human Feedback present an effective...
research
11/19/2017

Trajectory-Optimized Sensing for Active Search of Tissue Abnormalities in Robotic Surgery

In this work, we develop an approach for guiding robots to automatically...
research
07/19/2023

Learning Formal Specifications from Membership and Preference Queries

Active learning is a well-studied approach to learning formal specificat...

Please sign up or login with your details

Forgot password? Click here to reset