Reinforcement Learning with Fairness Constraints for Resource Distribution in Human-Robot Teams

06/30/2019
by   Houston Claure, et al.
0

Much work in robotics and operations research has focused on optimal resource distribution, where an agent dynamically decides how to sequentially distribute resources among different candidates. However, most work ignores the notion of fairness in candidate selection. In the case where a robot distributes resources to human team members, favoring heavily the highest performing teammate can have negative effects in team dynamics and system acceptance. We introduce a multi-armed bandit algorithm with fairness constraints, where a robot distributes resources to human teammates of different skill levels. In this problem, the robot does not know the skill level of each human teammate, but learns it by observing their performance over time. We define fairness as a constraint on the minimum rate that each human teammate is selected throughout the task. We provide theoretical guarantees on performance and perform a large-scale user study, where we adjust the level of fairness in our algorithm. Results show that fairness in resource distribution has a significant effect on users' trust in the system.

READ FULL TEXT

page 5

page 7

research
12/13/2019

Fair Contextual Multi-Armed Bandits: Theory and Experiments

When an AI system interacts with multiple users, it frequently needs to ...
research
12/22/2018

Robot Assisted Tower Construction - A Resource Distribution Task to Study Human-Robot Collaboration and Interaction with Groups of People

Research on human-robot collaboration or human-robot teaming, has focuse...
research
04/12/2021

Risk-Averse Biased Human Policies in Assistive Multi-Armed Bandit Settings

Assistive multi-armed bandit problems can be used to model team situatio...
research
05/27/2019

Stochastic Multi-armed Bandits with Arm-specific Fairness Guarantees

We study an interesting variant of the stochastic multi-armed bandit pro...
research
09/11/2023

Steps Towards Satisficing Distributed Dynamic Team Trust

Defining and measuring trust in dynamic, multiagent teams is important i...
research
10/02/2021

Partner-Aware Algorithms in Decentralized Cooperative Bandit Teams

When humans collaborate with each other, they often make decisions by ob...
research
03/23/2022

The Harmony Index: a Utilitarian Metric for Measuring Effectiveness in Mixed-Skill Teams

As teamwork becomes ever-more important in a new age of remote work, it ...

Please sign up or login with your details

Forgot password? Click here to reset