Learning Personalized Decision Support Policies

04/13/2023
by   Umang Bhatt, et al.
0

Individual human decision-makers may benefit from different forms of support to improve decision outcomes. However, a key question is which form of support will lead to accurate decisions at a low cost. In this work, we propose learning a decision support policy that, for a given input, chooses which form of support, if any, to provide. We consider decision-makers for whom we have no prior information and formalize learning their respective policies as a multi-objective optimization problem that trades off accuracy and cost. Using techniques from stochastic contextual bandits, we propose , an online algorithm to personalize a decision support policy for each decision-maker, and devise a hyper-parameter tuning strategy to identify a cost-performance trade-off using simulated human behavior. We provide computational experiments to demonstrate the benefits of compared to offline baselines. We then introduce , an interactive tool that provides with an interface. We conduct human subject experiments to show how learns policies personalized to each decision-maker and discuss the nuances of learning decision support policies online for real users.

READ FULL TEXT

page 38

page 40

page 41

page 42

research
06/03/2020

Learning Robust Decision Policies from Observational Data

We address the problem of learning a decision policy from observational ...
research
07/13/2021

Inverse Contextual Bandits: Learning How Behavior Evolves over Time

Understanding an agent's priorities by observing their behavior is criti...
research
12/13/2022

Policy learning for many outcomes of interest: Combining optimal policy trees with multi-objective Bayesian optimisation

Methods for learning optimal policies use causal machine learning models...
research
05/27/2023

Optimization's Neglected Normative Commitments

Optimization is offered as an objective approach to resolving complex, r...
research
09/09/2020

Multi-Objective Reinforcement Learning for Infectious Disease Control with Application to COVID-19 Spread

Severe infectious diseases such as the novel coronavirus (COVID-19) pose...
research
05/21/2017

Balanced Policy Evaluation and Learning

We present a new approach to the problems of evaluating and learning per...
research
05/15/2014

Multi-Criteria Optimal Planning for Energy Policies in CLP

In the policy making process a number of disparate and diverse issues su...

Please sign up or login with your details

Forgot password? Click here to reset