Optimal Policies for the Homogeneous Selective Labels Problem

11/02/2020
by   Dennis Wei, et al.
0

Selective labels are a common feature of consequential decision-making applications, referring to the lack of observed outcomes under one of the possible decisions. This paper reports work in progress on learning decision policies in the face of selective labels. The setting considered is both a simplified homogeneous one, disregarding individuals' features to facilitate determination of optimal policies, and an online one, to balance costs incurred in learning with future utility. For maximizing discounted total reward, the optimal policy is shown to be a threshold policy, and the problem is one of optimal stopping. In contrast, for undiscounted infinite-horizon average reward, optimal policies have positive acceptance probability in all states. Future work stemming from these results is discussed.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/02/2018

Learning under selective labels in the presence of expert consistency

We explore the problem of learning under selective labels in the context...
research
05/22/2019

Optimal Decision Making Under Strategic Behavior

We are witnessing an increasing use of data-driven predictive models to ...
research
07/23/2020

Batch Policy Learning in Average Reward Markov Decision Processes

We consider the batch (off-line) policy learning problem in the infinite...
research
01/30/2013

An Anytime Algorithm for Decision Making under Uncertainty

We present an anytime algorithm which computes policies for decision pro...
research
01/23/2013

My Brain is Full: When More Memory Helps

We consider the problem of finding good finite-horizon policies for POMD...
research
10/18/2020

Average-reward model-free reinforcement learning: a systematic review and literature mapping

Model-free reinforcement learning (RL) has been an active area of resear...
research
12/22/2020

The Value of Information and Efficient Switching in Channel Selection

We consider a collection of statistically identical two-state continuous...

Please sign up or login with your details

Forgot password? Click here to reset