Offline Policy Evaluation with Out-of-Sample Guarantees

01/20/2023
by   Sofia Ek, et al.
0

We consider the problem of evaluating the performance of a decision policy using past observational data. The outcome of a policy is measured in terms of a loss or disutility (or negative reward) and the problem is to draw valid inferences about the out-of-sample loss of the specified policy when the past data is observed under a, possibly unknown, policy. Using a sample-splitting method, we show that it is possible to draw such inferences with finite-sample coverage guarantees that evaluate the entire loss distribution. Importantly, the method takes into account model misspecifications of the past policy – including unmeasured confounding. The evaluation method can be used to certify the performance of a policy using observational data under an explicitly specified range of credible model assumptions.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/03/2020

Learning Robust Decision Policies from Observational Data

We address the problem of learning a decision policy from observational ...
research
05/21/2017

Balanced Policy Evaluation and Learning

We present a new approach to the problems of evaluating and learning per...
research
02/12/2020

Efficient Policy Learning from Surrogate-Loss Classification Reductions

Recent work on policy learning from observational data has highlighted t...
research
06/05/2023

Inference under constrained distribution shifts

Large-scale administrative or observational datasets are increasingly us...
research
05/21/2023

Federated Offline Policy Learning with Heterogeneous Observational Data

We consider the problem of learning personalized decision policies on ob...
research
02/11/2015

Off-policy evaluation for MDPs with unknown structure

Off-policy learning in dynamic decision problems is essential for provid...
research
01/08/2019

Model-Predictive Policy Learning with Uncertainty Regularization for Driving in Dense Traffic

Learning a policy using only observational data is challenging because t...

Please sign up or login with your details

Forgot password? Click here to reset