Targeting for long-term outcomes

10/29/2020
by   Jeremy Yang, et al.
25

Decision-makers often want to target interventions (e.g., marketing campaigns) so as to maximize an outcome that is observed only in the long-term. This typically requires delaying decisions until the outcome is observed or relying on simple short-term proxies for the long-term outcome. Here we build on the statistical surrogacy and off-policy learning literature to impute the missing long-term outcomes and then approximate the optimal targeting policy on the imputed outcomes via a doubly-robust approach. We apply our approach in large-scale proactive churn management experiments at The Boston Globe by targeting optimal discounts to its digital subscribers to maximize their long-term revenue. We first show that conditions for validity of average treatment effect estimation with imputed outcomes are also sufficient for valid policy evaluation and optimization; furthermore, these conditions can be somewhat relaxed for policy optimization. We then validate this approach empirically by comparing it with a policy learned on the ground truth long-term outcomes and show that they are statistically indistinguishable. Our approach also outperforms a policy learned on short-term proxies for the long-term outcome. In a second field experiment, we implement the optimal targeting policy with additional randomized exploration, which allows us to update the optimal policy for each new cohort of customers to account for potential non-stationarity. Over three years, our approach had a net-positive revenue impact in the range of 4-5 million compared to The Boston Globe's current policies.

READ FULL TEXT

page 26

page 27

research
02/15/2022

Long-term Causal Inference Under Persistent Confounding via Data Combination

We study the identification and estimation of long-term treatment effect...
research
03/21/2023

Policy Optimization for Personalized Interventions in Behavioral Health

Problem definition: Behavioral health interventions, delivered through d...
research
07/24/2018

Learning from Delayed Outcomes with Intermediate Observations

Optimizing for long term value is desirable in many practical applicatio...
research
02/09/2018

Long-Term-Unemployed hirings: Should targeted or untargeted policies be preferred?

To what extent, hiring incentives targeting a specific group of vulnerab...
research
11/26/2017

Simulating outcomes of interventions using a multipurpose simulation program based on the Evolutionary Causal Matrices and Markov Chain

Predicting long-term outcomes of interventions is necessary for educatio...
research
02/10/2022

Network Interference in Micro-Randomized Trials

The micro-randomized trial (MRT) is an experimental design that can be u...
research
07/03/2023

Pareto optimal proxy metrics

North star metrics and online experimentation play a central role in how...

Please sign up or login with your details

Forgot password? Click here to reset