A Framework for the Meta-Analysis of Randomized Experiments with Applications to Heavy-Tailed Response Data

12/14/2021
by   Nilesh Tripuraneni, et al.
0

A central obstacle in the objective assessment of treatment effect (TE) estimators in randomized control trials (RCTs) is the lack of ground truth (or validation set) to test their performance. In this paper, we provide a novel cross-validation-like methodology to address this challenge. The key insight of our procedure is that the noisy (but unbiased) difference-of-means estimate can be used as a ground truth "label" on a portion of the RCT, to test the performance of an estimator trained on the other portion. We combine this insight with an aggregation scheme, which borrows statistical strength across a large collection of RCTs, to present an end-to-end methodology for judging an estimator's ability to recover the underlying treatment effect. We evaluate our methodology across 709 RCTs implemented in the Amazon supply chain. In the corpus of AB tests at Amazon, we highlight the unique difficulties associated with recovering the treatment effect due to the heavy-tailed nature of the response variables. In this heavy-tailed setting, our methodology suggests that procedures that aggressively downweight or truncate large values, while introducing bias, lower the variance enough to ensure that the treatment effect is more accurately estimated.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/30/2021

Efficiency of Regression (Un)-Adjusted Rosenbaum's Rank-based Estimator in Randomized Experiments

A completely randomized experiment allows us to estimate the causal effe...
research
05/22/2019

Measuring Average Treatment Effect from Heavy-tailed Data

Heavy-tailed metrics are common and often critical to product evaluation...
research
10/13/2021

Estimation and Inference of Extremal Quantile Treatment Effects for Heavy-Tailed Distributions

Causal inference for extreme events has many potential applications in f...
research
12/16/2020

No-harm calibration for generalized Oaxaca-Blinder estimators

In randomized experiments, linear regression with baseline features can ...
research
11/24/2018

A multiple comparison procedure for dose-finding trials with subpopulations

Identifying subgroups of patients with an enhanced response to a new tre...
research
10/12/2020

A Matching Procedure for Sequential Experiments that Iteratively Learns which Covariates Improve Power

We propose a dynamic allocation procedure that increases power and effic...

Please sign up or login with your details

Forgot password? Click here to reset