Time-uniform confidence bands for the CDF under nonstationarity

02/28/2023
by   Paul Mineiro, et al.
0

Estimation of the complete distribution of a random variable is a useful primitive for both manual and automated decision making. This problem has received extensive attention in the i.i.d. setting, but the arbitrary data dependent setting remains largely unaddressed. Consistent with known impossibility results, we present computationally felicitous time-uniform and value-uniform bounds on the CDF of the running averaged conditional distribution of a real-valued random variable which are always valid and sometimes trivial, along with an instance-dependent convergence guarantee. The importance-weighted extension is appropriate for estimating complete counterfactual distributions of rewards given controlled experimentation data exhaust, e.g., from an A/B test or a contextual bandit.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2022

Anytime-valid off-policy inference for contextual bandits

Contextual bandit algorithms are ubiquitous tools for active sequential ...
research
06/28/2012

Extension of Three-Variable Counterfactual Casual Graphic Model: from Two-Value to Three-Value Random Variable

The extension of counterfactual causal graphic model with three variable...
research
10/07/2021

Uniform Guarded Fragments

In this paper we prove that the uniform one-dimensional guarded fragment...
research
12/18/2020

Local Dvoretzky-Kiefer-Wolfowitz confidence bands

In this paper, we revisit the concentration inequalities for the supremu...
research
07/01/2023

Universal kernel-type estimation of random fields

Consistent weighted least square estimators are proposed for a wide clas...
research
05/13/2014

On the Complexity of A/B Testing

A/B testing refers to the task of determining the best option among two ...

Please sign up or login with your details

Forgot password? Click here to reset