A Framework for Statistical Inference via Randomized Algorithms

07/20/2023
by Zhixiang Zhang, et al.

Randomized algorithms, such as randomized sketching or projections, are a promising approach to ease the computational burden of analyzing large datasets. However, randomized algorithms also produce non-deterministic outputs, which raises the problem of evaluating their accuracy. In this paper, we develop a statistical inference framework for quantifying the uncertainty of the outputs of randomized algorithms. We develop appropriate statistical methods (sub-randomization, multi-run plug-in, and multi-run aggregation inference) by using multiple runs of the same randomized algorithm, or by estimating the unknown parameters of the limiting distribution. As an example, we develop methods for statistical inference for least squares parameters via random sketching, using matrices with i.i.d. entries or uniform partial orthogonal matrices. For this, we characterize the limiting distribution of estimators obtained via sketch-and-solve as well as partial sketching methods. The analysis of i.i.d. sketches uses a trigonometric interpolation argument to establish a differential equation for the limiting expected characteristic function and to find its dependence on the kurtosis of the entries of the sketching matrix. The results are supported by a broad range of simulations.
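
To make the setting concrete, the following is a minimal illustrative sketch of sketch-and-solve least squares repeated over several independent runs. It is not the paper's inference procedure. The Gaussian sketching matrix, the simulated data, the function names (sketch_and_solve, multi_run), and all sizes are assumptions chosen for illustration. It only shows that the sketched estimator varies from run to run and that the runs can be summarized empirically.

```python
import numpy as np

def sketch_and_solve(X, y, m, rng):
    """Least squares on the sketched data (S X, S y).

    Assumption: S is an m x n matrix with i.i.d. N(0, 1/m) entries.
    """
    n = X.shape[0]
    S = rng.standard_normal((m, n)) / np.sqrt(m)
    beta, *_ = np.linalg.lstsq(S @ X, S @ y, rcond=None)
    return beta

def multi_run(X, y, m, runs, seed=0):
    """Repeat sketch-and-solve with independent sketches; one row per run."""
    rng = np.random.default_rng(seed)
    return np.stack([sketch_and_solve(X, y, m, rng) for _ in range(runs)])

# Simulated data (hypothetical sizes): n observations, p features, sketch size m.
data_rng = np.random.default_rng(1)
n, p, m = 2000, 5, 400
X = data_rng.standard_normal((n, p))
beta_true = np.arange(1, p + 1, dtype=float)
y = X @ beta_true + data_rng.standard_normal(n)

# Full-data least squares solution, the target the sketched runs approximate.
beta_full, *_ = np.linalg.lstsq(X, y, rcond=None)

# Empirical summary across runs: center and run-to-run spread.
betas = multi_run(X, y, m, runs=50)
print("full-data LS:   ", beta_full)
print("mean over runs: ", betas.mean(axis=0))
print("std over runs:  ", betas.std(axis=0, ddof=1))
```

The spread across runs reflects only the randomness of the sketch, since the data are held fixed. Turning such runs into valid confidence statements is the question the framework above addresses.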

research
11/16/2020

Statistical Inference and Power Analysis for Direct and Spillover Effects in Two-Stage Randomized Experiments

Two-stage randomized experiments are becoming an increasingly popular ex...
research
02/03/2020

Limiting Spectrum of Randomized Hadamard Transform and Optimal Iterative Sketching Methods

We provide an exact analysis of the limiting spectrum of matrices random...
research
05/23/2022

Statistical inference as Green's functions

Statistical inference from data is a foundational task in science. Recentl...
research
06/06/2023

Statistical inference for sketching algorithms

Sketching algorithms use random projections to generate a smaller sketch...
research
02/21/2020

Optimal Randomized First-Order Methods for Least-Squares Problems

We provide an exact analysis of a class of randomized algorithms for sol...
research
12/13/2021

Inference via Randomized Test Statistics

We show that external randomization may enforce the convergence of test ...
research
07/03/2023

Statistical Inference on Multi-armed Bandits with Delayed Feedback

Multi-armed bandit (MAB) algorithms have been increasingly used to compl...
