Optimal Sub-sampling with Influence Functions

09/06/2017
by Daniel Ting, et al.

Sub-sampling is a common and often effective way to deal with the computational challenges of large datasets. For most statistical models, however, there is no well-motivated approach for drawing a non-uniform subsample. We show that the concept of an asymptotically linear estimator, together with its associated influence function, leads to optimal sampling procedures for a wide class of popular models. Furthermore, for linear regression models, which have well-studied procedures for non-uniform sub-sampling, we show that our optimal influence-function-based method outperforms previous approaches. We demonstrate the improved performance of our method empirically on real datasets.
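As a rough illustration of the idea in the abstract, the sketch below applies influence-based sub-sampling to the simplest possible case: a one-parameter least-squares model y ≈ βx with no intercept. It is not the authors' implementation. The rule used here (sample each point with probability proportional to the magnitude of its influence, then reweight by inverse probability) is an assumption made for illustration, as are the function names, the pilot estimate, and the `mix` parameter that blends in a uniform component so that no point has zero probability.

```python
import random


def pilot_slope(xs, ys):
    """Ordinary least-squares slope for y ~ beta * x (no intercept)."""
    sxx = sum(x * x for x in xs)
    sxy = sum(x * y for x, y in zip(xs, ys))
    return sxy / sxx


def influence_probabilities(xs, ys, beta, mix=0.1):
    """Sampling probabilities proportional to |influence|, mixed with uniform.

    For this model the influence of point (x, y) on the slope is
    x * (y - x * beta) / mean(x^2). `mix` is a hypothetical safeguard
    that keeps every probability strictly positive.
    """
    n = len(xs)
    mean_xx = sum(x * x for x in xs) / n
    norms = [abs(x * (y - x * beta) / mean_xx) for x, y in zip(xs, ys)]
    total = sum(norms)
    if total == 0:
        # All residuals are zero: fall back to uniform sampling.
        return [1.0 / n] * n
    return [(1 - mix) * v / total + mix / n for v in norms]


def subsample_slope(xs, ys, probs, m, rng):
    """Draw m points with replacement and fit an inverse-probability-weighted slope."""
    idx = rng.choices(range(len(xs)), weights=probs, k=m)
    num = sum(xs[i] * ys[i] / probs[i] for i in idx)
    den = sum(xs[i] * xs[i] / probs[i] for i in idx)
    return num / den


# Usage on synthetic data (true slope 2.0, chosen only for this example).
rng = random.Random(0)
xs = [rng.gauss(0, 1) for _ in range(5000)]
ys = [2.0 * x + rng.gauss(0, 1) for x in xs]

beta0 = pilot_slope(xs[:200], ys[:200])          # cheap pilot estimate
probs = influence_probabilities(xs, ys, beta0)   # non-uniform sampling plan
beta_hat = subsample_slope(xs, ys, probs, 500, rng)
print(beta_hat)
```

The pilot estimate is needed because the influence of a point depends on the unknown parameter; here it comes from the first 200 rows, which behave like a uniform subsample because the synthetic data are independent and identically distributed.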


research
07/02/2016

Sub-sampled Newton Methods with Non-uniform Sampling

We consider the problem of finding the minimizer of a convex function F:...
research
07/07/2013

Achieving greater Explanatory Power and Forecasting Accuracy with Non-uniform spread Fuzzy Linear Regression

Fuzzy regression models have been applied to several Operations Research...
research
02/03/2021

Optimal Non-Uniform Deployments of LoRa Networks

LoRa wireless technology is an increasingly prominent solution for massi...
research
01/24/2017

By chance is not enough: Preserving relative density through non uniform sampling

Dealing with visualizations containing large data set is a challenging i...
research
12/16/2020

Testing the Stationarity Assumption in Software Effort Estimation Datasets

Software effort estimation (SEE) models are typically developed based on...
research
04/29/2022

AdaInt: Learning Adaptive Intervals for 3D Lookup Tables on Real-time Image Enhancement

The 3D Lookup Table (3D LUT) is a highly-efficient tool for real-time im...
research
03/22/2023

Revisiting the Fragility of Influence Functions

In the last few years, many works have tried to explain the predictions ...
