Design-unbiased statistical learning in survey sampling

03/25/2020
by   Luis Sanguiao Sande, et al.
0

Design-consistent model-assisted estimation has become the standard practice in survey sampling. However, a general theory is lacking so far, which allows one to incorporate modern machine-learning techniques that can lead to potentially much more powerful assisting models. We propose a subsampling Rao-Blackwell method, and develop a statistical learning theory for exactly design-unbiased estimation with the help of linear or non-linear prediction models. Our approach makes use of classic ideas from Statistical Science as well as the rapidly growing field of Machine Learning. Provided rich auxiliary information, it can yield considerable efficiency gains over standard linear model-assisted methods, while ensuring valid estimation for the given target population, which is robust against potential mis-specifications of the assisting model at the individual level.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/11/2019

Shapley regressions: A framework for statistical inference on machine learning models

Machine learning models often excel in the accuracy of their predictions...
research
04/08/2020

Incidence weighting estimation under bipartite incidence graph sampling

Bipartite incidence graph sampling provides a unified representation of ...
research
12/15/2017

Automated Selection of Post-Strata using a Model-Assisted Regression Tree Estimator

Auxiliary information can increase the efficiency of survey estimators t...
research
05/02/2019

A Conditional Empirical Likelihood Based Method for Model Parameter Estimation from Complex survey Datasets

We consider an empirical likelihood framework for inference for a statis...
research
04/16/2019

Bayesian Mixed Effects Model Estimation under Informative Sampling

When random effects are correlated with the response variable of interes...
research
12/14/2020

Model-assisted estimation in high-dimensional settings for survey data

Model-assisted estimators have attracted a lot of attention in the last ...
research
05/31/2017

The ALAMO approach to machine learning

ALAMO is a computational methodology for leaning algebraic functions fro...

Please sign up or login with your details

Forgot password? Click here to reset