Maximum sampled conditional likelihood for informative subsampling

11/11/2020
by HaiYing Wang, et al.

Subsampling is a computationally effective approach for extracting information from massive data sets when computing resources are limited. After a subsample is taken from the full data, most available methods use an inverse probability weighted objective function to estimate the model parameters. This type of weighted estimator does not fully utilize the information in the selected subsample. In this paper, we propose to use the maximum sampled conditional likelihood estimator (MSCLE) based on the sampled data. We establish the asymptotic normality of the MSCLE and prove that its asymptotic variance-covariance matrix is the smallest among a class of asymptotically unbiased estimators, including the inverse probability weighted estimator. We further discuss the asymptotic results with the L-optimal subsampling probabilities and illustrate the estimation procedure with generalized linear models. Numerical experiments are provided to evaluate the practical performance of the proposed method.
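To make the contrast between the two estimators concrete, the following is a minimal sketch for logistic regression, not the paper's implementation. It assumes Poisson subsampling with a hypothetical inclusion probability pi(x, y) = |y - p_pilot(x)|, where p_pilot comes from a small uniform pilot sample that is treated as fixed; this is not the L-optimal probability discussed in the abstract. For a binary response, conditioning on inclusion turns the model into a logistic regression with offset log(pi(x, 1) / pi(x, 0)), so the sampled conditional likelihood is maximized by an unweighted fit with that offset, while the inverse probability weighted (IPW) estimator uses weights 1 / pi(x, y) and no offset. The names `fit_logistic` and `run_demo`, the simulated data, and all parameter values are illustrative assumptions.

```python
import math
import random


def sigmoid(t):
    """Numerically stable logistic function."""
    if t >= 0:
        return 1.0 / (1.0 + math.exp(-t))
    e = math.exp(t)
    return e / (1.0 + e)


def fit_logistic(xs, ys, weights=None, offsets=None, init=(0.0, 0.0), iters=25):
    """Newton's method for a logistic model with an intercept and one covariate.

    weights multiply each log-likelihood term (used by the IPW fit);
    offsets are added to the linear predictor (used by the conditional fit).
    """
    n = len(xs)
    w = weights if weights is not None else [1.0] * n
    o = offsets if offsets is not None else [0.0] * n
    b0, b1 = init
    for _ in range(iters):
        g0 = g1 = h00 = h01 = h11 = 0.0
        for x, y, wi, oi in zip(xs, ys, w, o):
            p = sigmoid(b0 + b1 * x + oi)
            r, v = wi * (y - p), wi * p * (1.0 - p)
            g0 += r
            g1 += r * x
            h00 += v
            h01 += v * x
            h11 += v * x * x
        # Solve the 2x2 Newton system H * step = gradient in closed form.
        det = h00 * h11 - h01 * h01
        b0 += (h11 * g0 - h01 * g1) / det
        b1 += (h00 * g1 - h01 * g0) / det
    return b0, b1


def run_demo(seed=0, n_full=20000, n_pilot=500, true_theta=(-3.0, 1.0)):
    rng = random.Random(seed)
    # Simulated full data (an assumption for illustration only).
    xs = [rng.gauss(0.0, 1.0) for _ in range(n_full)]
    ys = [1 if rng.random() < sigmoid(true_theta[0] + true_theta[1] * x) else 0
          for x in xs]

    # Pilot estimate from a small uniform subsample, treated as fixed afterwards.
    idx = rng.sample(range(n_full), n_pilot)
    pilot = fit_logistic([xs[i] for i in idx], [ys[i] for i in idx])

    def pi(x, y):
        # Assumed informative inclusion probability; depends on both x and y.
        return max(1e-6, abs(y - sigmoid(pilot[0] + pilot[1] * x)))

    # Poisson subsampling: keep each point independently with probability pi.
    kept = [i for i in range(n_full) if rng.random() < pi(xs[i], ys[i])]
    sx = [xs[i] for i in kept]
    sy = [ys[i] for i in kept]

    # IPW: weighted likelihood with weights 1 / pi and no offset.
    ipw = fit_logistic(sx, sy, init=pilot,
                       weights=[1.0 / pi(x, y) for x, y in zip(sx, sy)])
    # Sampled conditional likelihood: unweighted fit with a log-ratio offset.
    mscle = fit_logistic(sx, sy, init=pilot,
                         offsets=[math.log(pi(x, 1) / pi(x, 0)) for x in sx])
    return {"n_sub": len(kept), "pilot": pilot, "ipw": ipw, "mscle": mscle}


results = run_demo()
for name in ("pilot", "ipw", "mscle"):
    print(name, results[name])
```

Both subsample fits start Newton's method from the pilot estimate, because starting the offset model from zero can overshoot when the offsets are large. A single run like this only illustrates the mechanics; comparing the variances of the two estimators would require repeating the simulation over many seeds.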


