A bootstrap analysis for finite populations

04/14/2018
by   Tina Nane, et al.
0

Bootstrap methods are increasingly accepted as one of the common approaches in constructing confidence intervals in bibliometric studies. Typical bootstrap methods assume that the statistical population is infinite. When the statistical population is finite, a correction needs to be applied in computing the estimated variance of the estimators and thus constructing confidence intervals. We investigate the effect of overlooking the finiteness assumption of the statistical population using a dataset containing all articles in Web of Science (WoS) for Delft University of Technology from 2006 until 2009. We regard the data as our finite statistical population and consider simple random samples of various sizes. Standard bootstrap methods are firstly employed in accounting for the variability of the estimates, as well as constructing the confidence intervals. The results unveil two issues, namely that the variability in the estimates does not decrease to zero as the sample size approaches the population size and that the confidence intervals are not valid. Both issues are addressed when accounting for a finite population correction in the bootstrap methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/30/2023

Credible intervals and bootstrap confidence intervals in monotone regression

In the recent paper [5], a Bayesian approach for constructing confidence...
research
10/20/2022

Finite-Sample Coverage Errors of the Cheap Bootstrap With Minimal Resampling Effort

The bootstrap is a popular data-driven method to quantify statistical un...
research
11/18/2013

Confidence Intervals for Random Forests: The Jackknife and the Infinitesimal Jackknife

We study the variability of predictions made by bagged learners and rand...
research
07/12/2018

Statistical Inference with Local Optima

We study the statistical properties of an estimator derived by applying ...
research
12/23/2019

Quantifying the Effects of the 2008 Recession using the Zillow Dataset

This report explores the use of Zillow's housing metrics dataset to inve...
research
06/26/2020

Parametric Bootstrap Confidence Intervals for the Multivariate Fay-Herriot Model

The multivariate Fay-Herriot model is quite effective in combining infor...
research
07/31/2020

Partial identification and dependence-robust confidence intervals for capture-recapture surveys

Capture-recapture (CRC) surveys are widely used to estimate the size of ...

Please sign up or login with your details

Forgot password? Click here to reset