Big Data and model-based survey sampling

02/11/2020
by   Deldossi Laura, et al.
0

Big Data are huge amounts of digital information that are automatically accrued or merged from several sources and rarely result from properly planned surveys. A Big Dataset is herein conceived of as a collection of information concerning a finite population. We suggest selecting a sample of observations to get the inferential goal. We assume a super-population model has generated the Big Dataset. With this assumption, we can apply the theory of optimal design to draw a sample from the Big Dataset that contains the majority of the information about the unknown parameters.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/29/2018

Sampling techniques for big data analysis in finite population inference

In analyzing big data for finite population inference, it is critical to...
research
04/01/2022

Real-world K-Anonymity Applications: the KGen approach and its evaluation in Fraudulent Transactions

K-Anonymity is a property for the measurement, management, and governanc...
research
06/28/2023

Integrating Big Data and Survey Data for Efficient Estimation of the Median

An ever-increasing deluge of big data is becoming available to national ...
research
02/15/2022

Survey of Big Data sizes in 2021

The modern increase in data production is driven by multiple factors, an...
research
10/11/2020

On Spatial Lag Models estimated using crowdsourcing, web-scraping or other unconventionally collected data

The Big Data revolution is challenging the state-of-the-art statistical ...
research
04/29/2023

Subdata selection for big data regression: an improved approach

In the big data era researchers face a series of problems. Even standard...
research
06/06/2022

Modeling Big Data-based Systems through Ontological Trading

One of the great challenges the information society faces is dealing wit...

Please sign up or login with your details

Forgot password? Click here to reset