Divide and Recombine for Large and Complex Data: Model Likelihood Functions using MCMC

01/15/2018
by Qi Liu, et al.

In Divide & Recombine (D&R), big data are divided into subsets, each analytic method is applied to the subsets, and the outputs are recombined. This enables deep analysis with practical computational performance. An innovative D&R procedure is proposed to compute likelihood functions of data-model (DM) parameters for big data. The likelihood-model (LM) is a parametric probability density function of the DM parameters. The density parameters are estimated by fitting the density to MCMC draws from each subset DM likelihood function, and then the fitted densities are recombined. The procedure is illustrated using normal and skew-normal LMs for the logistic regression DM.
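The steps described in the abstract can be sketched in a few lines for the normal LM case. This is a minimal illustration under stated assumptions, not the authors' implementation: the sampler is a plain random-walk Metropolis, the subset likelihood is treated as an unnormalized density (which assumes the data are not perfectly separated), and recombination multiplies the fitted normal densities, which amounts to summing precisions. All function names (`log_lik`, `mcmc_draws`, `fit_normal`, `recombine`), the step size, and the simulated data are hypothetical choices made for this example.

```python
import numpy as np

def log_lik(beta, X, y):
    """Logistic-regression (DM) log-likelihood of beta for one subset."""
    eta = X @ beta
    # log(1 + exp(eta)) computed stably via logaddexp
    return float(np.sum(y * eta - np.logaddexp(0.0, eta)))

def mcmc_draws(X, y, n_draws=4000, burn=1000, step=0.1, seed=0):
    """Random-walk Metropolis draws from the subset likelihood,
    treated as an unnormalized density in beta (an assumption)."""
    rng = np.random.default_rng(seed)
    beta = np.zeros(X.shape[1])
    cur = log_lik(beta, X, y)
    out = np.empty((n_draws, beta.size))
    for i in range(burn + n_draws):
        prop = beta + step * rng.standard_normal(beta.size)
        new = log_lik(prop, X, y)
        if np.log(rng.uniform()) < new - cur:  # Metropolis accept step
            beta, cur = prop, new
        if i >= burn:
            out[i - burn] = beta
    return out

def fit_normal(draws):
    """Fit the normal LM to the draws: sample mean and covariance."""
    return draws.mean(axis=0), np.cov(draws, rowvar=False)

def recombine(fits):
    """Multiply the fitted normal densities: precisions add, and the
    mean is the precision-weighted average of the subset means."""
    precisions = [np.linalg.inv(cov) for _, cov in fits]
    total_cov = np.linalg.inv(np.sum(precisions, axis=0))
    weighted = np.sum([p @ m for p, (m, _) in zip(precisions, fits)], axis=0)
    return total_cov @ weighted, total_cov

# Simulated data (hypothetical): intercept plus one covariate.
rng = np.random.default_rng(1)
n, true_beta = 4000, np.array([-0.5, 1.0])
X = np.column_stack([np.ones(n), rng.standard_normal(n)])
y = (rng.uniform(size=n) < 1.0 / (1.0 + np.exp(-X @ true_beta))).astype(float)

# Divide: four random subsets. Apply: MCMC plus normal fit per subset.
subsets = np.array_split(rng.permutation(n), 4)
fits = [fit_normal(mcmc_draws(X[idx], y[idx], seed=k))
        for k, idx in enumerate(subsets)]

# Recombine the fitted subset densities into one normal LM.
dr_mean, dr_cov = recombine(fits)
print("recombined mean:", dr_mean)
```

Multiplying the fitted densities mirrors the fact that the full-data likelihood is the product of the subset likelihoods when the subsets are independent. A skew-normal LM would need a different fitting step and a numerical recombination, which this sketch does not attempt.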


