Divide-and-Conquer MCMC for Multivariate Binary Data

02/17/2021
by Suchit Mehrotra, et al.

The analysis of large-scale medical claims data has the potential to improve quality of care by generating insights that can be used to create tailored medical programs. In particular, the multivariate probit model can be used to investigate the correlation between multiple binary responses of interest in such data, e.g. the presence of multiple chronic conditions. Bayesian modeling is well suited to such analyses because the posterior distribution provides automatic uncertainty quantification. A complicating factor is that large medical claims datasets often do not fit in memory, which renders estimation of the posterior using traditional Markov chain Monte Carlo (MCMC) methods computationally infeasible. To address this challenge, we extend existing divide-and-conquer MCMC algorithms to the multivariate probit model and demonstrate, via simulation, that they should be preferred over mean-field variational inference when estimation of the latent correlation structure between binary responses is of primary interest. We apply this algorithm to a large database of de-identified Medicare Advantage claims from a single large US health insurance provider, where we find medically meaningful groupings of common chronic conditions and assess the impact of the urban-rural health gap by identifying underutilized provider specialties in rural areas.
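To make the setting concrete, the following is a minimal sketch of the multivariate probit data-generating process and of the data-partitioning step that any divide-and-conquer scheme starts from. It is an illustration under stated assumptions, not the authors' implementation: the function names `simulate_mvp` and `split_into_shards`, the coefficient matrix `beta`, the latent correlation matrix `corr`, and the choice of four shards are all hypothetical, and the subset samplers and the rule for combining subset posteriors are not shown because the abstract does not specify them.

```python
import numpy as np


def simulate_mvp(n, beta, corr, rng):
    """Simulate (X, Y) from a multivariate probit model (illustrative only).

    beta: (p, q) regression coefficients, one column per binary response.
    corr: (q, q) correlation matrix of the latent Gaussian errors.
    """
    p, q = beta.shape
    X = rng.normal(size=(n, p))                   # synthetic covariates
    chol = np.linalg.cholesky(corr)               # corr = chol @ chol.T
    errors = rng.normal(size=(n, q)) @ chol.T     # rows ~ N(0, corr)
    Z = X @ beta + errors                         # latent continuous responses
    Y = (Z > 0).astype(int)                       # observed binary responses
    return X, Y


def split_into_shards(X, Y, k, rng):
    """Randomly partition the rows into k disjoint subsets of near-equal size."""
    idx = rng.permutation(X.shape[0])
    return [(X[part], Y[part]) for part in np.array_split(idx, k)]


rng = np.random.default_rng(0)

# Hypothetical values: 2 covariates, 3 binary responses.
beta = np.array([[0.5, -0.3, 0.2],
                 [0.1, 0.4, -0.6]])
corr = np.array([[1.0, 0.6, 0.0],
                 [0.6, 1.0, -0.3],
                 [0.0, -0.3, 1.0]])

X, Y = simulate_mvp(10_000, beta, corr, rng)
shards = split_into_shards(X, Y, k=4, rng=rng)

# Each shard would be passed to its own sampler; here we only report sizes.
for i, (X_s, Y_s) in enumerate(shards):
    print(f"shard {i}: {X_s.shape[0]} rows, response means {Y_s.mean(axis=0)}")
```

In this sketch the whole dataset is generated in memory for convenience; in the setting the abstract describes, the shards would instead be read from storage separately, since the full data do not fit in memory.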


