Clustering Multivariate Data using Factor Analytic Bayesian Mixtures with an Unknown Number of Components

06/02/2019
by   Panagiotis Papastamoulis, et al.
0

Recent work on overfitting Bayesian mixtures of distributions offers a powerful framework for clustering multivariate data using a latent Gaussian model which resembles the factor analysis model. The flexibility provided by overfitting mixture models yields a simple and efficient way in order to estimate the unknown number of clusters and model parameters by Markov chain Monte Carlo (MCMC) sampling. The present study extends this approach by considering a set of eight parameterizations, giving rise to parsimonious representations of the covariance matrix per cluster. A Gibbs sampler combined with a prior parallel tempering scheme is implemented in order to approximately sample from the posterior distribution of the overfitting mixture. The parameterization and number of factors is selected according to the Bayesian Information Criterion. Identifiability issues related to label switching are dealt by post-processing the simulated output with the Equivalence Classes Representatives algorithm. The contributed method and software are demonstrated and compared to similar models estimated using the Expectation-Maximization algorithm on simulated and real datasets. The software is available online at https://CRAN.R-project.org/package=fabMix.

READ FULL TEXT

page 11

page 13

page 15

page 18

page 19

page 20

research
07/28/2022

Model based clustering of multinomial count data

We consider the problem of inferring an unknown number of clusters in re...
research
03/31/2021

pivmet: Pivotal Methods for Bayesian Relabelling and k-Means Clustering

The identification of groups' prototypes, i.e. elements of a dataset tha...
research
04/10/2020

On the identifiability of Bayesian factor analytic models

A well known identifiability issue in factor analytic models is the inva...
research
10/16/2020

Analysis of professional basketball field goal attempts via a Bayesian matrix clustering approach

We propose a Bayesian nonparametric matrix clustering approach to analyz...
research
07/22/2018

Finite mixtures of matrix-variate Poisson-log normal distributions for three-way count data

Three-way data structures, characterized by three entities, the units, t...
research
06/19/2020

Bayesian analysis of mixture autoregressive models covering the complete parameter space

Mixture autoregressive (MAR) models provide a flexible way to model time...
research
09/22/2020

Finite mixture modeling of censored and missing data using the multivariate skew-normal distribution

Finite mixture models have been widely used to model and analyze data fr...

Please sign up or login with your details

Forgot password? Click here to reset