An Imputation model by Dirichlet Process Mixture of Elliptical Copulas for Data of Mixed Type
Copula-based methods provide a flexible approach to build missing data imputation models of multivariate data of mixed types. However, the choice of copula function is an open question. We consider a Bayesian nonparametric approach by using an infinite mixture of elliptical copulas induced by a Dirichlet process mixture to build a flexible copula function. A slice sampling algorithm is used to sample from the infinite dimensional parameter space. We extend the work on prior parallel tempering used in finite mixture models to the Dirichlet process mixture model to overcome the mixing issue in multimodal distributions. Using simulations, we demonstrate that the infinite mixture copula model provides a better overall fit compared to their single component counterparts, and performs better at capturing tail dependence features of the data. Simulations further show that our proposed model achieves more accurate imputation especially for continuous variables and better inferential results in some analytic models. The proposed model is applied to a medical data set of acute stroke patients in Australia.
READ FULL TEXT