A Unified Probabilistic Model for Learning Latent Factors and Their Connectivities from High-Dimensional Data

by   Ricardo Pio Monti, et al.

Connectivity estimation is challenging in the context of high-dimensional data. A useful preprocessing step is to group variables into clusters, however, it is not always clear how to do so from the perspective of connectivity estimation. Another practical challenge is that we may have data from multiple related classes (e.g., multiple subjects or conditions) and wish to incorporate constraints on the similarities across classes. We propose a probabilistic model which simultaneously performs both a grouping of variables (i.e., detecting community structure) and estimation of connectivities between the groups which correspond to latent variables. The model is essentially a factor analysis model where the factors are allowed to have arbitrary correlations, while the factor loading matrix is constrained to express a community structure. The model can be applied on multiple classes so that the connectivities can be different between the classes, while the community structure is the same for all classes. We propose an efficient estimation algorithm based on score matching, and prove the identifiability of the model. Finally, we present an extension to directed (causal) connectivities over latent variables. Simulations and experiments on fMRI data validate the practical utility of the method.


page 1

page 2

page 3

page 4


Learning Latent Causal Structures with a Redundant Input Neural Network

Most causal discovery algorithms find causal structure among a set of ob...

Inferring Covariance Structure from Multiple Data Sources via Subspace Factor Analysis

Factor analysis provides a canonical framework for imposing lower-dimens...

Dimensionality Detection and Integration of Multiple Data Sources via the GP-LVM

The Gaussian Process Latent Variable Model (GP-LVM) is a non-linear prob...

Group Factor Analysis

Factor analysis provides linear factors that describe relationships betw...

Neural Ideal Point Estimation Network

Understanding politics is challenging because the politics take the infl...

A Differential Evolution-Enhanced Latent Factor Analysis Model for High-dimensional and Sparse Data

High-dimensional and sparse (HiDS) matrices are frequently adopted to de...

Multilevel latent class analysis with covariates: Analysis of cross-national citizenship norms with a two-stage approach

This paper focuses on the substantive application of multilevel LCA to t...

Please sign up or login with your details

Forgot password? Click here to reset