Learning the effect of latent variables in Gaussian Graphical models with unobserved variables

07/20/2018
by   Marina Vinyes, et al.
0

The edge structure of the graph defining an undirected graphical model describes precisely the structure of dependence between the variables in the graph. In many applications, the dependence structure is unknown and it is desirable to learn it from data, often because it is a preliminary step to be able to ascertain causal effects. This problem, known as structure learning, is hard in general, but for Gaussian graphical models it is slightly easier because the structure of the graph is given by the sparsity pattern of the precision matrix of the joint distribution, and because independence coincides with decorrelation. A major difficulty too often ignored in structure learning is the fact that if some variables are not observed, the marginal dependence graph over the observed variables will possibly be significantly more complex and no longer reflect the direct dependencies that are potentially associated with causal effects. In this work, we consider a family of latent variable Gaussian graphical models in which the graph of the joint distribution between observed and unobserved variables is sparse, and the unobserved variables are conditionally independent given the others. Prior work was able to recover the connectivity between observed variables, but could only identify the subspace spanned by unobserved variables, whereas we propose a convex optimization formulation based on structured matrix sparsity to estimate the complete connectivity of the complete graph including unobserved variables, given the knowledge of the number of missing variables, and a priori knowledge of their level of connectivity. Our formulation is supported by a theoretical result of identifiability of the latent dependence structure for sparse graphs in the infinite data limit. We propose an algorithm leveraging recent active set methods, which performs well in the experiments on synthetic data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/19/2020

Learning Exponential Family Graphical Models with Latent Variables using Regularized Conditional Likelihood

Fitting a graphical model to a collection of random variables given samp...
research
07/28/2020

Accounting for missing actors in interaction network inference from abundance data

Network inference aims at unraveling the dependency structure relating j...
research
06/13/2018

High-Dimensional Inference for Cluster-Based Graphical Models

Motivated by modern applications in which one constructs graphical model...
research
11/02/2017

Bayesian latent Gaussian graphical models for mixed data with marginal prior information

Associations between variables of mixed types are of interest in a varie...
research
11/02/2017

Beyond normality: Learning sparse probabilistic graphical models in the non-Gaussian setting

We present an algorithm to identify sparse dependence structure in conti...
research
10/16/2012

Latent Composite Likelihood Learning for the Structured Canonical Correlation Model

Latent variable models are used to estimate variables of interest quanti...
research
01/07/2021

Identification of Latent Variables From Graphical Model Residuals

Graph-based causal discovery methods aim to capture conditional independ...

Please sign up or login with your details

Forgot password? Click here to reset