Learning linear structural equation models in polynomial time and sample complexity

07/15/2017
by   Asish Ghoshal, et al.
0

The problem of learning structural equation models (SEMs) from data is a fundamental problem in causal inference. We develop a new algorithm --- which is computationally and statistically efficient and works in the high-dimensional regime --- for learning linear SEMs from purely observational data with arbitrary noise distribution. We consider three aspects of the problem: identifiability, computational efficiency, and statistical efficiency. We show that when data is generated from a linear SEM over p nodes and maximum degree d, our algorithm recovers the directed acyclic graph (DAG) structure of the SEM under an identifiability condition that is more general than those considered in the literature, and without faithfulness assumptions. In the population setting, our algorithm recovers the DAG structure in O(p(d^2 + p)) operations. In the finite sample setting, if the estimated precision matrix is sparse, our algorithm has a smoothed complexity of O(p^3 + pd^7), while if the estimated precision matrix is dense, our algorithm has a smoothed complexity of O(p^5). For sub-Gaussian noise, we show that our algorithm has a sample complexity of O(d^8/ε^2 (p/√(δ))) to achieve ε element-wise additive error with respect to the true autoregression matrix with probability at most 1 - δ, while for noise with bounded (4m)-th moment, with m being a positive integer, our algorithm has a sample complexity of O(d^8/ε^2 (p^2/δ)^1/m).

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/25/2022

Optimal estimation of Gaussian DAG models

We study the optimal sample complexity of learning a Gaussian directed a...
research
07/22/2021

Linear Polytree Structural Equation Models: Structural Learning and Inverse Correlation Estimation

We are interested in the problem of learning the directed acyclic graph ...
research
07/08/2021

Causal Structural Learning Via Local Graphs

We consider the problem of learning causal structures in sparse high-dim...
research
11/14/2013

High-dimensional learning of linear causal networks via inverse covariance estimation

We establish a new framework for statistical estimation of directed acyc...
research
06/04/2023

Optimal neighbourhood selection in structural equation models

We study the optimal sample complexity of neighbourhood selection in lin...
research
06/04/2020

Learning DAGs without imposing acyclicity

We explore if it is possible to learn a directed acyclic graph (DAG) fro...
research
10/05/2018

High-Dimensional Poisson DAG Model Learning Using ℓ_1-Regularized Regression

In this paper we develop a new approach for learning high-dimensional Po...

Please sign up or login with your details

Forgot password? Click here to reset