A model-agnostic hypothesis test for community structure and homophily in networks

07/08/2021
by   Eric Yanchenko, et al.
0

Networks continue to be of great interest to statisticians, with an emphasis on community detection. Less work, however, has addressed this question: given some network, does it exhibit meaningful community structure? We propose to answer this question in a principled manner by framing it as a statistical hypothesis in terms of a formal and model-agnostic homophily metric. Homophily is a well-studied network property where intra-community edges are more likely than between-community edges. We use the homophily metric to identify and distinguish between three concepts: nominal, collateral, and intrinsic homophily. We propose a simple and interpretable test statistic leveraging this homophily parameter and formulate both asymptotic and bootstrap-based rejection thresholds. We prove its asymptotic properties and demonstrate it outperforms benchmark methods on both simulated and real world data. Furthermore, the proposed method yields rich, provocative insights on classic data sets; namely, that meany well-studied networks do not actually have intrinsic homophily.

READ FULL TEXT

page 19

page 22

research
03/03/2022

Modularity of the ABCD Random Graph Model with Community Structure

The Artificial Benchmark for Community Detection (ABCD) graph is a rando...
research
12/16/2018

Community Detection with Dependent Connectivity

In network analysis, within-community members are more likely to be conn...
research
03/12/2014

Efficiently inferring community structure in bipartite networks

Bipartite networks are a common type of network data in which there are ...
research
11/14/2018

SCORE+ for Network Community Detection

SCORE is a recent approach to network community detection proposed by Ji...
research
01/13/2023

Artificial Benchmark for Community Detection with Outliers (ABCD+o)

The Artificial Benchmark for Community Detection graph (ABCD) is a rando...
research
04/09/2020

The Asymptotic Distribution of Modularity in Weighted Signed Networks

Modularity is a popular metric for quantifying the degree of community s...
research
12/29/2020

Resolution limit revisited: community detection using generalized modularity density

Various attempts have been made in recent years to solve the Resolution ...

Please sign up or login with your details

Forgot password? Click here to reset