The Infinite Degree Corrected Stochastic Block Model

11/11/2013
by Tue Herlau, et al.

Stochastic blockmodels are among the most prominent statistical models for cluster analysis of complex networks. In these models, clusters are defined as groups of nodes with statistically similar link probabilities within and between groups. A recent extension by Karrer and Newman incorporates a node degree correction to model degree heterogeneity within each group. Although this demonstrably leads to better performance on several networks, it is not obvious whether modelling node degree is always appropriate or necessary. We formulate the degree corrected stochastic blockmodel as a non-parametric Bayesian model, incorporating a parameter that controls the amount of degree correction and can be inferred from data. Additionally, our formulation yields principled ways of inferring the number of groups and of predicting missing links in the network, which can be used to quantify the model's predictive performance. On synthetic data we demonstrate that including the degree correction yields better performance, both in recovering the true group structure and in predicting missing links, when degree heterogeneity is present, whereas performance is on par for data with no degree heterogeneity within clusters. On seven real networks (with no ground truth group structure available) we show that predictive performance is about equal whether or not degree correction is included; however, for some networks significantly fewer clusters are discovered when correcting for degree, indicating that the data can be more compactly explained by clusters of heterogeneous degree nodes.
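To make the idea of a degree correction concrete, the following is a minimal generative sketch, not the paper's non-parametric Bayesian formulation. It assumes the Poisson form commonly associated with Karrer and Newman, in which the expected number of links between nodes i and j is theta_i * theta_j * omega[z_i, z_j]. The function names, the Dirichlet draw for theta, the normalisation of theta to the group size, and the parameter name `gamma` are all illustrative assumptions; here a large `gamma` gives nearly equal degrees within a group and a small `gamma` gives strong degree heterogeneity.

```python
import numpy as np


def group_normalised_theta(z, gamma, rng):
    """Draw per-node degree weights that sum to the group size within each group.

    Assumption for illustration: weights come from a symmetric Dirichlet with
    concentration `gamma`, so large gamma pushes every weight towards 1.
    """
    z = np.asarray(z)
    theta = np.empty(z.size)
    for k in np.unique(z):
        members = np.flatnonzero(z == k)
        theta[members] = members.size * rng.dirichlet(np.full(members.size, gamma))
    return theta


def sample_dcsbm(z, omega, theta, rng):
    """Sample a symmetric link-count matrix with no self-loops."""
    z = np.asarray(z)
    # Expected link count for each pair: theta_i * theta_j * omega[z_i, z_j]
    rate = np.outer(theta, theta) * omega[np.ix_(z, z)]
    upper = np.triu(rng.poisson(rate), k=1)  # keep pairs with i < j only
    return upper + upper.T


rng = np.random.default_rng(0)
z = np.repeat([0, 1], 50)                      # two groups of 50 nodes
omega = np.array([[0.20, 0.02], [0.02, 0.20]])  # between/within group rates
theta = group_normalised_theta(z, gamma=1.0, rng=rng)
A = sample_dcsbm(z, omega, theta, rng)
degrees = A.sum(axis=1)
```

Setting every entry of `theta` to 1 recovers a plain (Poisson) stochastic blockmodel, which is one way to compare the two settings on synthetic data.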


research
05/21/2012

Latent Multi-group Membership Graph Model

We develop the Latent Multi-group Membership Graph (LMMG) model, a model...
research
07/17/2012

Model Selection for Degree-corrected Block Models

The proliferation of models for networks raises challenging problems of ...
research
11/02/2015

An Impossibility Result for Reconstruction in a Degree-Corrected Planted-Partition Model

We consider a Degree-Corrected Planted-Partition model: a random graph o...
research
12/28/2015

Convexified Modularity Maximization for Degree-corrected Stochastic Block Models

The stochastic block model (SBM) is a popular framework for studying com...
research
02/19/2019

A primer on statistically validated networks

In this contribution we discuss some approaches of network analysis prov...
research
06/27/2012

An Infinite Latent Attribute Model for Network Data

Latent variable models for network data extract a summary of the relatio...
research
10/20/2022

The Network Structure of Unequal Diffusion

Social networks affect the diffusion of information, and thus have the p...
