Pseudo-likelihood methods for community detection in large sparse networks

07/10/2012
by   Arash A. Amini, et al.
0

Many algorithms have been proposed for fitting network models with communities, but most of them do not scale well to large networks, and often fail on sparse networks. Here we propose a new fast pseudo-likelihood method for fitting the stochastic block model for networks, as well as a variant that allows for an arbitrary degree distribution by conditioning on degrees. We show that the algorithms perform well under a range of settings, including on very sparse networks, and illustrate on the example of a network of political blogs. We also propose spectral clustering with perturbations, a method of independent interest, which works well on sparse networks where regular spectral clustering fails, and use it to provide an initial value for pseudo-likelihood. We prove that pseudo-likelihood provides consistent estimates of the communities under a mild condition on the starting value, for the case of a block model with two communities.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/01/2020

Fast Network Community Detection with Profile-Pseudo Likelihood Methods

The stochastic block model is one of the most studied network models for...
research
09/04/2018

Determining the Number of Communities in Degree-corrected Stochastic Block Models

We propose to estimate the number of communities in degree-corrected sto...
research
06/07/2018

Stochastic Block Models are a Discrete Surface Tension

Networks, which represent agents and interactions between them, arise in...
research
08/18/2017

Two provably consistent divide and conquer clustering algorithms for large networks

In this article, we advance divide-and-conquer strategies for solving th...
research
06/21/2014

On semidefinite relaxations for the block model

The stochastic block model (SBM) is a popular tool for community detecti...
research
08/02/2019

Exact joint likelihood of pseudo-C_ℓ estimates from correlated Gaussian cosmological fields

We present the exact joint likelihood of pseudo-C_ℓ power spectrum estim...
research
10/02/2018

Hierarchical community detection by recursive bi-partitioning

The problem of community detection in networks is usually formulated as ...

Please sign up or login with your details

Forgot password? Click here to reset