Impact of regularization on Spectral Clustering

12/05/2013
by   Antony Joseph, et al.
0

The performance of spectral clustering can be considerably improved via regularization, as demonstrated empirically in Amini et. al (2012). Here, we provide an attempt at quantifying this improvement through theoretical analysis. Under the stochastic block model (SBM), and its extensions, previous results on spectral clustering relied on the minimum degree of the graph being sufficiently large for its good performance. By examining the scenario where the regularization parameter τ is large we show that the minimum degree assumption can potentially be removed. As a special case, for an SBM with two blocks, the results require the maximum degree to be large (grow faster than n) as opposed to the minimum degree. More importantly, we show the usefulness of regularization in situations where not all nodes belong to well-defined clusters. Our results rely on a `bias-variance'-like trade-off that arises from understanding the concentration of the sample Laplacian and the eigen gap as a function of the regularization parameter. As a byproduct of our bounds, we propose a data-driven technique DKest (standing for estimated Davis-Kahan bounds) for choosing the regularization parameter. This technique is shown to work well through simulations and on a real data set.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/16/2013

Regularized Spectral Clustering under the Degree-Corrected Stochastic Blockmodel

Spectral clustering is a fast and popular algorithm for finding clusters...
research
06/05/2018

Understanding Regularized Spectral Clustering via Graph Conductance

This paper uses the relationship between graph conductance and spectral ...
research
02/07/2020

Sparse and Smooth: improved guarantees for Spectral Clustering in the Dynamic Stochastic Block Model

In this paper, we analyse classical variants of the Spectral Clustering ...
research
07/30/2021

Impact of regularization on spectral clustering under the mixed membership stochastic block model

Mixed membership community detection is a challenge problem in network a...
research
12/03/2019

Optimal Laplacian regularization for sparse spectral community detection

Regularization of the classical Laplacian matrices was empirically shown...
research
06/10/2014

Graph Approximation and Clustering on a Budget

We consider the problem of learning from a similarity matrix (such as sp...
research
12/23/2019

Spectral embedding of regularized block models

Spectral embedding is a popular technique for the representation of grap...

Please sign up or login with your details

Forgot password? Click here to reset