Approximating Spectral Clustering via Sampling: a Review

01/29/2019
by Nicolas Tremblay, et al.

Spectral clustering refers to a family of unsupervised learning algorithms that compute a spectral embedding of the original data based on the eigenvectors of a similarity graph. This non-linear transformation of the data is both the key to these algorithms' success and their Achilles heel: forming a graph and computing its dominant eigenvectors can be computationally prohibitive when dealing with more than a few tens of thousands of points. In this paper, we review the principal research efforts aiming to reduce this computational cost. We focus on methods that come with a theoretical control on the clustering performance and incorporate some form of sampling in their operation. Such methods abound in the machine learning, numerical linear algebra, and graph signal processing literature and include, amongst others, Nyström approximation, landmarks, coarsening, coresets, and compressive spectral clustering. We present the approximation guarantees available for each and discuss practical merits and limitations. Surprisingly, despite the breadth of the literature explored, we conclude that there is still a gap between theory and practice: the most scalable methods are only intuitively motivated or loosely controlled, whereas those that come with end-to-end guarantees rely on strong assumptions or enable only a limited gain in computation time.
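To make the cost described above concrete, here is a minimal NumPy sketch of plain (unaccelerated) spectral clustering. It is not taken from the paper. It assumes one common variant: a dense Gaussian similarity graph, the symmetric normalized Laplacian, row-normalized eigenvectors, and a small k-means step. The function names `spectral_embedding` and `kmeans`, the bandwidth parameter `sigma`, and the farthest-point initialization are illustrative choices only.

```python
import numpy as np


def spectral_embedding(X, k, sigma=1.0):
    """Return an n x k spectral embedding of the rows of X (illustrative sketch)."""
    # Dense Gaussian similarity graph: O(n^2) time and memory.
    sq_dists = np.sum((X[:, None, :] - X[None, :, :]) ** 2, axis=-1)
    W = np.exp(-sq_dists / (2.0 * sigma ** 2))
    np.fill_diagonal(W, 0.0)

    # Symmetric normalized Laplacian L = I - D^{-1/2} W D^{-1/2}.
    d_inv_sqrt = 1.0 / np.sqrt(W.sum(axis=1))
    L = np.eye(len(X)) - d_inv_sqrt[:, None] * W * d_inv_sqrt[None, :]

    # Full eigendecomposition: O(n^3). eigh returns eigenvalues in ascending
    # order, so the first k columns belong to the k smallest eigenvalues.
    _, vecs = np.linalg.eigh(L)
    U = vecs[:, :k]

    # Row-normalize so each embedded point lies on the unit sphere.
    return U / np.linalg.norm(U, axis=1, keepdims=True)


def kmeans(Y, k, n_iter=50):
    """Lloyd's algorithm with a deterministic farthest-point initialization."""
    centers = [Y[0]]
    for _ in range(1, k):
        # Squared distance from each point to its nearest chosen center.
        d = np.min([np.sum((Y - c) ** 2, axis=1) for c in centers], axis=0)
        centers.append(Y[np.argmax(d)])
    centers = np.array(centers)

    for _ in range(n_iter):
        dists = ((Y[:, None, :] - centers[None, :, :]) ** 2).sum(axis=-1)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = Y[labels == j].mean(axis=0)
    return labels
```

A call such as `kmeans(spectral_embedding(X, k=2), k=2)` returns one cluster label per row of `X`. The quadratic cost of building the graph and the cubic cost of the dense eigendecomposition in this sketch are the two bottlenecks that the sampling-based methods under review aim to reduce.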

research
06/25/2020

Scalable Spectral Clustering with Nystrom Approximation: Practical and Theoretical Aspects

Spectral clustering techniques are valuable tools in signal processing a...
research
02/05/2016

Compressive Spectral Clustering

Spectral clustering has become a popular technique due to its high perfo...
research
06/10/2014

Graph Approximation and Clustering on a Budget

We consider the problem of learning from a similarity matrix (such as sp...
research
04/07/2017

Fast Spectral Clustering Using Autoencoders and Landmarks

In this paper, we introduce an algorithm for performing spectral cluster...
research
09/27/2018

A Note on Spectral Clustering and SVD of Graph Data

Spectral clustering and Singular Value Decomposition (SVD) are both wide...
research
05/16/2023

Spectral Clustering via Orthogonalization-Free Methods

Graph Signal Filter used as dimensionality reduction in spectral cluster...
research
06/07/2020

Unsupervised Learning for Subterranean Junction Recognition Based on 2D Point Cloud

This article proposes a novel unsupervised learning framework for detect...
