Leave-one-out Singular Subspace Perturbation Analysis for Spectral Clustering

05/30/2022
by Anderson Y. Zhang, et al.

Singular subspace perturbation theory is of fundamental importance in probability and statistics, with applications across many fields. We consider two arbitrary matrices, one of which is a leave-one-column-out submatrix of the other, and establish a novel upper bound on the distance between their corresponding singular subspaces. The bound is well suited to mixture models and yields a sharper and finer statistical analysis than classical perturbation bounds such as Wedin's Theorem. Building on this leave-one-out perturbation theory, we provide a deterministic entrywise analysis of the performance of spectral clustering under mixture models. Our analysis leads to an explicit exponential error rate for clustering sub-Gaussian mixture models. For mixtures of isotropic Gaussians, the rate is optimal under a weaker signal-to-noise condition than that of Löffler et al. (2021).
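As a rough illustration of the two objects the abstract refers to, the sketch below computes (a) a distance between the top-k left singular subspace of a data matrix and that of its leave-one-column-out submatrix, and (b) a basic spectral clustering of the columns. Everything specific here is an assumption made for illustration and is not taken from the paper: the Frobenius distance between projectors as the subspace distance, the farthest-point initialization for k-means, the helper names, and the simulated two-component isotropic Gaussian mixture with its dimensions and centers.

```python
import numpy as np

def top_left_singular_subspace(X, k):
    """Return a p x k orthonormal basis of the top-k left singular subspace of X."""
    U, _, _ = np.linalg.svd(X, full_matrices=False)
    return U[:, :k]

def subspace_distance(U, V):
    """Frobenius distance between the projectors onto span(U) and span(V) (assumed metric)."""
    return np.linalg.norm(U @ U.T - V @ V.T, ord="fro")

def spectral_clustering(X, k, n_iter=50):
    """Project the columns of X onto the top-k left singular subspace, then run Lloyd's k-means."""
    U = top_left_singular_subspace(X, k)
    Y = (U.T @ X).T  # n x k matrix of projected data points
    # Deterministic farthest-point initialization (an illustrative choice).
    centers = [Y[0]]
    for _ in range(1, k):
        d = np.min([np.sum((Y - c) ** 2, axis=1) for c in centers], axis=0)
        centers.append(Y[np.argmax(d)])
    centers = np.array(centers)
    for _ in range(n_iter):
        dists = ((Y[:, None, :] - centers[None, :, :]) ** 2).sum(axis=2)
        labels = dists.argmin(axis=1)
        for j in range(k):
            if np.any(labels == j):
                centers[j] = Y[labels == j].mean(axis=0)
    return labels

# Simulated isotropic Gaussian mixture; columns of X are the data points (all values hypothetical).
rng = np.random.default_rng(0)
p, n, k = 20, 200, 2
mix_centers = np.zeros((p, k))
mix_centers[0, 0] = 6.0
mix_centers[1, 1] = 6.0
z = rng.integers(0, k, size=n)                    # true cluster labels
X = mix_centers[:, z] + rng.standard_normal((p, n))

# Leave-one-column-out comparison for a single column index i.
i = 0
U_full = top_left_singular_subspace(X, k)
U_loo = top_left_singular_subspace(np.delete(X, i, axis=1), k)
loo_dist = subspace_distance(U_full, U_loo)

# Misclustering proportion, minimized over the two possible label permutations.
labels = spectral_clustering(X, k)
err = min(np.mean(labels != z), np.mean(labels != 1 - z))
print(f"leave-one-out subspace distance: {loo_dist:.4f}, misclustering rate: {err:.4f}")
```

The leave-one-out comparison is repeated for only one column here; looping `i` over all columns would give the full collection of distances that a leave-one-out analysis reasons about.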


