A Theoretical Analysis of Noisy Sparse Subspace Clustering on Dimensionality-Reduced Data

10/24/2016
by   Yining Wang, et al.
0

Subspace clustering is the problem of partitioning unlabeled data points into a number of clusters so that data points within one cluster lie approximately on a low-dimensional linear subspace. In many practical scenarios, the dimensionality of data points to be clustered are compressed due to constraints of measurement, computation or privacy. In this paper, we study the theoretical properties of a popular subspace clustering algorithm named sparse subspace clustering (SSC) and establish formal success conditions of SSC on dimensionality-reduced data. Our analysis applies to the most general fully deterministic model where both underlying subspaces and data points within each subspace are deterministically positioned, and also a wide range of dimensionality reduction techniques (e.g., Gaussian random projection, uniform subsampling, sketching) that fall into a subspace embedding framework (Meng & Mahoney, 2013; Avron et al., 2014). Finally, we apply our analysis to a differentially private SSC algorithm and established both privacy and utility guarantees of the proposed method.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/27/2014

Subspace clustering of dimensionality-reduced data

Subspace clustering refers to the problem of clustering unlabeled high-d...
research
09/16/2016

Unbiased Sparse Subspace Clustering By Selective Pursuit

Sparse subspace clustering (SSC) is an elegant approach for unsupervised...
research
07/12/2018

Turning Big data into tiny data: Constant-size coresets for k-means, PCA and projective clustering

We develop and analyze a method to reduce the size of a very large set o...
research
03/05/2018

An Analysis of the t-SNE Algorithm for Data Visualization

A first line of attack in exploratory data analysis is data visualizatio...
research
06/22/2022

Noisy ℓ^0-Sparse Subspace Clustering on Dimensionality Reduced Data

Sparse subspace clustering methods with sparsity induced by ℓ^0-norm, su...
research
05/27/2016

Achieving stable subspace clustering by post-processing generic clustering results

We propose an effective subspace selection scheme as a post-processing s...
research
08/29/2016

Robust Discriminative Clustering with Sparse Regularizers

Clustering high-dimensional data often requires some form of dimensional...

Please sign up or login with your details

Forgot password? Click here to reset