Subspace Clustering with Missing and Corrupted Data

07/08/2017
by   Zachary Charles, et al.
0

Subspace clustering is the process of identifying a union of subspaces model underlying a collection of samples and determining which sample belongs to which subspace. A popular approach, sparse subspace clustering (SSC), represents each sample as a weighted combination of the other samples, and then uses those learned weights to cluster the samples. SSC has been shown to be stable in settings where each sample is contaminated by a relatively small amount of noise. However, when a subset of entries in each sample is corrupted by significant noise or even unobserved, providing guarantees for subspace clustering remains an open problem. Instead of analyzing commonly used versions of SSC in the context of missing data, this paper describes a robust variant, mean absolute deviation sparse subspace clustering (MAD-SSC), and characterizes the conditions under which it provably correctly clusters all of the observed samples, even in the presence of noisy or missing data. MAD-SSC is efficiently solvable by linear programming. We show that MAD-SSC performs as predicted by the theoretical guarantees and that it performs comparably to a widely-used variant of SSC in the context of missing data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
09/09/2015

Shape Interaction Matrix Revisited and Robustified: Efficient Subspace Clustering with Corrupted and Incomplete Data

The Shape Interaction Matrix (SIM) is one of the earliest approaches to ...
research
07/11/2016

On Deterministic Conditions for Subspace Clustering under Missing Data

In this paper we present deterministic conditions for success of sparse ...
research
01/01/2018

Theoretical Analysis of Sparse Subspace Clustering with Missing Entries

Sparse Subspace Clustering (SSC) is a popular unsupervised machine learn...
research
10/06/2018

Subspace Tracking from Missing and Outlier Corrupted Data

We study the related problems of subspace tracking in the presence of mi...
research
08/02/2018

Fusion Subspace Clustering: Full and Incomplete Data

Modern inference and learning often hinge on identifying low-dimensional...
research
05/22/2022

Fusion Subspace Clustering for Incomplete Data

This paper introduces fusion subspace clustering, a novel method to lear...
research
06/07/2013

Fast greedy algorithm for subspace clustering from corrupted and incomplete data

We describe the Fast Greedy Sparse Subspace Clustering (FGSSC) algorithm...

Please sign up or login with your details

Forgot password? Click here to reset