RULLS: Randomized Union of Locally Linear Subspaces for Feature Engineering

04/25/2018
by Namita Lokare, et al.

Feature engineering plays an important role in the success of a machine learning model. Most of the effort in training a model goes into data preparation and choosing the right representation. In this paper, we propose a robust feature engineering method, Randomized Union of Locally Linear Subspaces (RULLS). We generate sparse, non-negative, and rotation-invariant features in an unsupervised fashion. RULLS aggregates features from a random union of subspaces by describing each point using globally chosen landmarks. These landmarks serve as anchor points for choosing subspaces. Our method provides a way to select features that are relevant in the neighborhood around these chosen landmarks. Distances from each data point to its k closest landmarks are encoded in the feature matrix. The final feature representation is a union of features from all chosen subspaces. We demonstrate the effectiveness of our algorithm on various real-world datasets for clustering and classification tasks, both on raw data and in the presence of noise, and we compare our method with existing feature generation methods. The results show that our method performs well on both classification and clustering tasks.
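The encoding step described in the abstract can be illustrated with a minimal sketch. It is not the authors' implementation: the abstract does not say how subspaces or landmarks are chosen, so the sketch assumes random coordinate subsets as stand-ins for subspaces and uniformly sampled data points as landmarks. It does not attempt the local feature selection around landmarks, and it makes no claim to reproduce the rotation-invariance property. The function name `rulls_like_features` and all of its parameters are hypothetical.

```python
import numpy as np


def rulls_like_features(X, n_subspaces=4, subspace_dim=2,
                        n_landmarks=5, k=2, seed=0):
    """Sparse, non-negative landmark-distance features (illustrative only).

    Assumptions not taken from the source: subspaces are random
    coordinate subsets, and landmarks are data points sampled uniformly.
    """
    X = np.asarray(X, dtype=float)
    rng = np.random.default_rng(seed)
    n, d = X.shape
    rows = np.arange(n)[:, None]
    blocks = []
    for _ in range(n_subspaces):
        # Assumed stand-in for a "subspace": a random subset of coordinates.
        dims = rng.choice(d, size=subspace_dim, replace=False)
        # Assumed landmark choice: data points sampled without replacement.
        landmark_idx = rng.choice(n, size=n_landmarks, replace=False)
        Z = X[:, dims]
        landmarks = Z[landmark_idx]
        # Euclidean distance from every point to every landmark.
        dist = np.linalg.norm(Z[:, None, :] - landmarks[None, :, :], axis=2)
        # Keep only the k closest landmarks per point; the rest stay zero.
        nearest = np.argpartition(dist, k - 1, axis=1)[:, :k]
        block = np.zeros_like(dist)
        block[rows, nearest] = dist[rows, nearest]
        blocks.append(block)
    # The final representation concatenates the blocks from all subspaces.
    return np.hstack(blocks)


if __name__ == "__main__":
    data = np.random.default_rng(1).normal(size=(20, 6))
    features = rulls_like_features(data)
    print(features.shape)
```

Each block has one column per landmark and at most k non-zero entries per row, which is where the sparsity comes from; distances are non-negative by construction.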


research
01/04/2023

Unsupervised Manifold Linearizing and Clustering

Clustering data lying close to a union of low-dimensional manifolds, wit...
research
12/21/2015

Multilinear Subspace Clustering

In this paper we present a new model and an algorithm for unsupervised c...
research
06/11/2022

Convergence and Recovery Guarantees of the K-Subspaces Method for Subspace Clustering

The K-subspaces (KSS) method is a generalization of the K-means method f...
research
07/25/2019

Theory of Spectral Method for Union of Subspaces-Based Random Geometry Graph

Spectral Method is a commonly used scheme to cluster data points lying c...
research
06/07/2020

Self-Representation Based Unsupervised Exemplar Selection in a Union of Subspaces

Finding a small set of representatives from an unlabeled dataset is a co...
research
07/10/2020

Affine Non-negative Collaborative Representation Based Pattern Classification

During the past decade, representation-based classification methods have...
research
01/20/2018

Side Information for Face Completion: a Robust PCA Approach

Robust principal component analysis (RPCA) is a powerful method for lear...
