Natural data structure extracted from neighborhood-similarity graphs

02/15/2018
by   Tom Lorimer, et al.
0

'Big' high-dimensional data are commonly analyzed in low-dimensions, after performing a dimensionality-reduction step that inherently distorts the data structure. For the same purpose, clustering methods are also often used. These methods also introduce a bias, either by starting from the assumption of a particular geometric form of the clusters, or by using iterative schemes to enhance cluster contours, with uncontrollable consequences. The goal of data analysis should, however, be to encode and detect structural data features at all scales and densities simultaneously, without assuming a parametric form of data point distances, or modifying them. We propose a novel approach that directly encodes data point neighborhood similarities as a sparse graph. Our non-iterative framework permits a transparent interpretation of data, without altering the original data dimension and metric. Several natural and synthetic data applications demonstrate the efficacy of our novel approach.

READ FULL TEXT

page 1

page 11

page 12

page 13

page 14

page 19

page 20

page 21

research
01/09/2020

Supervised Discriminative Sparse PCA with Adaptive Neighbors for Dimensionality Reduction

Dimensionality reduction is an important operation in information visual...
research
02/18/2022

Incorporating Texture Information into Dimensionality Reduction for High-Dimensional Images

High-dimensional imaging is becoming increasingly relevant in many field...
research
06/10/2022

Hierarchical mixtures of Gaussians for combined dimensionality reduction and clustering

To avoid the curse of dimensionality, a common approach to clustering hi...
research
05/23/2019

Unsupervised Discovery of Temporal Structure in Noisy Data with Dynamical Components Analysis

Linear dimensionality reduction methods are commonly used to extract low...
research
10/10/2017

Combinatorial and Asymptotical Results on the Neighborhood Grid

In 2009, Joselli et al introduced the Neighborhood Grid data structure f...
research
08/29/2016

Robust Discriminative Clustering with Sparse Regularizers

Clustering high-dimensional data often requires some form of dimensional...
research
10/31/2018

Unsupervised Dimension Selection using a Blue Noise Spectrum

Unsupervised dimension selection is an important problem that seeks to r...

Please sign up or login with your details

Forgot password? Click here to reset