Fiberwise dimensionality reduction of topologically complex data with vector bundles

06/13/2022
by   Luis Scoccola, et al.
0

Datasets with non-trivial large scale topology can be hard to embed in low-dimensional Euclidean space with existing dimensionality reduction algorithms. We propose to model topologically complex datasets using vector bundles, in such a way that the base space accounts for the large scale topology, while the fibers account for the local geometry. This allows one to reduce the dimensionality of the fibers, while preserving the large scale topology. We formalize this point of view, and, as an application, we describe an algorithm which takes as input a dataset together with an initial representation of it in Euclidean space, assumed to recover part of its large scale topology, and outputs a new representation that integrates local representations, obtained through local linear dimensionality reduction, along the initial global representation. We demonstrate this algorithm on examples coming from dynamical systems and chemistry. In these examples, our algorithm is able to learn topologically faithful embeddings of the data in lower target dimension than various well known metric-based dimensionality reduction algorithms.

READ FULL TEXT
research
06/22/2018

Homology-Preserving Dimensionality Reduction via Manifold Landmarking and Tearing

Dimensionality reduction is an integral part of data visualization. It i...
research
04/13/2015

Multiple Measurements and Joint Dimensionality Reduction for Large Scale Image Search with Short Vectors - Extended Version

This paper addresses the construction of a short-vector (128D) image rep...
research
09/13/2023

Latent Representation and Simulation of Markov Processes via Time-Lagged Information Bottleneck

Markov processes are widely used mathematical models for describing dyna...
research
12/13/2022

Dimensionality reduction on complex vector spaces for dynamic weighted Euclidean distance

The weighted Euclidean distance between two vectors is a Euclidean dista...
research
03/04/2020

Visualizing Large-Scale Assessments in Mathematics through Dimensionality Reduction

In this paper, we apply the Logistic PCA (LPCA) as a dimensionality redu...
research
06/20/2023

Unexplainable Explanations: Towards Interpreting tSNE and UMAP Embeddings

It has become standard to explain neural network latent spaces with attr...
research
03/04/2020

Visualizing and Understanding Large-Scale Assessments in Mathematics through Dimensionality Reduction

In this paper, we apply the Logistic PCA (LPCA) as a dimensionality redu...

Please sign up or login with your details

Forgot password? Click here to reset