Manifold Alignment with Feature Correspondence

09/30/2018
by Jay S. Stanley III, et al.

We propose a novel framework for combining datasets via alignment of their associated intrinsic dimensions. Our approach assumes that two datasets are sampled from a common latent space, i.e., they measure equivalent systems. Thus, we expect there to exist a natural (albeit unknown) alignment of the data manifolds associated with the intrinsic geometry of these datasets, which are perturbed by measurement artifacts in the sampling process. Importantly, we do not assume any individual correspondence (partial or complete) between data points. Instead, we rely on the assumption that a subset of data features corresponds across datasets. We leverage this assumption to estimate relations between intrinsic manifold dimensions, which are given by diffusion map coordinates over each of the datasets. We compute a correlation matrix between the diffusion coordinates of the two datasets by considering graph (or manifold) Fourier coefficients of corresponding data features. We then orthogonalize this correlation matrix to form an isometric transformation between the diffusion maps of the datasets. Finally, we apply this transformation to the diffusion coordinates and construct a unified diffusion geometry of the datasets together. We show that this approach successfully corrects misalignment artifacts and enables data integration.
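The steps described in the abstract can be illustrated with a minimal NumPy sketch. This is a simplified illustration under stated assumptions, not the authors' implementation: the function names `diffusion_basis` and `align` are hypothetical, the Gaussian kernel with a median-distance bandwidth is an assumed choice, every feature column is assumed to correspond across the two datasets, and the orthogonalization is done with an SVD (the polar factor of the correlation matrix). The toy data is a row-shuffled copy of one dataset, so no point correspondence is used.

```python
import numpy as np

def diffusion_basis(X, k):
    """Return k nontrivial graph Fourier vectors and their eigenvalues.

    Assumption: Gaussian kernel with median squared distance as bandwidth.
    """
    sq = ((X[:, None, :] - X[None, :, :]) ** 2).sum(axis=-1)
    K = np.exp(-sq / np.median(sq))
    d = K.sum(axis=1)
    A = K / np.sqrt(np.outer(d, d))        # symmetric normalized kernel
    lam, V = np.linalg.eigh(A)             # eigenvalues in ascending order
    order = np.argsort(lam)[::-1][1:k + 1] # skip the trivial top eigenvector
    return V[:, order], lam[order]

def align(X1, X2, k=3):
    """Align the diffusion coordinates of X2 to those of X1 (hypothetical helper)."""
    psi1, lam1 = diffusion_basis(X1, k)
    psi2, lam2 = diffusion_basis(X2, k)
    # Graph Fourier coefficients of each (corresponding) feature column.
    F1 = psi1.T @ X1                       # shape (k, n_features)
    F2 = psi2.T @ X2
    # Correlation between diffusion coordinates of the two datasets.
    M = F1 @ F2.T                          # shape (k, k)
    # Orthogonalize: the nearest orthogonal matrix to M is U @ Vt.
    U, _, Vt = np.linalg.svd(M)
    T = U @ Vt
    E1 = psi1 * lam1                       # diffusion coordinates of X1
    E2 = (psi2 * lam2) @ T.T               # X2 coordinates mapped into X1's basis
    return E1, E2, T

# Toy example: X2 holds the same samples as X1 in shuffled row order.
rng = np.random.default_rng(0)
X1 = rng.normal(size=(40, 5))
perm = rng.permutation(40)
X2 = X1[perm]

E1, E2, T = align(X1, X2, k=3)
unified = np.vstack([E1, E2])              # joint embedding of both datasets
```

Because `T` is orthogonal, applying it preserves distances within the second dataset's diffusion coordinates while rotating and reflecting them into the first dataset's coordinate system; stacking the two coordinate sets then gives one simple way to form a joint embedding.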

