Diffusion State Distances: Multitemporal Analysis, Fast Algorithms, and Applications to Biological Networks

03/07/2020
by   Lenore Cowen, et al.
0

Data-dependent metrics are powerful tools for learning the underlying structure of high-dimensional data. This article develops and analyzes a data-dependent metric known as diffusion state distance (DSD), which compares points using a data-driven diffusion process. Unlike related diffusion methods, DSDs incorporate information across time scales, which allows for the intrinsic data structure to be inferred in a parameter-free manner. This article develops a theory for DSD based on the multitemporal emergence of mesoscopic equilibria in the underlying diffusion process. New algorithms for denoising and dimension reduction with DSD are also proposed and analyzed. These approaches are based on a weighted spectral decomposition of the underlying diffusion process, and experiments on synthetic datasets and real biological networks illustrate the efficacy of the proposed algorithms in terms of both speed and accuracy. Throughout, comparisons with related methods are made, in order to illustrate the distinct advantages of DSD for datasets exhibiting multiscale structure.

READ FULL TEXT

page 12

page 17

page 18

research
02/25/2021

Diffusion Earth Mover's Distance and Distribution Embeddings

We propose a new fast method of measuring distances between large number...
research
10/15/2018

Learning by Unsupervised Nonlinear Diffusion

This paper proposes and analyzes a novel clustering algorithm that combi...
research
02/22/2023

Aligned Diffusion Schrödinger Bridges

Diffusion Schrödinger bridges (DSB) have recently emerged as a powerful ...
research
03/28/2022

Time-inhomogeneous diffusion geometry and topology

Diffusion condensation is a dynamic process that yields a sequence of mu...
research
01/31/2021

A Multiscale Environment for Learning by Diffusion

Clustering algorithms partition a dataset into groups of similar points....
research
07/07/2023

Fermat Distances: Metric Approximation, Spectral Convergence, and Clustering Algorithms

We analyze the convergence properties of Fermat distances, a family of d...
research
11/06/2019

Spatially regularized active diffusion learning for high-dimensional images

An active learning algorithm for the classification of high-dimensional ...

Please sign up or login with your details

Forgot password? Click here to reset