Estimating Mutual Information via Geodesic kNN

10/26/2021
by   Alexander Marx, et al.
0

Estimating mutual information (MI) between two continuous random variables X and Y allows to capture non-linear dependencies between them, non-parametrically. As such, MI estimation lies at the core of many data science applications. Yet, robustly estimating MI for high-dimensional X and Y is still an open research question. In this paper, we formulate this problem through the lens of manifold learning. That is, we leverage the common assumption that the information of X and Y is captured by a low-dimensional manifold embedded in the observed high-dimensional space and transfer it to MI estimation. As an extension to state-of-the-art kNN estimators, we propose to determine the k-nearest neighbours via geodesic distances on this manifold rather than form the ambient space, which allows us to estimate MI even in the high-dimensional setting. An empirical evaluation of our method, G-KSG, against the state-of-the-art shows that it yields good estimations of the MI in classical benchmark, and manifold tasks, even for high dimensional datasets, which none of the existing methods can provide.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/05/2020

DEMI: Discriminative Estimator of Mutual Information

Estimating mutual information between continuous random variables is oft...
research
06/29/2014

Estimating the distribution of Galaxy Morphologies on a continuous space

The incredible variety of galaxy shapes cannot be summarized by human de...
research
05/06/2020

Regularized Estimation of Information via High Dimensional Canonical Correlation Analysis

In recent years, there has been an upswing of interest in estimating inf...
research
11/20/2022

Diffeomorphic Information Neural Estimation

Mutual Information (MI) and Conditional Mutual Information (CMI) are mul...
research
02/19/2018

Entropy-Isomap: Manifold Learning for High-dimensional Dynamic Processes

Scientific and engineering processes produce massive high-dimensional da...
research
06/18/2019

Estimating a Manifold from a Tangent Bundle Learner

Manifold hypotheses are typically used for tasks such as dimensionality ...
research
01/13/2021

Estimating Conditional Mutual Information for Discrete-Continuous Mixtures using Multi-Dimensional Adaptive Histograms

Estimating conditional mutual information (CMI) is an essential yet chal...

Please sign up or login with your details

Forgot password? Click here to reset