The information-geometric perspective of Compositional Data Analysis

05/23/2020
by   Ionas Erb, et al.
0

Information geometry uses the formal tools of differential geometry to describe the space of probability distributions as a Riemannian manifold with an additional dual structure. The formal equivalence of compositional data with discrete probability distributions makes it possible to apply the same description to the sample space of Compositional Data Analysis (CoDA). The latter has been formally described as a Euclidean space with an orthonormal basis featuring components that are suitable combinations of the original parts. In contrast to the Euclidean metric, the information-geometric description singles out the Fisher information metric as the only one keeping the manifold's geometric structure invariant under equivalent representations of the underlying random variables. Well-known concepts that are valid in Euclidean coordinates, e.g., the Pythogorean theorem, are generalized by information geometry to corresponding notions that hold for more general coordinates. In briefly reviewing Euclidean CoDA and, in more detail, the information-geometric approach, we show how the latter justifies the use of distance measures and divergences that so far have received little attention in CoDA as they do not fit the Euclidean geometry favored by current thinking. We also show how entropy and relative entropy can describe amalgamations in a simple way, while Aitchison distance requires the use of geometric means to obtain more succinct relationships. We proceed to prove the information monotonicity property for Aitchison distance. We close with some thoughts about new directions in CoDA where the rich structure that is provided by information geometry could be exploited.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
07/06/2023

Probability Metrics for Tropical Spaces of Different Dimensions

The problem of comparing probability distributions is at the heart of ma...
research
08/20/2004

Notes on information geometry and evolutionary processes

In order to analyze and extract different structural properties of distr...
research
07/15/2021

Moufang Patterns and Geometry of Information

Technology of data collection and information transmission is based on v...
research
11/29/2018

Gaussian asymptotic limits for the α-transformation in the analysis of compositional data

Compositional data consists of vectors of proportions whose components s...
research
05/06/2021

A Unifying and Canonical Description of Measure-Preserving Diffusions

A complete recipe of measure-preserving diffusions in Euclidean space wa...
research
03/20/2019

Topological Data Analysis in Information Space

Various kinds of data are routinely represented as discrete probability ...
research
06/03/2020

Classifying histograms of medical data using information geometry of beta distributions

In this paper, we use tools of information geometry to compare, average ...

Please sign up or login with your details

Forgot password? Click here to reset