Information geometry for phylogenetic trees

03/29/2020
by   Maryam K. Garba, et al.
0

We propose a new space to model phylogenetic trees. It is based on a biologically motivated Markov model for genetic sequence evolution. As a point set, this space comprises the previously developed Billera-Holmes-Vogtmann (BHV) tree space while its geometry is motivated from the edge-product space. As the latter, our new wald space also involves disconnected forests, it does not contain certain singularities of the latter, though. The geometry of wald space is that of the Fisher information metric of character distributions, either from a discrete Bernoulli or from a continuous Gaussian model. The latter can be viewed as the trace metric of the affine-invariant metric for covariance matrices, the former is that of the Hellinger divergence, or, as we show, equivalent to any metric obtained from an f -divergence, such as the Jensen-Shannon metric. For the latter (continuous) we derive a gradient descent algorithm to project from the ambient space of covariance matrices to wald space and for both we derive computational methods to compute geodesics in polynomial time and show numerically that the two information geometries (discrete and continuous) are very similar. In particular geodesics are approximated extrinsically. Comparison with the BHV geometry shows that our canonical and biologically motived space is substantially different.

READ FULL TEXT
research
05/27/2022

Information geometry of the Tojo-Yoshino's exponential family on the Poincaré upper plane

We study the dually flat information geometry of the Tojo-Yoshino expone...
research
09/12/2022

Foundations of the Wald Space for Phylogenetic Trees

Evolutionary relationships between species are represented by phylogenet...
research
05/29/2018

Regularization of covariance matrices on Riemannian manifolds using linear systems

We propose an approach to use the state covariance of linear systems to ...
research
03/06/2020

Wasserstein statistics in 1D location-scale model

Wasserstein geometry and information geometry are two important structur...
research
05/31/2018

Tropical Foundations for Probability & Statistics on Phylogenetic Tree Space

We introduce a novel framework for the statistical analysis of phylogene...
research
07/11/2012

An Extended Cencov-Campbell Characterization of Conditional Information Geometry

We formulate and prove an axiomatic characterization of conditional info...
research
11/25/2022

The randomization by Wishart laws and the Fisher information

Consider the centered Gaussian vector X in ^n with covariance matrix Σ. ...

Please sign up or login with your details

Forgot password? Click here to reset