Scalable Bayesian divergence time estimation with ratio transformations

10/25/2021
by   Xiang Ji, et al.
0

Divergence time estimation is crucial to provide temporal signals for dating biologically important events, from species divergence to viral transmissions in space and time. With the advent of high-throughput sequencing, recent Bayesian phylogenetic studies have analyzed hundreds to thousands of sequences. Such large-scale analyses challenge divergence time reconstruction by requiring inference on highly-correlated internal node heights that often become computationally infeasible. To overcome this limitation, we explore a ratio transformation that maps the original N - 1 internal node heights into a space of one height parameter and N - 2 ratio parameters. To make analyses scalable, we develop a collection of linear-time algorithms to compute the gradient and Jacobian-associated terms of the log-likelihood with respect to these ratios. We then apply Hamiltonian Monte Carlo sampling with the ratio transform in a Bayesian framework to learn the divergence times in four pathogenic virus phylogenies: West Nile virus, rabies virus, Lassa virus and Ebola virus. Our method both resolves a mixing issue in the West Nile virus example and improves inference efficiency by at least 5-fold for the Lassa and rabies virus examples. Our method also makes it now computationally feasible to incorporate mixed-effects molecular clock models for the Ebola virus example, confirms the findings from the original study and reveals clearer multimodal distributions of the divergence times of some clades of interest.

READ FULL TEXT

page 20

page 22

page 24

research
05/29/2019

Gradients do grow on trees: a linear-time O( N )-dimensional gradient for statistical phylogenetics

Calculation of the log-likelihood stands as the computational bottleneck...
research
03/08/2023

Many-core algorithms for high-dimensional gradients on phylogenetic trees

The rapid growth in genomic pathogen data spurs the need for efficient i...
research
06/13/2021

Adaptation of the Tuning Parameter in General Bayesian Inference with Robust Divergence

We introduce a methodology for robust Bayesian estimation with robust di...
research
05/18/2018

Model Inference with Stein Density Ratio Estimation

The Kullback-Leilber divergence from model to data is a classic goodness...
research
05/15/2021

Shrinkage-based random local clocks with scalable inference

Local clock models propose that the rate of molecular evolution is const...
research
10/06/2020

Bayesian mitigation of spatial coarsening for a fairly flexible spatiotemporal Hawkes model

Self-exciting spatiotemporal Hawkes processes have found increasing use ...
research
03/29/2018

Prefix-Free Parsing for Building Big BWTs

High-throughput sequencing technologies have led to explosive growth of ...

Please sign up or login with your details

Forgot password? Click here to reset