Tight basis cycle representatives for persistent homology of large data sets

06/06/2022
by   Manu Aggarwal, et al.
0

Persistent homology (PH) is a popular tool for topological data analysis that has found applications across diverse areas of research. It provides a rigorous method to compute robust topological features in discrete experimental observations that often contain various sources of uncertainties. Although powerful in theory, PH suffers from high computation cost that precludes its application to large data sets. Additionally, most analyses using PH are limited to computing the existence of nontrivial features. Precise localization of these features is not generally attempted because, by definition, localized representations are not unique and because of even higher computation cost. For scientific applications, such a precise location is a sine qua non for determining functional significance. Here, we provide a strategy and algorithms to compute tight representative boundaries around nontrivial robust features in large data sets. To showcase the efficiency of our algorithms and the precision of computed boundaries, we analyze three data sets from different scientific fields. In the human genome, we found an unexpected effect on loops through chromosome 13 and the sex chromosomes, upon impairment of chromatin loop formation. In a distribution of galaxies in the universe, we found statistically significant voids. In protein homologs with significantly different topology, we found voids attributable to ligand-interaction, mutation, and differences between species.

READ FULL TEXT
research
03/09/2021

Dory: Overcoming Barriers to Computing Persistent Homology

Persistent homology (PH) is an approach to topological data analysis (TD...
research
07/31/2019

Persistent Intersection Homology for the Analysis of Discrete Data

Topological data analysis is becoming increasingly relevant to support t...
research
07/07/2016

Persistent Homology on Grassmann Manifolds for Analysis of Hyperspectral Movies

The existence of characteristic structure, or shape, in complex data set...
research
02/18/2019

Persistent entropy: a scale-invariant topological statistic for analyzing cell arrangements

In this work, we explain how to use computational topology for detecting...
research
09/30/2022

Fast Topological Signal Identification and Persistent Cohomological Cycle Matching

Within the context of topological data analysis, the problems of identif...
research
06/21/2018

Interpretable Discovery in Large Image Data Sets

Automated detection of new, interesting, unusual, or anomalous images wi...
research
05/14/2021

Minimal Cycle Representatives in Persistent Homology using Linear Programming: an Empirical Study with User's Guide

Cycle representatives of persistent homology classes can be used to prov...

Please sign up or login with your details

Forgot password? Click here to reset