A Geometric Statistic for Quantifying Correlation Between Tree-Shaped Datasets

03/11/2023
by Shanjun Mao, et al.
The magnitude of the Pearson correlation between two scalar random variables can be judged visually from a two-dimensional scatter plot of an independent and identically distributed sample drawn from their joint distribution: the closer the points lie to a straight slanted line, the greater the correlation. To the best of our knowledge, no comparable graphical representation or geometric quantification of tree correlation exists in the literature, although tree-shaped datasets are frequently encountered in various fields, for example academic genealogy trees and embryonic development trees. In this paper, we introduce a geometric statistic that both represents tree correlation intuitively and quantifies its magnitude precisely. We provide the theoretical properties of the statistic. Large-scale simulations based on various data distributions demonstrate that the statistic measures tree correlation precisely. An application to real mathematical genealogy trees also demonstrates its usefulness.

research
03/25/2021

Logarithmic law of large random correlation matrix

Consider a random vector 𝐲=Σ^1/2𝐱, where the p elements of the vector 𝐱 ...
research
07/29/2018

On L-shaped Point Set Embeddings of Trees: First Non-embeddable Examples

An L-shaped embedding of a tree in a point set is a planar drawing of th...
research
06/01/2021

Quantification of Carbon Sequestration in Urban Forests

Vegetation, trees in particular, sequester carbon by absorbing carbon di...
research
07/23/2020

The Heyde theorem on a group ℝ^n× D, where D is a discrete Abelian group

Heyde proved that a Gaussian distribution on the real line is characteri...
research
12/11/2020

Structure learning for extremal tree models

Extremal graphical models are sparse statistical models for multivariate...
research
11/26/2019

The spatiotemporal tau statistic: a review

Introduction The tau statistic is a recent second-order correlation fu...
research
10/17/2021

On the Statistical Analysis of Complex Tree-shaped 3D Objects

How can one analyze detailed 3D biological objects, such as neurons and ...
