The chi-square standardization, combined with Box-Cox transformation, is a valid alternative to transforming to logratios in compositional data analysis

11/12/2022
by   Michael Greenacre, et al.
0

The approach to analysing compositional data with a fixed sum constraint has been dominated by the use of logratio transformations, to ensure exact subcompositional coherence and, in some situations, exact isometry as well. A problem with this approach is that data zeros, found in most applications, have to be replaced to permit the logarithmic transformation. A simpler approach is to use the chi-square standardization that is inherent in correspondence analysis. Combined with the Box-Cox power transformation, this standardization defines chi-square distances that tend to logratio distances for strictly positive data as the power parameter tends to zero, and can thus be considered equivalent to transforming to logratios. For data with zeros, a value of the power can be identified that brings the chi-square standardization as close as possible to transforming by logratios, without having to substitute the zeros. Especially in the field of high-dimensional "omics" data, this alternative presents such a high level of coherence and isometry as to be a valid, and much simpler, approach to the analysis of compositional data.

READ FULL TEXT
research
01/13/2022

Aitchison's Compositional Data Analysis 40 Years On: A Reappraisal

The development of John Aitchison's approach to compositional data analy...
research
11/29/2018

Gaussian asymptotic limits for the α-transformation in the analysis of compositional data

Compositional data consists of vectors of proportions whose components s...
research
10/15/2021

A new class of α-transformations for the spatial analysis of Compositional Data

Georeferenced compositional data are prominent in many scientific fields...
research
07/07/2023

GeoCoDA: Recognizing and Validating Structural Processes in Geochemical Data. A Workflow on Compositional Data Analysis in Lithogeochemistry

Geochemical data are compositional in nature and are subject to the prob...
research
01/03/2023

Notes on Correspondence Analysis of Power Transformed Data Sets

We prospect for a clear simple picture on CA of power transformed or the...
research
12/13/2015

Big Data Scaling through Metric Mapping: Exploiting the Remarkable Simplicity of Very High Dimensional Spaces using Correspondence Analysis

We present new findings in regard to data analysis in very high dimensio...
research
11/07/2019

Data transforming augmentation for heteroscedastic models

Data augmentation (DA) turns seemingly intractable computational problem...

Please sign up or login with your details

Forgot password? Click here to reset