Principal Balances of Compositional Data for Regression and Classification using Partial Least Squares

11/03/2022
by V. Nesrstová, et al.

High-dimensional compositional data are commonplace in the modern omics sciences, among other fields. Analysis of compositional data requires a proper choice of orthonormal coordinate representation, as their relative nature is not compatible with the direct use of standard statistical methods. Principal balances, a specific class of log-ratio coordinates, are well suited to this context since they are constructed in such a way that the first few coordinates capture most of the variability in the original data. Focusing on regression and classification problems in high dimensions, we propose a novel procedure based on Partial Least Squares (PLS) for constructing principal balances that maximize the explained variability of the response variable. Compared with the ordinary PLS formulation, the procedure notably improves interpretability. The proposed PLS principal balance approach can be understood as a generalized version of common logcontrast models, since multiple orthonormal logcontrasts (instead of one) are estimated simultaneously. We demonstrate the performance of the method using both simulated and real data sets.
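
To make the notion of a balance concrete, the following is a minimal sketch of a single balance coordinate under its standard definition: a normalized log-ratio of the geometric means of two disjoint groups of parts. It is not the PLS-based construction proposed in the paper; the function name `balance` and the example composition are assumptions chosen for illustration.

```python
import numpy as np

def balance(x, num, den):
    """Balance between two disjoint groups of parts (hypothetical helper).

    x   : composition with strictly positive parts
    num : indices of the parts in the numerator group
    den : indices of the parts in the denominator group
    """
    log_x = np.log(np.asarray(x, dtype=float))
    r, s = len(num), len(den)
    # Normalizing constant that gives the underlying logcontrast unit norm.
    scale = np.sqrt(r * s / (r + s))
    # The mean of logs equals the log of the geometric mean of each group.
    return scale * (log_x[..., num].mean(axis=-1) - log_x[..., den].mean(axis=-1))

# Illustrative four-part composition (made-up values).
composition = np.array([0.1, 0.2, 0.3, 0.4])
b = balance(composition, num=[0, 1], den=[2, 3])

# Rescaling every part leaves the balance unchanged, reflecting the
# relative nature of compositional data.
b_scaled = balance(10 * composition, num=[0, 1], den=[2, 3])
```

Because a balance depends only on ratios between parts, `b` and `b_scaled` agree up to floating-point error, which is why such coordinates can be passed to standard regression or classification methods.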

research
03/31/2023

Regression and Classification of Compositional Data via a novel Supervised Log Ratio Method

Compositional data in which only the relative abundances of variables ar...
research
03/11/2021

Overlap of OLS Regression and Principal Loading Analysis

Principal loading analysis is a dimension reduction method that discards...
research
12/29/2021

Compositional Data Regression in Insurance with Exponential Family PCA

Compositional data are multivariate observations that carry only relativ...
research
02/06/2019

Principal Model Analysis Based on Partial Least Squares

Motivated by the Bagging Partial Least Squares (PLS) and Principal Compo...
research
06/03/2019

Copula-based functional Bayes classification with principal components and partial least squares

We present a new functional Bayes classifier that uses principal compone...
research
12/18/2022

Classification of multivariate functional data on different domains with Partial Least Squares approaches

Classification of multivariate functional data is explored in this paper...
research
09/30/2014

Unsupervised Bump Hunting Using Principal Components

Principal Components Analysis is a widely used technique for dimension r...
