Cauchy robust principal component analysis with applications to high-dimensional data sets

11/06/2022
by Ayisha Fayomi, et al.

Principal component analysis (PCA) is a standard dimensionality reduction technique used in various research and applied fields. From an algorithmic point of view, classical PCA can be formulated in terms of operations on a multivariate Gaussian likelihood. As a consequence of the implied Gaussian formulation, the principal components are not robust to outliers. In this paper, we propose a modified formulation, based on the use of a multivariate Cauchy likelihood instead of the Gaussian likelihood, which has the effect of robustifying the principal components. We present an algorithm to compute these robustified principal components. We additionally derive the relevant influence function of the first component and examine its theoretical properties. Simulation experiments on high-dimensional datasets demonstrate that the estimated principal components based on the Cauchy likelihood outperform or are on par with existing robust PCA techniques.
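
To make the robustness claim concrete, the following is a minimal two-dimensional sketch and not the algorithm from the paper. It assumes a simplified projection-pursuit stand-in: the first direction is the one that maximises the Cauchy scale estimate of the projected data, found by a grid search over angles. The function names (`cauchy_location_scale`, `classical_first_direction`, `cauchy_first_direction`, `angle_to_x_axis`) and the toy data are hypothetical and chosen only for illustration.

```python
import numpy as np

def cauchy_location_scale(x, n_iter=500, tol=1e-10):
    """Cauchy location/scale MLE via the EM updates for a t distribution with 1 d.o.f."""
    x = np.asarray(x, dtype=float)
    mu = np.median(x)                                # robust starting values
    sigma = max(np.median(np.abs(x - mu)), 1e-12)
    for _ in range(n_iter):
        w = 2.0 / (1.0 + ((x - mu) / sigma) ** 2)    # small weights for distant points
        mu_new = np.sum(w * x) / np.sum(w)
        sigma_new = max(np.sqrt(np.mean(w * (x - mu_new) ** 2)), 1e-12)
        converged = abs(mu_new - mu) + abs(sigma_new - sigma) < tol
        mu, sigma = mu_new, sigma_new
        if converged:
            break
    return mu, sigma

def classical_first_direction(X):
    """Leading eigenvector of the sample covariance (classical PCA)."""
    _, evecs = np.linalg.eigh(np.cov(X, rowvar=False))
    return evecs[:, -1]                              # eigh sorts eigenvalues ascending

def cauchy_first_direction(X, n_angles=180):
    """ASSUMPTION: pick the 2-D direction whose projections have the largest Cauchy scale."""
    thetas = np.linspace(0.0, np.pi, n_angles, endpoint=False)
    dirs = np.column_stack([np.cos(thetas), np.sin(thetas)])
    scales = [cauchy_location_scale(X @ u)[1] for u in dirs]
    return dirs[int(np.argmax(scales))]

def angle_to_x_axis(u):
    """Angle in degrees between direction u and the x-axis, folded into [0, 90]."""
    return float(np.degrees(np.arctan2(abs(u[1]), abs(u[0]))))

# Toy data: inliers stretched along the x-axis plus a small cluster of outliers far up the y-axis.
rng = np.random.default_rng(0)
inliers = rng.normal(size=(300, 2)) * np.array([3.0, 0.5])
outliers = rng.normal(size=(10, 2)) * 0.5 + np.array([0.0, 50.0])
X = np.vstack([inliers, outliers])

print("classical PCA angle:", angle_to_x_axis(classical_first_direction(X)))
print("Cauchy-scale angle: ", angle_to_x_axis(cauchy_first_direction(X)))
```

In this toy setting the covariance-based direction is pulled towards the outlier cluster, while the Cauchy-scale direction stays close to the axis along which the inliers vary most. A grid search over angles only works in two dimensions; high-dimensional data, which is the setting of the paper, would need a proper optimiser over unit vectors.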
