Influence Function and Robust Variant of Kernel Canonical Correlation Analysis

05/09/2017
by   Md. Ashad Alam, et al.
0

Many unsupervised kernel methods rely on the estimation of the kernel covariance operator (kernel CO) or kernel cross-covariance operator (kernel CCO). Both kernel CO and kernel CCO are sensitive to contaminated data, even when bounded positive definite kernels are used. To the best of our knowledge, there are few well-founded robust kernel methods for statistical unsupervised learning. In addition, while the influence function (IF) of an estimator can characterize its robustness, asymptotic properties and standard error, the IF of a standard kernel canonical correlation analysis (standard kernel CCA) has not been derived yet. To fill this gap, we first propose a robust kernel covariance operator (robust kernel CO) and a robust kernel cross-covariance operator (robust kernel CCO) based on a generalized loss function instead of the quadratic loss function. Second, we derive the IF for robust kernel CCO and standard kernel CCA. Using the IF of the standard kernel CCA, we can detect influential observations from two sets of data. Finally, we propose a method based on the robust kernel CO and the robust kernel CCO, called robust kernel CCA, which is less sensitive to noise than the standard kernel CCA. The introduced principles can also be applied to many other kernel methods involving kernel CO or kernel CCO. Our experiments on synthesized data and imaging genetics analysis demonstrate that the proposed IF of standard kernel CCA can identify outliers. It is also seen that the proposed robust kernel CCA method performs better for ideal and contaminated data than the standard kernel CCA.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/17/2016

Robust Kernel (Cross-) Covariance Operators in Reproducing Kernel Hilbert Space toward Kernel Methods

To the best of our knowledge, there are no general well-founded robust m...
research
06/01/2016

Identifying Outliers using Influence Function of Multiple Kernel Canonical Correlation Analysis

Imaging genetic research has essentially focused on discovering unique a...
research
10/19/2019

Robustifying multiple-set linear canonical analysis with S-estimator

We consider a robust version of multiple-set linear canonical analysis o...
research
11/06/2018

Robust multiple-set linear canonical analysis based on minimum covariance determinant estimator

By deriving influence functions related to multiple-set linear canonical...
research
05/10/2012

A Generalized Kernel Approach to Structured Output Learning

We study the problem of structured output learning from a regression per...
research
07/15/2021

The Completion of Covariance Kernels

We consider the problem of positive-semidefinite continuation: extending...
research
10/29/2020

On the robustness of kernel-based pairwise learning

It is shown that many results on the statistical robustness of kernel-ba...

Please sign up or login with your details

Forgot password? Click here to reset