Equivalence of Kernel Machine Regression and Kernel Distance Covariance for Multidimensional Trait Association Studies

02/11/2014
by   Wen-Yu Hua, et al.
0

Associating genetic markers with a multidimensional phenotype is an important yet challenging problem. In this work, we establish the equivalence between two popular methods: kernel-machine regression (KMR), and kernel distance covariance (KDC). KMR is a semiparametric regression frameworks that models the covariate effects parametrically, while the genetic markers are considered non-parametrically. KDC represents a class of methods that includes distance covariance (DC) and Hilbert-Schmidt Independence Criterion (HSIC), which are nonparametric tests of independence. We show the equivalence between the score test of KMR and the KDC statistic under certain conditions. This result leads to a novel generalization of the KDC test that incorporates the covariates. Our contributions are three-fold: (1) establishing the equivalence between KMR and KDC; (2) showing that the principles of kernel machine regression can be applied to the interpretation of KDC; (3) the development of a broader class of KDC statistics, that the members are the quantities of different kernels. We demonstrate the proposals using simulation studies. Data from the Alzheimer's Disease Neuroimaging Initiative (ADNI) is used to explore the association between the genetic variants on gene FLJ16124 and phenotypes represented in 3D structural brain MR images adjusting for age and gender. The results suggest that SNPs of FLJ16124 exhibit strong pairwise interaction effects that are correlated to the changes of brain region volumes.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
06/14/2018

The Exact Equivalence of Distance and Kernel Methods for Hypothesis Testing

Distance-based methods, also called "energy statistics", are leading met...
research
07/25/2012

Equivalence of distance-based and RKHS-based statistics in hypothesis testing

We provide a unifying framework linking two classes of statistics used i...
research
04/01/2019

Gene-based Association Analysis for Bivariate Time-to-event Data through Functional Regression with Copula Models

Several gene-based association tests for time-to-event traits have been ...
research
05/02/2012

Hypothesis testing using pairwise distances and associated kernels (with Appendix)

We provide a unifying framework linking two classes of statistics used i...
research
04/29/2021

Dynamic Gene Coexpression Analysis with Correlation Modeling

In many transcriptomic studies, the correlation of genes might fluctuate...
research
06/01/2016

Gene-Gene association for Imaging Genetics Data using Robust Kernel Canonical Correlation Analysis

In genome-wide interaction studies, to detect gene-gene interactions, mo...
research
11/30/2022

A Pseudo-Value Regression Approach for Differential Network Analysis of Co-Expression Data

The differential network (DN) analysis identifies changes in measures of...

Please sign up or login with your details

Forgot password? Click here to reset