A General Framework for Powerful Confounder Adjustment in Omics Association Studies

05/23/2022
by   Asmita Roy, et al.
0

Genomic data are subject to various sources of confounding, such as batch effects and cell mixtures. To identify genomic features associated with a variable of interest in the presence of confounders, the traditional approach involves fitting a confounder-adjusted regression model to each genomic feature, followed by multiplicity correction. Previously, we showed that this procedure was sub-optimal and proposed a more powerful procedure named the two-dimensional false discovery rate control (2dFDR) procedure, which relied on the test statistics from both confounder-adjusted and unadjusted linear regression models (Yi et al (2021)). Though 2dFDR provides significant power improvement over the traditional method, it is based on the linear model assumption that may be too restrictive for some practical settings. This study proposes a model-free two-dimensional false discovery rate control procedure (MF-2dFDR) to significantly broaden the scope and applicability of 2dFDR. MF-2dFDR uses marginal independence test statistics as auxiliary information to filter out less promising features, and FDR control is performed based on conditional independence test statistics in the remaining features. MF-2dFDR provides (asymptotically) valid inference from samples in settings in which the conditional distribution of the genomic variables given the covariate of interest and the confounders is arbitrary and completely unknown. To achieve this goal, our method requires the conditional distribution of the covariate given the confounders to be known or can be estimated from the data. We develop a conditional randomization procedure to simultaneously select the two cutoff values for the marginal and conditional independence test statistics. Promising finite sample performance is demonstrated via extensive simulations and real data applications.

READ FULL TEXT

page 19

page 20

page 21

page 22

page 23

page 40

page 41

page 42

research
07/20/2020

Conditional calibration for false discovery rate control under dependence

We introduce a new class of methods for finite-sample false discovery ra...
research
01/11/2018

Robust inference with knockoffs

We consider the variable selection problem, which seeks to identify impo...
research
08/18/2022

DIET: Conditional independence testing with marginal dependence measures of residual information

Conditional randomization tests (CRTs) assess whether a variable x is pr...
research
11/03/2022

The Projected Covariance Measure for assumption-lean variable significance testing

Testing the significance of a variable or group of variables X for predi...
research
03/12/2022

Maxway CRT: Improving the Robustness of Model-X Inference

The model-X conditional randomization test (CRT) proposed by Candès et a...
research
05/17/2022

Individualized conditional independence testing under model-X with heterogeneous samples and interactions

Model-X knockoffs and the conditional randomization test are methods tha...
research
09/06/2018

IPAD: Stable Interpretable Forecasting with Knockoffs Inference

Interpretability and stability are two important features that are desir...

Please sign up or login with your details

Forgot password? Click here to reset