False Discovery Rate Control Under General Dependence By Symmetrized Data Aggregation

02/27/2020
by   Lilun Du, et al.
0

We develop a new class of distribution–free multiple testing rules for false discovery rate (FDR) control under general dependence. A key element in our proposal is a symmetrized data aggregation (SDA) approach to incorporating the dependence structure via sample splitting, data screening and information pooling. The proposed SDA filter first constructs a sequence of ranking statistics that fulfill global symmetry properties, and then chooses a data–driven threshold along the ranking to control the FDR. The SDA filter substantially outperforms the knockoff method in power under moderate to strong dependence, and is more robust than existing methods based on asymptotic p-values. We first develop finite–sample theory to provide an upper bound for the actual FDR under general dependence, and then establish the asymptotic validity of SDA for both the FDR and false discovery proportion (FDP) control under mild regularity conditions. The procedure is implemented in the R package SDA. Numerical results confirm the effectiveness and robustness of SDA in FDR control and show that it achieves substantial power gain over existing methods in many settings.

READ FULL TEXT

page 7

page 26

page 28

page 29

research
07/20/2020

Conditional calibration for false discovery rate control under dependence

We introduce a new class of methods for finite-sample false discovery ra...
research
10/17/2019

Information Loss and Power Distortion from Standardizing in Multiple Hypothesis Testing

Standardization has been a widely adopted practice in multiple testing, ...
research
07/27/2022

Model-Free, Monotone Invariant and Computationally Efficient Feature Screening with Data-adaptive Threshold

Feature screening for ultrahigh-dimension, in general, proceeds with two...
research
07/03/2022

Asymptotic Uncertainty of False Discovery Proportion

Multiple testing has been a popular topic in statistical research. Altho...
research
12/27/2022

Weak Signal Inclusion Under Dependence and Applications in Genome-wide Association Study

Motivated by the inquiries of weak signals in underpowered genome-wide a...
research
02/21/2020

Aggregation of Multiple Knockoffs

We develop an extension of the Knockoff Inference procedure, introduced ...
research
03/20/2021

Distance Assisted Recursive Testing

In many applications, a large number of features are collected with the ...

Please sign up or login with your details

Forgot password? Click here to reset