False Discovery Rate Control via Debiased Lasso

03/12/2018
by Adel Javanmard, et al.

We consider the problem of variable selection in high-dimensional statistical models, where the goal is to report a set of variables, out of many predictors X_1, …, X_p, that are relevant to a response of interest. For the high-dimensional linear model, where the number of parameters exceeds the number of samples (p > n), we propose a procedure for variable selection and prove that it controls the directional false discovery rate (FDR) below a pre-assigned significance level q ∈ [0,1]. We further analyze the statistical power of our framework and show that, for designs with subgaussian rows and a common precision matrix Ω ∈ R^{p×p}, if the minimum nonzero parameter θ_min satisfies √n θ_min − σ √(2 (max_{i∈[p]} Ω_ii) log(2p/(q s_0))) → ∞, then this procedure achieves asymptotic power one. Our framework is built upon the debiasing approach and assumes the standard condition s_0 = o(√n/(log p)^2), where s_0 indicates the number of true positives among the p features. Notably, this framework achieves exact directional FDR control without any assumption on the amplitude of the unknown regression parameters, and does not require any knowledge of the distribution of the covariates or the noise level. We test our method in synthetic and real data experiments to assess its performance and to corroborate our theoretical results.
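To give a feel for the scales involved, the following minimal sketch evaluates the two quantities that appear in the abstract's conditions, reading the power condition as √n θ_min − σ √(2 (max_i Ω_ii) log(2p/(q s_0))) → ∞. The function names and the example numbers are illustrative assumptions, not part of the paper, and the conditions themselves are asymptotic, so these values are only a rough finite-sample guide rather than a guarantee.

```python
import math


def signal_scale(n, p, s0, q, sigma, omega_diag_max):
    """Hypothetical helper: sigma * sqrt(2 * max_i Omega_ii * log(2p/(q*s0))) / sqrt(n).

    The abstract's power condition asks sqrt(n) * theta_min to exceed
    sigma * sqrt(2 * max_i Omega_ii * log(2p/(q*s0))) by a diverging margin;
    dividing by sqrt(n) expresses that level on the scale of theta_min.
    """
    if not (0 < q <= 1):
        raise ValueError("q must lie in (0, 1]")
    if min(n, p, s0) <= 0 or sigma <= 0 or omega_diag_max <= 0:
        raise ValueError("n, p, s0, sigma and omega_diag_max must be positive")
    log_term = math.log(2.0 * p / (q * s0))
    return sigma * math.sqrt(2.0 * omega_diag_max * log_term) / math.sqrt(n)


def sparsity_ratio(n, p, s0):
    """Hypothetical helper: s0 * (log p)^2 / sqrt(n).

    The condition s0 = o(sqrt(n) / (log p)^2) says this ratio tends to zero.
    """
    return s0 * math.log(p) ** 2 / math.sqrt(n)


if __name__ == "__main__":
    # Illustrative numbers only (assumed, not taken from the paper).
    n, p, s0, q = 400, 1000, 10, 0.1
    print("signal scale:", signal_scale(n, p, s0, q, sigma=1.0, omega_diag_max=1.0))
    print("sparsity ratio:", sparsity_ratio(n, p, s0))
```

A nonzero coefficient well above the printed signal scale is in the regime where the power result is suggestive, while a sparsity ratio that is not small indicates the debiasing assumption may be strained at that sample size.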

research
05/02/2021

Directional FDR Control for Sub-Gaussian Sparse GLMs

High-dimensional sparse generalized linear models (GLMs) have emerged in...
research
06/28/2019

Multiple Testing and Variable Selection along Least Angle Regression's path

In this article we investigate the outcomes of the standard Least Angle ...
research
11/13/2008

P-values for high-dimensional regression

Assigning significance in high-dimensional regression is challenging. Mo...
research
04/09/2018

Efficient Predictor Ranking and False Discovery Proportion Control in High-Dimensional Regression

We propose a ranking and selection procedure to prioritize relevant pred...
research
10/12/2021

The Terminating-Knockoff Filter: Fast High-Dimensional Variable Selection with False Discovery Rate Control

We propose the Terminating-Knockoff (T-Knock) filter, a fast variable se...
research
12/21/2020

Two-directional simultaneous inference for high-dimensional models

This paper proposes a general two directional simultaneous inference (TO...
research
01/11/2018

Robust inference with knockoffs

We consider the variable selection problem, which seeks to identify impo...