Multiple Testing and Variable Selection along Least Angle Regression's path

06/28/2019
by   J. -M. Azaïs, et al.
0

In this article we investigate the outcomes of the standard Least Angle Regression (LAR) algorithm in high dimensions under the Gaussian noise assumption. We give the exact law of the sequence of knots conditional on the sequence of variables entering the model, i.e., the post-selection law of the knots of the LAR. Based on this result, we prove an exact of the False Discovery Rate (FDR) in the orthogonal design case and an exact control of the existence of false negatives in the general design case. First, we build a sequence of testing procedures on the variables entering the model and we give an exact control of the FDR in the orthogonal design case when the noise level can be unknown. Second, we introduce a new exact testing procedure on the existence of false negatives when the noise level can be unknown. This testing procedure can be deployed after any support selection procedure that will produce an estimation of the support (i.e., the indexes of nonzero coefficients) for any designs. The type I error of the test can be exactly controlled as long as the selection procedure follows some elementary hypotheses, referred to as admissible selection procedures. These support selection procedures are such that the estimation of the support is given by the k first variables entering the model where the random variable k is a stopping time. Monte-Carlo simulations and a real data experiment are provided to illustrate our results.

READ FULL TEXT
research
03/12/2018

False Discovery Rate Control via Debiased Lasso

We consider the problem of variable selection in high-dimensional statis...
research
10/23/2017

A shortcut for Hommel's procedure in linearithmic time

Hommel's and Hochberg's procedures for familywise error control are both...
research
12/23/2022

Rényi Distillation for Global Testing in Sparse Regression Problems

Many modern high-dimensional regression applications involve testing whe...
research
05/25/2022

Resampling-Based Multisplit Inference for High-Dimensional Regression

We propose a novel resampling-based method to construct an asymptoticall...
research
08/10/2017

When Does the First Spurious Variable Get Selected by Sequential Regression Procedures?

Applied statisticians use sequential regression procedures to produce a ...
research
07/10/2023

ARK: Robust Knockoffs Inference with Coupling

We investigate the robustness of the model-X knockoffs framework with re...

Please sign up or login with your details

Forgot password? Click here to reset