Consistent selection of tuning parameters via variable selection stability

08/16/2012
by   Wei Sun, et al.
0

Penalized regression models are popularly used in high-dimensional data analysis to conduct variable selection and model fitting simultaneously. Whereas success has been widely reported in literature, their performances largely depend on the tuning parameters that balance the trade-off between model fitting and model sparsity. Existing tuning criteria mainly follow the route of minimizing the estimated prediction error or maximizing the posterior model probability, such as cross-validation, AIC and BIC. This article introduces a general tuning parameter selection criterion based on a novel concept of variable selection stability. The key idea is to select the tuning parameters so that the resultant penalized regression model is stable in variable selection. The asymptotic selection consistency is established for both fixed and diverging dimensions. The effectiveness of the proposed criterion is also demonstrated in a variety of simulated examples as well as an application to the prostate cancer data.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
01/30/2013

A note on selection stability: combining stability and prediction

Recently, many regularized procedures have been proposed for variable se...
research
08/31/2021

Variable Selection in Regression Model with AR(p) Error Terms Based on Heavy Tailed Distributions

Parameter estimation and the variable selection are two pioneer issues i...
research
12/01/2019

Penalized Matrix Regression for Two-Dimensional Variable Selection

The root-cause diagnostics for product quality defects in multistage man...
research
04/02/2014

Don't Fall for Tuning Parameters: Tuning-Free Variable Selection in High Dimensions With the TREX

Lasso is a seminal contribution to high-dimensional statistics, but it h...
research
01/29/2018

Fast Penalized Regression and Cross Validation for Tall Data with the oem Package

A large body of research has focused on theory and computation for varia...
research
04/04/2021

Scalable algorithms for semiparametric accelerated failure time models in high dimensions

Semiparametric accelerated failure time (AFT) models are a useful altern...
research
09/20/2021

Variable Selection in GLM and Cox Models with Second-Generation P-Values

Variable selection has become a pivotal choice in data analyses that imp...

Please sign up or login with your details

Forgot password? Click here to reset