Nonparametric Assessment of Variable Selection and Ranking Algorithms

08/22/2023
by   Zhou Tang, et al.
0

Selecting from or ranking a set of candidates variables in terms of their capacity for predicting an outcome of interest is an important task in many scientific fields. A variety of methods for variable selection and ranking have been proposed in the literature. In practice, it can be challenging to know which method is most appropriate for a given dataset. In this article, we propose methods of comparing variable selection and ranking algorithms. We first introduce measures of the quality of variable selection and ranking algorithms. We then define estimators of our proposed measures, and establish asymptotic results for our estimators in the regime where the dimension of the covariates is fixed as the sample size grows. We use our results to conduct large-sample inference for our measures, and we propose a computationally efficient partial bootstrap procedure to potentially improve finite-sample inference. We assess the properties of our proposed methods using numerical studies, and we illustrate our methods with an analysis of data for predicting wine quality from its physicochemical properties.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/25/2021

Kernel Knockoffs Selection for Nonparametric Additive Models

Thanks to its fine balance between model flexibility and interpretabilit...
research
05/03/2019

Robust Model Selection for Finite Mixture of Regression Models Through Trimming

In this article, we introduce a new variable selection technique through...
research
11/05/2020

Nonparametric Variable Screening with Optimal Decision Stumps

Decision trees and their ensembles are endowed with a rich set of diagno...
research
06/15/2023

Ranking and Selection in Large-Scale Inference of Heteroscedastic Units

The allocation of limited resources to a large number of potential candi...
research
10/14/2019

All of Linear Regression

Least squares linear regression is one of the oldest and widely used dat...
research
06/05/2020

Integrative Sparse Partial Least Squares

Partial least squares, as a dimension reduction method, has become incre...
research
06/05/2018

Selection and Estimation Optimality in High Dimensions with the TWIN Penalty

We introduce a novel class of variable selection penalties called TWIN, ...

Please sign up or login with your details

Forgot password? Click here to reset