Prediction out-of-sample using block shrinkage estimators: model selection and predictive inference

09/12/2018
by Hannes Leeb, et al.

In a linear regression model with random design, we consider a family of candidate models from which we want to select a 'good' model for prediction out-of-sample. We fit the models using block shrinkage estimators, and we focus on the challenging situation where the number of explanatory variables can be of the same order as sample size and where the number of candidate models can be much larger than sample size. We develop an estimator for the out-of-sample predictive performance, and we show that the empirically best model is asymptotically as good as the truly best model. Using the estimator corresponding to the empirically best model, we construct a prediction interval that is approximately valid and short with high probability, i.e., we show that the actual coverage probability is close to the nominal one and that the length of this prediction interval is close to the length of the shortest but infeasible prediction interval. All results hold uniformly over a large class of data-generating processes. These findings extend results of Leeb (2009), where the models are fit using least-squares estimators, and of Huber (2013), where the models are fit using shrinkage estimators without block structure.
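
The workflow summarized above (fit each candidate model with a block shrinkage estimator, estimate its out-of-sample performance, pick the empirically best model, then build a prediction interval from it) can be illustrated with a rough sketch. This is not the paper's procedure: the abstract does not specify the shrinkage rule or the performance estimator, so the sketch assumes fixed per-block shrinkage factors applied to a least-squares fit and substitutes a simple hold-out estimate of mean squared prediction error. The function names, the candidate format, and the simulated data are all hypothetical.

```python
import numpy as np

def block_shrink_fit(X, y, blocks, factors):
    """Least-squares fit, then scale each coefficient block by its factor.

    Assumption: fixed shrinkage factors per block; the paper's actual
    block shrinkage estimator is not given in the abstract.
    """
    beta, *_ = np.linalg.lstsq(X, y, rcond=None)
    beta = beta.copy()
    for idx, c in zip(blocks, factors):
        beta[idx] *= c  # shrink one block of coefficients toward zero
    return beta

def select_and_interval(X, y, candidates, alpha=0.1, seed=0):
    """Pick the candidate with the smallest hold-out MSE (a stand-in for
    the paper's performance estimator) and return residual-quantile
    offsets for a nominal (1 - alpha) prediction interval."""
    rng = np.random.default_rng(seed)
    n = X.shape[0]
    perm = rng.permutation(n)
    train, hold = perm[: n // 2], perm[n // 2:]
    best = None
    for cols, blocks, factors in candidates:
        beta = block_shrink_fit(X[np.ix_(train, cols)], y[train], blocks, factors)
        resid = y[hold] - X[np.ix_(hold, cols)] @ beta
        mse = float(np.mean(resid ** 2))
        if best is None or mse < best["mse"]:
            best = {"cols": cols, "beta": beta, "mse": mse, "resid": resid}
    lo, hi = np.quantile(best["resid"], [alpha / 2, 1 - alpha / 2])
    return best, (float(lo), float(hi))

# Hypothetical simulated data: only the first two regressors matter.
rng = np.random.default_rng(1)
X = rng.standard_normal((200, 6))
y = X[:, :2] @ np.array([2.0, -1.0]) + 0.5 * rng.standard_normal(200)

# Each candidate: (columns used, coefficient blocks within those columns,
# one shrinkage factor per block).
candidates = [
    ([0, 1], [[0, 1]], [0.95]),
    ([2, 3, 4, 5], [[0, 1], [2, 3]], [0.9, 0.5]),
]
best, (lo, hi) = select_and_interval(X, y, candidates)

# Prediction interval for a new observation x_new under the selected model.
x_new = rng.standard_normal(6)
center = float(x_new[best["cols"]] @ best["beta"])
interval = (center + lo, center + hi)
```

Sample splitting keeps the sketch short, but it halves the data available for fitting, which is costly in the regime the abstract targets, where the number of explanatory variables can be of the same order as the sample size.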
