An a Priori Exponential Tail Bound for k-Folds Cross-Validation

06/19/2017
by   Karim Abou-Moustafa, et al.
0

We consider a priori generalization bounds developed in terms of cross-validation estimates and the stability of learners. In particular, we first derive an exponential Efron-Stein type tail inequality for the concentration of a general function of n independent random variables. Next, under some reasonable notion of stability, we use this exponential tail bound to analyze the concentration of the k-fold cross-validation (KFCV) estimate around the true risk of a hypothesis generated by a general learning rule. While the accumulated literature has often attributed this concentration to the bias and variance of the estimator, our bound attributes this concentration to the stability of the learning rule and the number of folds k. This insight raises valid concerns related to the practical use of KFCV and suggests research directions to obtain reliable empirical estimates of the actual risk.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/12/2019

An Exponential Efron-Stein Inequality for Lq Stable Learning Rules

There is accumulating evidence in the literature that stability of learn...
research
11/23/2010

Concentration inequalities of the cross-validation estimate for stable predictors

In this article, we derive concentration inequalities for the cross-vali...
research
02/01/2022

Cross Validation for Rare Events

We derive sanity-check bounds for the cross-validation (CV) estimate of ...
research
05/20/2017

( β, ϖ)-stability for cross-validation and the choice of the number of folds

In this paper, we introduce a new concept of stability for cross-validat...
research
05/28/2021

Optimality of Cross-validation in Scattered Data Approximation

Choosing models from a hypothesis space is a frequent task in approximat...
research
11/04/2022

Concentration inequalities for leave-one-out cross validation

In this article we prove that estimator stability is enough to show that...
research
09/06/2018

A note on concentration inequality for vector-valued martingales with weak exponential-type tails

We present novel martingale concentration inequalities for martingale di...

Please sign up or login with your details

Forgot password? Click here to reset