On Optimal Generalizability in Parametric Learning

11/14/2017
by Ahmad Beirami, et al.

We consider the parametric learning problem, in which the learner's objective is determined by a parametric loss function. Under empirical risk minimization, possibly with regularization, the inferred parameter vector is biased toward the training samples. In practice, such bias is measured by cross validation, where the data set is partitioned into a training set used to fit the model and a validation set, held out from training, used to measure out-of-sample performance. A classical strategy is leave-one-out cross validation (LOOCV), in which a single sample is held out for validation, the model is trained on the remaining samples, and the process is repeated for every sample. LOOCV is rarely used in practice because of its high computational cost. In this paper, we first develop a computationally efficient approximate LOOCV (ALOOCV) and provide theoretical guarantees on its performance. We then use ALOOCV to build an optimization algorithm for tuning the regularizer within the empirical risk minimization framework. Our numerical experiments illustrate the accuracy and efficiency of ALOOCV, as well as the effectiveness of the proposed framework for optimizing the regularizer.
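To make the computational trade-off concrete, the sketch below contrasts exact LOOCV (one full re-fit per held-out sample) with a generic one-Newton-step leave-one-out approximation that fits once and then applies a per-sample correction using the full-data Hessian. This is only a minimal illustration of the general idea, not the paper's ALOOCV estimator; the L2-regularized logistic model and the helper names (fit, loocv, alo_estimate) are assumptions made for the example.

# A rough sketch (assumed L2-regularized logistic regression; helper names
# such as `alo_estimate` are illustrative, not the paper's exact estimator).
import numpy as np
from scipy.optimize import minimize

def nll(theta, X, y, lam):
    # Regularized empirical risk: logistic loss (labels in {-1, +1}) + L2 penalty.
    return np.sum(np.log1p(np.exp(-y * (X @ theta)))) + 0.5 * lam * theta @ theta

def nll_grad(theta, X, y, lam):
    s = 1.0 / (1.0 + np.exp(y * (X @ theta)))
    return -(X.T @ (y * s)) + lam * theta

def hessian(theta, X, y, lam):
    # Hessian of the regularized empirical risk at theta.
    p = 1.0 / (1.0 + np.exp(-X @ theta))
    w = p * (1.0 - p)
    return X.T @ (w[:, None] * X) + lam * np.eye(X.shape[1])

def fit(X, y, lam):
    d = X.shape[1]
    return minimize(nll, np.zeros(d), jac=nll_grad, args=(X, y, lam),
                    method="L-BFGS-B").x

def sample_loss(theta, x, y):
    return np.log1p(np.exp(-y * (x @ theta)))

def loocv(X, y, lam):
    # Exact LOOCV: n full re-fits, one per held-out sample.
    n = len(y)
    losses = []
    for i in range(n):
        mask = np.arange(n) != i
        theta_i = fit(X[mask], y[mask], lam)
        losses.append(sample_loss(theta_i, X[i], y[i]))
    return np.mean(losses)

def alo_estimate(X, y, lam):
    # Approximate LOO: fit once, then take one Newton-style correction per
    # held-out sample using the full-data Hessian (one linear solve each).
    theta = fit(X, y, lam)
    H = hessian(theta, X, y, lam)
    losses = []
    for i in range(len(y)):
        s = 1.0 / (1.0 + np.exp(y[i] * (X[i] @ theta)))
        g_i = -y[i] * s * X[i]  # gradient of sample i's loss at the full-data optimum
        # Removing sample i shifts the optimum by roughly +H^{-1} g_i.
        theta_i = theta + np.linalg.solve(H, g_i)
        losses.append(sample_loss(theta_i, X[i], y[i]))
    return np.mean(losses)

The approximate variant replaces n full optimizations with a single fit plus n linear solves against the same Hessian, which is where the speedup over exact LOOCV comes from.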


