Double Double Descent: On Generalization Errors in Transfer Learning between Linear Regression Tasks

06/12/2020
by   Yehuda Dar, et al.
0

We study the transfer learning process between two linear regression problems. An important and timely special case is when the regressors are overparameterized and perfectly interpolate their training data. We examine a parameter transfer mechanism whereby a subset of the parameters of the target task solution are constrained to the values learned for a related source task. We analytically characterize the generalization error of the target task in terms of the salient factors in the transfer learning architecture, i.e., the number of examples available, the number of (free) parameters in each of the tasks, the number of parameters transferred from the source to target task, and the correlation between the two tasks. Our non-asymptotic analysis shows that the generalization error of the target task follows a two-dimensional double descent trend (with respect to the number of free parameters in each of the tasks) that is controlled by the transfer learning factors. Our analysis points to specific cases where the transfer of parameters is beneficial.

READ FULL TEXT

page 7

page 8

page 16

page 19

research
03/09/2021

Transfer Learning Can Outperform the True Prior in Double Descent Regularization

We study a fundamental transfer learning process from source to target l...
research
11/20/2022

Overfreezing Meets Overparameterization: A Double Descent Perspective on Transfer Learning of Deep Neural Networks

We study the generalization behavior of transfer learning of deep neural...
research
01/06/2021

Phase Transitions in Transfer Learning for High-Dimensional Perceptrons

Transfer learning seeks to improve the generalization performance of a t...
research
06/08/2023

Generalization Performance of Transfer Learning: Overparameterized and Underparameterized Regimes

Transfer learning is a useful technique for achieving improved performan...
research
02/25/2020

Subspace Fitting Meets Regression: The Effects of Supervision and Orthonormality Constraints on Double Descent of Generalization Errors

We study the linear subspace fitting problem in the overparameterized se...
research
09/27/2018

An analytic theory of generalization dynamics and transfer learning in deep linear networks

Much attention has been devoted recently to the generalization puzzle in...
research
03/26/2023

Guided Transfer Learning

Machine learning requires exuberant amounts of data and computation. Als...

Please sign up or login with your details

Forgot password? Click here to reset