Transfer learning of regression models from a sequence of datasets by penalized estimation

07/04/2020
by   Wessel N. van Wieringen, et al.
0

Transfer learning refers to the promising idea of initializing model fits based on pre-training on other data. We particularly consider regression modeling settings where parameter estimates from previous data can be used as anchoring points, yet may not be available for all parameters, thus covariance information cannot be reused. A procedure that updates through targeted penalized estimation, which shrinks the estimator towards a nonzero value, is presented. The parameter estimate from the previous data serves as this nonzero value when an update is sought from novel data. This naturally extends to a sequence of data sets with the same response, but potentially only partial overlap in covariates. The iteratively updated regression parameter estimator is shown to be asymptotically unbiased and consistent. The penalty parameter is chosen through constrained cross-validated loglikelihood optimization. The constraint bounds the amount of shrinkage of the updated estimator toward the current one from below. The bound aims to preserve the (updated) estimator's goodness-of-fit on all-but-the-novel data. The proposed approach is compared to other regression modeling procedures. Finally, it is illustrated on an epidemiological study where the data arrive in batches with different covariate-availability and the model is re-fitted with the availability of a novel batch.

READ FULL TEXT
research
12/05/2018

Jeffreys' prior, finiteness and shrinkage in binomial-response generalized linear models

This paper studies the finiteness properties of a reduced-bias estimator...
research
11/26/2022

Transfer learning with high-dimensional quantile regression

Transfer learning has become an essential technique to exploit informati...
research
04/15/2023

On the existence of Firth's modified estimates in logistic regression models

In logistic regression modeling, Firth's modified estimator is widely us...
research
06/16/2020

Multi-Model Penalized Regression

Model fitting often aims to fit a single model, assuming that the impose...
research
05/14/2023

Nonlinear regression: finite sample guarantees

This paper offers a new approach for study the frequentist properties of...
research
09/26/2015

Targeted Fused Ridge Estimation of Inverse Covariance Matrices from Multiple High-Dimensional Data Classes

We consider the problem of jointly estimating multiple precision matrice...
research
04/06/2021

Generation of new exciting regressors for consistent on-line estimation for a scalar parameter

In this paper the problem of estimation of a single parameter from a lin...

Please sign up or login with your details

Forgot password? Click here to reset