Modular regression

11/18/2022
by   Ying Jin, et al.
0

This paper develops a new framework, called modular regression, to utilize auxiliary information – such as variables other than the original features or additional data sets – in the training process of linear models. At a high level, our method follows the routine: (i) decomposing the regression task into several sub-tasks, (ii) fitting the sub-task models, and (iii) using the sub-task models to provide an improved estimate for the original regression problem. This routine applies to widely-used low-dimensional (generalized) linear models and high-dimensional regularized linear regression. It also naturally extends to missing-data settings where only partial observations are available. By incorporating auxiliary information, our approach improves the estimation efficiency and prediction accuracy compared to linear regression or the Lasso under a conditional independence assumption. For high-dimensional settings, we develop an extension of our procedure that is robust to violations of the conditional independence assumption, in the sense that it improves efficiency if this assumption holds and coincides with the Lasso otherwise. We demonstrate the efficacy of our methods with both simulated and real data sets.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
08/26/2022

High-dimensional sparse vine copula regression with application to genomic prediction

High-dimensional data sets are often available in genome-enabled predict...
research
10/13/2020

Spike-and-Slab Meets LASSO: A Review of the Spike-and-Slab LASSO

High-dimensional data sets have become ubiquitous in the past few decade...
research
12/06/2014

A Likelihood Ratio Framework for High Dimensional Semiparametric Regression

We propose a likelihood ratio based inferential framework for high dimen...
research
06/18/2020

Transfer Learning for High-dimensional Linear Regression: Prediction, Estimation, and Minimax Optimality

This paper considers the estimation and prediction of a high-dimensional...
research
07/03/2017

Regression Phalanxes

Tomal et al. (2015) introduced the notion of "phalanxes" in the context ...
research
12/10/2017

Ensembles of Regularized Linear Models

We propose an approach for building ensembles of regularized linear mode...
research
09/23/2021

High-dimensional regression with potential prior information on variable importance

There are a variety of settings where vague prior information may be ava...

Please sign up or login with your details

Forgot password? Click here to reset