A Cluster Fusion Penalty for Grouping Response Variables in Multivariate Regression Models

07/12/2017
by   Bradley S. Price, et al.
0

We propose a method for estimating coefficients in multivariate regression when there is a clustering structure to the response variables. The proposed method includes a fusion penalty, to shrink the difference in fitted values from responses in the same cluster, and an L1 penalty for simultaneous variable selection and estimation. The method can be used when the grouping structure of the response variables is known or unknown. When the clustering structure is unknown the method will simultaneously estimate the clusters of the response and the regression coefficients. Theoretical results are presented for the penalized least squares case, including asymptotic results allowing for p >> n. We extend our method to the setting where the responses are binomial variables. We propose a coordinate descent algorithm for both the normal and binomial likelihood, which can easily be extended to other generalized linear model (GLM) settings. Simulations and data examples from business operations and genomics are presented to show the merits of both the least squares and binomial methods.

READ FULL TEXT

page 20

page 22

page 24

page 25

research
09/27/2022

Wilcoxon-type Multivariate Cluster Elastic Net

We propose a method for high dimensional multivariate regression that is...
research
04/29/2018

Simultaneous Parameter Learning and Bi-Clustering for Multi-Response Models

We consider multi-response and multitask regression models, where the pa...
research
06/09/2021

On the Use of Minimum Penalties in Statistical Learning

Modern multivariate machine learning and statistical methodologies estim...
research
02/28/2020

Modelling High-Dimensional Categorical Data Using Nonconvex Fusion Penalties

We propose a method for estimation in high-dimensional linear models wit...
research
12/05/2018

Least absolute deviations uncertain regression with imprecise observations

Traditionally regression analysis answers questions about the relationsh...
research
12/15/2017

Fast algorithms for fitting L_1-penalized multivariate linear models to structured high-throughput data

We present fast methods for fitting sparse multivariate linear models to...
research
12/17/2021

Supervised Multivariate Learning with Simultaneous Feature Auto-grouping and Dimension Reduction

Modern high-dimensional methods often adopt the "bet on sparsity" princi...

Please sign up or login with your details

Forgot password? Click here to reset