Interaction pursuit in high-dimensional multi-response regression via distance correlation

05/11/2016
by   Yinfei Kong, et al.
0

Feature interactions can contribute to a large proportion of variation in many prediction models. In the era of big data, the coexistence of high dimensionality in both responses and covariates poses unprecedented challenges in identifying important interactions. In this paper, we suggest a two-stage interaction identification method, called the interaction pursuit via distance correlation (IPDC), in the setting of high-dimensional multi-response interaction models that exploits feature screening applied to transformed variables with distance correlation followed by feature selection. Such a procedure is computationally efficient, generally applicable beyond the heredity assumption, and effective even when the number of responses diverges with the sample size. Under mild regularity conditions, we show that this method enjoys nice theoretical properties including the sure screening property, support union recovery, and oracle inequalities in prediction and estimation for both interactions and main effects. The advantages of our method are supported by several simulation studies and real data analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/28/2016

Interaction Pursuit with Feature Screening and Selection

Understanding how features interact with each other is of paramount impo...
research
01/05/2015

Innovated interaction screening for high-dimensional nonlinear classification

This paper is concerned with the problems of interaction screening and n...
research
04/11/2021

Parallel integrative learning for large-scale multi-response regression with incomplete outcomes

Multi-task learning is increasingly used to investigate the association ...
research
03/09/2019

Distributed Feature Screening via Componentwise Debiasing

Feature screening is a powerful tool in the analysis of high dimensional...
research
06/13/2016

Tuning-Free Heterogeneity Pursuit in Massive Networks

Heterogeneity is often natural in many contemporary applications involvi...
research
08/13/2022

A sequential stepwise screening procedure for sparse recovery in high-dimensional multiresponse models with complex group structures

Multiresponse data with complex group structures in both responses and p...
research
05/26/2022

Factor selection in screening experiments by aggregation over random models

Screening experiments are useful for screening out a small number of tru...

Please sign up or login with your details

Forgot password? Click here to reset