Simultaneous sparse estimation of canonical vectors in the p>>N setting

03/24/2014
by   Irina Gaynanova, et al.
0

This article considers the problem of sparse estimation of canonical vectors in linear discriminant analysis when p≫ N. Several methods have been proposed in the literature that estimate one canonical vector in the two-group case. However, G-1 canonical vectors can be considered if the number of groups is G. In the multi-group context, it is common to estimate canonical vectors in a sequential fashion. Moreover, separate prior estimation of the covariance structure is often required. We propose a novel methodology for direct estimation of canonical vectors. In contrast to existing techniques, the proposed method estimates all canonical vectors at once, performs variable selection across all the vectors and comes with theoretical guarantees on the variable selection and classification consistency. First, we highlight the fact that in the N>p setting the canonical vectors can be expressed in a closed form up to an orthogonal transformation. Secondly, we propose an extension of this form to the p≫ N setting and achieve feature selection by using a group penalty. The resulting optimization problem is convex and can be solved using a block-coordinate descent algorithm. The practical performance of the method is evaluated through simulation studies as well as real data applications.

READ FULL TEXT
research
11/13/2017

Sparse quadratic classification rules via linear dimension reduction

We consider the problem of high-dimensional classification between the t...
research
10/28/2015

Canonical Divergence Analysis

We aim to analyze the relation between two random vectors that may poten...
research
10/29/2016

A general multiblock method for structured variable selection

Regularised canonical correlation analysis was recently extended to more...
research
11/23/2014

Optimal variable selection in multi-group sparse discriminant analysis

This article considers the problem of multi-group classification in the ...
research
08/07/2020

Grouping effects of sparse CCA models in variable selection

The sparse canonical correlation analysis (SCCA) is a bi-multivariate as...
research
09/25/2022

Simultaneous Estimation and Group Identification for Network Vector Autoregressive Model with Heterogeneous Nodes

We study the dynamic behaviors of heterogeneous individuals observed in ...
research
11/29/2017

Bayesian analysis of finite population sampling in multivariate co-exchangeable structures with separable covariance matric

We explore the effect of finite population sampling in design problems w...

Please sign up or login with your details

Forgot password? Click here to reset