View selection in multi-view stacking: Choosing the meta-learner

10/30/2020
by   Wouter van Loon, et al.
0

Multi-view stacking is a framework for combining information from different views (i.e. different feature sets) describing the same set of objects. In this framework, a base-learner algorithm is trained on each view separately, and their predictions are then combined by a meta-learner algorithm. In a previous study, stacked penalized logistic regression, a special case of multi-view stacking, has been shown to be useful in identifying which views are most important for prediction. In this article we expand this research by considering seven different algorithms to use as the meta-learner, and evaluating their view selection and classification performance in simulations and two applications on real gene-expression data sets. Our results suggest that if both view selection and classification accuracy are important to the research at hand, then the nonnegative lasso, nonnegative adaptive lasso and nonnegative elastic net are suitable meta-learners. Exactly which among these three is to be preferred depends on the research context. The remaining four meta-learners, namely nonnegative ridge regression, nonnegative forward selection, stability selection and the interpolating predictor, show little advantages in order to be preferred over the other three.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
11/06/2018

Stacked Penalized Logistic Regression for Selecting Views in Multi-View Learning

In multi-view learning, features are organized into multiple sets called...
research
08/12/2021

Analyzing hierarchical multi-view MRI data with StaPLR: An application to Alzheimer's disease classification

Multi-view data refers to a setting where features are divided into feat...
research
03/11/2023

MetaViewer: Towards A Unified Multi-View Representation

Existing multi-view representation learning methods typically follow a s...
research
03/27/2019

Feature Selection for Data Integration with Mixed Multi-view Data

Data integration methods that analyze multiple sources of data simultane...
research
10/25/2021

Integrative Clustering of Multi-View Data by Nonnegative Matrix Factorization

Learning multi-view data is an emerging problem in machine learning rese...
research
08/02/2021

A Framework for Multi-View Classification of Features

One of the most important problems in the field of pattern recognition i...
research
04/03/2020

Stacked Generalizations in Imbalanced Fraud Data Sets using Resampling Methods

This study uses stacked generalization, which is a two-step process of c...

Please sign up or login with your details

Forgot password? Click here to reset