Systematic Ensemble Learning for Regression

03/28/2014
by Roberto Aldave, et al.

This work aims to improve the performance of standard stacking approaches or ensembles, which are composed of simple, heterogeneous base models, by integrating the generation and selection stages for regression problems. We propose two extensions to the standard stacking approach. The first extension combines a set of standard stacking approaches into an ensemble of ensembles using two-step ensemble learning in the regression setting. The second extension consists of two parts. In the first part, a diversity mechanism is injected into the original training data set, systematically generating different training subsets or partitions and the corresponding ensembles of ensembles. In the second part, after the quality of the different partitions or ensembles has been measured, a max-min rule-based selection algorithm selects the most appropriate ensemble/partition on which to make the final prediction. Experiments over a broad range of data sets show that the second extension performs better than the best of the standard stacking approaches and is as good as the oracle of databases, which has the best base model selected by cross-validation for each data set. In addition, the second extension performs better than two state-of-the-art ensemble methods for regression and is as good as a third.
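The abstract does not spell out the max-min selection rule, so the following is only a minimal sketch of a generic max-min (maximin) criterion, not the authors' algorithm. It assumes each partition's ensemble has been scored by several quality measures where higher is better, and it picks the partition whose worst score is highest. The function name `max_min_select`, the partition names, and the scores are all hypothetical.

```python
from typing import Dict, List


def max_min_select(quality: Dict[str, List[float]]) -> str:
    """Return the candidate whose lowest quality score is the highest.

    `quality` maps a candidate name (here: a training partition and its
    ensemble) to its quality scores. Higher scores are assumed to be better.
    """
    if not quality:
        raise ValueError("no candidates to select from")
    if any(len(scores) == 0 for scores in quality.values()):
        raise ValueError("every candidate needs at least one score")
    # Max over candidates of the min over that candidate's scores.
    return max(quality, key=lambda name: min(quality[name]))


# Hypothetical quality scores for three partitions (illustration only).
quality = {
    "partition_a": [0.82, 0.40, 0.77],
    "partition_b": [0.65, 0.61, 0.63],
    "partition_c": [0.90, 0.35, 0.88],
}

selected = max_min_select(quality)
print(selected)
```

Under this rule a partition with uniformly moderate scores can be preferred over one with a high average but a single poor score, which is one plausible reason to use a max-min criterion when the selected ensemble must make the final prediction alone.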


research
08/14/2019

Optimizing Ensemble Weights and Hyperparameters of Machine Learning Models for Regression Problems

Aggregating multiple learners through an ensemble of models aims to make...
research
02/04/2014

Sequential Model-Based Ensemble Optimization

One of the most tedious tasks in the application of machine learning is ...
research
06/07/2019

Ensemble Pruning via Margin Maximization

Ensemble models refer to methods that combine a typically large number o...
research
07/29/2017

KNN Ensembles for Tweedie Regression: The Power of Multiscale Neighborhoods

Very few K-nearest-neighbor (KNN) ensembles exist, despite the efficacy ...
research
07/05/2021

Unsupervised Ensemble Selection for Multilayer Bootstrap Networks

Multilayer bootstrap network (MBN), which is a recent simple unsupervise...
research
02/21/2018

Pooling homogeneous ensembles to build heterogeneous ensembles

In ensemble methods, the outputs of a collection of diverse classifiers ...
research
12/29/2020

Twin Neural Network Regression

We introduce twin neural network (TNN) regression. This method predicts ...
