On-the-Fly Ensemble Pruning in Evolving Data Streams

09/15/2021
by   Sanem Elbasi, et al.
0

Ensemble pruning is the process of selecting a subset of componentclassifiers from an ensemble which performs at least as well as theoriginal ensemble while reducing storage and computational costs.Ensemble pruning in data streams is a largely unexplored area ofresearch. It requires analysis of ensemble components as they arerunning on the stream, and differentiation of useful classifiers fromredundant ones. We present CCRP, an on-the-fly ensemble prun-ing method for multi-class data stream classification empoweredby an imbalance-aware fusion of class-wise component rankings.CCRP aims that the resulting pruned ensemble contains the bestperforming classifier for each target class and hence, reduces the ef-fects of class imbalance. The conducted experiments on real-worldand synthetic data streams demonstrate that different types of en-sembles that integrate CCRP as their pruning scheme consistentlyyield on par or superior performance with 20 the proposed pruningscheme by comparing our approach against pruning schemes basedon ensemble weights and basic rank fusion methods.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
02/21/2023

Classy Ensemble: A Novel Ensemble Algorithm for Classification

We present Classy Ensemble, a novel ensemble-generation algorithm for cl...
research
06/13/2018

Ensemble Pruning based on Objection Maximization with a General Distributed Framework

Ensemble pruning, selecting a subset of individual learners from an orig...
research
09/17/2023

Imbalanced Data Stream Classification using Dynamic Ensemble Selection

Modern streaming data categorization faces significant challenges from c...
research
01/30/2021

Hellinger Distance Weighted Ensemble for Imbalanced Data Stream Classification

The imbalanced data classification remains a vital problem. The key is t...
research
04/07/2022

A survey on learning from imbalanced data streams: taxonomy, challenges, empirical study, and reproducible experimental framework

Class imbalance poses new challenges when it comes to classifying data s...
research
01/15/2022

Weighting and Pruning based Ensemble Deep Random Vector Functional Link Network for Tabular Data Classification

In this paper, we first introduce batch normalization to the edRVFL netw...
research
10/15/2018

Unsupervised Ensemble Learning via Ising Model Approximation with Application to Phenotyping Prediction

Unsupervised ensemble learning has long been an interesting yet challeng...

Please sign up or login with your details

Forgot password? Click here to reset