LIUBoost : Locality Informed Underboosting for Imbalanced Data Classification

11/15/2017
by   Sajid Ahmed, et al.
0

The problem of class imbalance along with class-overlapping has become a major issue in the domain of supervised learning. Most supervised learning algorithms assume equal cardinality of the classes under consideration while optimizing the cost function and this assumption does not hold true for imbalanced datasets which results in sub-optimal classification. Therefore, various approaches, such as undersampling, oversampling, cost-sensitive learning and ensemble based methods have been proposed for dealing with imbalanced datasets. However, undersampling suffers from information loss, oversampling suffers from increased runtime and potential overfitting while cost-sensitive methods suffer due to inadequately defined cost assignment schemes. In this paper, we propose a novel boosting based method called LIUBoost. LIUBoost uses under sampling for balancing the datasets in every boosting iteration like RUSBoost while incorporating a cost term for every instance based on their hardness into the weight update formula minimizing the information loss introduced by undersampling. LIUBoost has been extensively evaluated on 18 imbalanced datasets and the results indicate significant improvement over existing best performing method RUSBoost.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
12/18/2017

MEBoost: Mixing Estimators with Boosting for Imbalanced Data Classification

Class imbalance problem has been a challenging research problem in the f...
research
09/17/2022

AdaCC: Cumulative Cost-Sensitive Boosting for Imbalanced Classification

Class imbalance poses a major challenge for machine learning as most sup...
research
09/08/2019

Training Effective Ensemble on Imbalanced Data by Self-paced Harmonizing Classification Hardness

Many real-world applications reveal difficulties in learning classifiers...
research
09/08/2019

Self-paced Ensemble for Highly Imbalanced Massive Data Classification

Many real-world applications reveal difficulties in learning classifiers...
research
09/11/2019

Factorized MultiClass Boosting

In this paper, we introduce a new approach to multiclass classification ...
research
04/28/2018

A Cost-Sensitive Deep Belief Network for Imbalanced Classification

Imbalanced data with a skewed class distribution are common in many real...
research
06/05/2017

Progressive Boosting for Class Imbalance

Pattern recognition applications often suffer from skewed data distribut...

Please sign up or login with your details

Forgot password? Click here to reset