Visual-Based Analysis of Classification Measures with Applications to Imbalanced Data

04/24/2017
by   Dariusz Brzezinski, et al.
0

With a plethora of available classification performance measures, choosing the right metric for the right task requires careful thought. To make this decision in an informed manner, one should study and compare general properties of candidate measures. However, analysing measures with respect to complete ranges of their domain values is a difficult and challenging task. In this study, we attempt to support such analyses with a specialized visualization technique, which operates in a barycentric coordinate system using a 3D tetrahedron. Additionally, we adapt this technique to the context of imbalanced data and put forward a set of properties which should be taken into account when selecting a classification performance measure. As a result, we compare 22 popular measures and show important differences in their behaviour. Moreover, for parametric measures such as the F_β and IBA_α(G-mean), we analytically derive parameter thresholds that change measure properties. Finally, we provide an online visualization tool that can aid the analysis of complete domain ranges of performance measures.

READ FULL TEXT
research
06/23/2020

Classification Performance Metric for Imbalance Data Based on Recall and Selectivity Normalized in Class Labels

In the classification of a class imbalance dataset, the performance meas...
research
12/09/2021

A Note on Comparison of F-measures

We comment on a recent TKDE paper "Linear Approximation of F-measure for...
research
01/22/2022

Good Classification Measures and How to Find Them

Several performance measures can be used for evaluating classification r...
research
09/30/2018

Specificity measures and reference

In this paper we study empirically the validity of measures of referenti...
research
12/14/2017

Rate of Change Analysis for Interestingness Measures

The use of Association Rule Mining techniques in diverse contexts and do...
research
11/10/2022

A classification performance evaluation measure considering data separability

Machine learning and deep learning classification models are data-driven...

Please sign up or login with your details

Forgot password? Click here to reset