Using theoretical ROC curves for analysing machine learning binary classifiers

09/21/2019
by   Luma Omar, et al.
0

Most binary classifiers work by processing the input to produce a scalar response and comparing it to a threshold value. The various measures of classifier performance assume, explicitly or implicitly, probability distributions P_s and P_n of the response belonging to either class, probability distributions for the cost of each type of misclassification, and compute a performance score from the expected cost. In machine learning, classifier responses are obtained experimentally and performance scores are computed directly from them, without any assumptions on P_s and P_n. Here, we argue that the omitted step of estimating theoretical distributions for P_s and P_n can be useful. In a biometric security example, we fit beta distributions to the responses of two classifiers, one based on logistic regression and one on ANNs, and use them to establish a categorisation into a small number of classes with different extremal behaviours at the ends of the ROC curves.

READ FULL TEXT
research
10/08/2012

Semisupervised Classifier Evaluation and Recalibration

How many labeled examples are needed to estimate a classifier's performa...
research
06/18/2012

Predicting accurate probabilities with a ranking loss

In many real-world applications of machine learning classifiers, it is e...
research
01/05/2018

Equivalences between learning of data and probability distributions, and their applications

Algorithmic learning theory traditionally studies the learnability of ef...
research
01/05/2018

An equivalence between learning of data and probability distributions, and some applications

Algorithmic learning theory traditionally studies the learnability of ef...
research
01/18/2016

Domain based classification

The majority of traditional classification ru les minimizing the expecte...
research
06/21/2020

Equivalence of several curves assessing the similarity between probability distributions

The recent advent of powerful generative models has triggered the renewe...
research
02/09/2021

Classifier Calibration: with implications to threat scores in cybersecurity

This paper explores the calibration of a classifier output score in bina...

Please sign up or login with your details

Forgot password? Click here to reset