Don't guess what's true: choose what's optimal. A probability transducer for machine-learning classifiers

02/21/2023
by   K. Dyrland, et al.
0

In fields such as medicine and drug discovery, the ultimate goal of a classification is not to guess a class, but to choose the optimal course of action among a set of possible ones, usually not in one-one correspondence with the set of classes. This decision-theoretic problem requires sensible probabilities for the classes. Probabilities conditional on the features are computationally almost impossible to find in many important cases. The main idea of the present work is to calculate probabilities conditional not on the features, but on the trained classifier's output. This calculation is cheap, needs to be made only once, and provides an output-to-probability "transducer" that can be applied to all future outputs of the classifier. In conjunction with problem-dependent utilities, the probabilities of the transducer allow us to find the optimal choice among the classes or among a set of more general decisions, by means of expected-utility maximization. This idea is demonstrated in a simplified drug-discovery problem with a highly imbalanced dataset. The transducer and utility maximization together always lead to improved results, sometimes close to theoretical maximum, for all sets of problem-dependent utilities. The one-time-only calculation of the transducer also provides, automatically: (i) a quantification of the uncertainty about the transducer itself; (ii) the expected utility of the augmented algorithm (including its uncertainty), which can be used for algorithm selection; (iii) the possibility of using the algorithm in a "generative mode", useful if the training dataset is biased.

READ FULL TEXT

page 18

page 28

research
06/19/2019

Efficient Algorithms for Set-Valued Prediction in Multi-Class Classification

In cases of uncertainty, a multi-class classifier preferably returns a s...
research
07/09/2018

Decision making under uncertainty using imprecise probabilities

Various ways for decision making with imprecise probabilities (admissibi...
research
05/07/2015

Optimal Decision-Theoretic Classification Using Non-Decomposable Performance Metrics

We provide a general theoretical analysis of expected out-of-sample util...
research
04/01/2022

DBCal: Density Based Calibration of classifier predictions for uncertainty quantification

Measurement of uncertainty of predictions from machine learning methods ...
research
07/11/2012

Pre-Selection of Independent Binary Features: An Application to Diagnosing Scrapie in Sheep

Suppose that the only available information in a multi-class problem are...
research
12/21/2017

DropMax: Adaptive Stochastic Softmax

We propose DropMax, a stochastic version of softmax classifier which at ...
research
09/13/2021

Specified Certainty Classification, with Application to Read Classification for Reference-Guided Metagenomic Assembly

Specified Certainty Classification (SCC) is a new paradigm for employing...

Please sign up or login with your details

Forgot password? Click here to reset