Combining Predictions under Uncertainty: The Case of Random Decision Trees

08/15/2022
by   Florian Busch, et al.

A common approach to aggregating classification estimates in an ensemble of decision trees is either to use voting or to average the probabilities for each class. The latter takes uncertainty into account, but not the reliability of the uncertainty estimates (so to speak, the "uncertainty about the uncertainty"). More generally, much remains unknown about how best to combine probabilistic estimates from multiple sources. In this paper, we investigate a number of alternative prediction methods. Our methods are inspired by the theories of probability, belief functions, and reliable classification, as well as by a principle that we call evidence accumulation. Our experiments on a variety of data sets are based on random decision trees, which guarantee a high diversity in the predictions to be combined. Somewhat unexpectedly, we found that taking the average over the probabilities is actually hard to beat. However, evidence accumulation showed consistently better results on all but very small leaves.
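
To make the two baseline aggregation schemes named above concrete, the following is a minimal sketch in Python. It is not taken from the paper: the function names `majority_vote` and `average_probabilities` and the leaf estimates in `tree_probs` are hypothetical, chosen only to show how voting and probability averaging can disagree on the same inputs. The alternative methods studied in the paper, including evidence accumulation, are not reproduced here.

```python
from collections import Counter


def majority_vote(tree_probs):
    """Each tree votes for its most probable class; return the most voted class."""
    votes = [max(range(len(p)), key=p.__getitem__) for p in tree_probs]
    return Counter(votes).most_common(1)[0][0]


def average_probabilities(tree_probs):
    """Return the per-class mean of the trees' probability estimates."""
    n_trees = len(tree_probs)
    n_classes = len(tree_probs[0])
    return [sum(p[c] for p in tree_probs) / n_trees for c in range(n_classes)]


# Hypothetical leaf estimates from three trees for a two-class problem
# (illustrative values only, not data from the paper).
tree_probs = [
    [0.55, 0.45],  # tree 1 weakly prefers class 0
    [0.55, 0.45],  # tree 2 weakly prefers class 0
    [0.05, 0.95],  # tree 3 strongly prefers class 1
]

vote_winner = majority_vote(tree_probs)
avg = average_probabilities(tree_probs)
avg_winner = max(range(len(avg)), key=avg.__getitem__)

print("voting picks class", vote_winner)
print("averaging picks class", avg_winner, "with probabilities", avg)
```

In this toy input, two weakly confident trees outvote one strongly confident tree under voting, while averaging lets the confident tree dominate. Neither scheme asks how reliable each tree's estimate is, which is the gap the abstract describes.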

research
03/27/2013

Multiple decision trees

This paper describes experiments, on two domains, to investigate the eff...
research
06/23/2022

Indecision Trees: Learning Argument-Based Reasoning under Quantified Uncertainty

Using Machine Learning systems in the real world can often be problemati...
research
07/11/2012

MOB-ESP and other Improvements in Probability Estimation

A key prerequisite to optimal reasoning under uncertainty in intelligent...
research
01/08/2008

Imprecise probability trees: Bridging two theories of imprecise probability

We give an overview of two approaches to probability theory where lower ...
research
03/03/2021

Combining Prediction and Interpretation in Decision Trees (PrInDT) – a Linguistic Example

In this paper, we show that conditional inference trees and ensembles ar...
research
07/15/2021

Multi-label Chaining with Imprecise Probabilities

We present two different strategies to extend the classical multi-label ...
research
06/28/2016

Reviving Threshold-Moving: a Simple Plug-in Bagging Ensemble for Binary and Multiclass Imbalanced Data

Class imbalance presents a major hurdle in the application of data minin...
