Interpretable Machines: Constructing Valid Prediction Intervals with Random Forests

03/09/2021
by Burim Ramosaj, et al.

A major issue when using Machine Learning algorithms in current research is the lack of interpretability. Although these algorithms provide accurate point predictions for various learning problems, uncertainty estimates accompanying those point predictions are rarely available. This work addresses that gap for the Random Forest regression learner. Based on its Out-of-Bag procedure, several parametric and non-parametric prediction intervals are provided for Random Forest point predictions, and theoretical guarantees for their correct coverage probability are delivered. In a second part, a thorough Monte-Carlo simulation study evaluates the performance of the proposed methods from three perspectives: (i) analyzing the coverage rate of the proposed prediction intervals, (ii) inspecting interval width, and (iii) verifying the competitiveness of the proposed intervals with existing methods. The simulation shows that the proposed prediction intervals are robust to non-normal residual distributions and are competitive, providing correct coverage rates and comparably narrow intervals even for comparably small samples.
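To make the idea of Out-of-Bag based intervals concrete, the following is a minimal sketch, not the authors' exact construction (the abstract does not spell it out). It assumes the Out-of-Bag residuals (observed response minus Out-of-Bag prediction) have already been computed by some Random Forest implementation. The function names `oob_quantile_interval` and `oob_normal_interval` are hypothetical and chosen here for illustration only.

```python
import math
from statistics import NormalDist


def empirical_quantile(sorted_vals, q):
    """Linear-interpolation quantile of an already sorted list, 0 <= q <= 1."""
    pos = q * (len(sorted_vals) - 1)
    lo, hi = math.floor(pos), math.ceil(pos)
    frac = pos - lo
    return sorted_vals[lo] * (1 - frac) + sorted_vals[hi] * frac


def oob_quantile_interval(point_pred, oob_residuals, alpha=0.05):
    """Non-parametric sketch: shift the point prediction by the
    alpha/2 and 1 - alpha/2 empirical quantiles of the OOB residuals."""
    r = sorted(oob_residuals)
    return (point_pred + empirical_quantile(r, alpha / 2),
            point_pred + empirical_quantile(r, 1 - alpha / 2))


def oob_normal_interval(point_pred, oob_residuals, alpha=0.05):
    """Parametric sketch: assumes roughly normal, zero-mean residuals and
    uses the OOB root mean squared error as the residual scale (assumption)."""
    z = NormalDist().inv_cdf(1 - alpha / 2)
    rmse = math.sqrt(sum(e * e for e in oob_residuals) / len(oob_residuals))
    return (point_pred - z * rmse, point_pred + z * rmse)


if __name__ == "__main__":
    # Hypothetical OOB residuals and point prediction, for illustration only.
    residuals = [-2.0, -1.0, 0.0, 1.0, 2.0]
    print(oob_quantile_interval(10.0, residuals, alpha=0.5))
    print(oob_normal_interval(10.0, residuals, alpha=0.05))
```

The quantile version makes no distributional assumption about the residuals, which is the kind of property the abstract refers to when it mentions robustness to non-normal residual distributions. The normal version is narrower to compute but depends on the normality assumption stated in its docstring.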

research
06/15/2021

RFpredInterval: An R Package for Prediction Intervals with Random Forests and Boosted Forests

Like many predictive models, random forests provide a point prediction f...
research
06/11/2023

Well-Calibrated Probabilistic Predictive Maintenance using Venn-Abers

When using machine learning for fault detection, a common problem is the...
research
04/24/2020

Bayesian Non-parametric Bragg-edge Fitting for Neutron Transmission Strain Imaging

Energy resolved neutron transmission techniques can provide high-resolut...
research
10/07/2022

Constructing Prediction Intervals with Neural Networks: An Empirical Evaluation of Bootstrapping and Conformal Inference Methods

Artificial neural networks (ANNs) are popular tools for accomplishing ma...
research
07/11/2022

On Exact and Efficient Inference for Many Normal Means

Inference about the unknown means θ=(θ_1,...,θ_n)' ∈ℝ^n in the sampling ...
research
01/22/2018

Optimizing Prediction Intervals by Tuning Random Forest via Meta-Validation

Recent studies have shown that tuning prediction models increases predic...
research
01/05/2021

Online Multivalid Learning: Means, Moments, and Prediction Intervals

We present a general, efficient technique for providing contextual predi...