Evaluating Predictive Distributions: Does Bayesian Deep Learning Work?

10/09/2021
by   Ian Osband, et al.
0

Posterior predictive distributions quantify uncertainties ignored by point estimates. This paper introduces The Neural Testbed, which provides tools for the systematic evaluation of agents that generate such predictions. Crucially, these tools assess not only the quality of marginal predictions per input, but also joint predictions given many inputs. Joint distributions are often critical for useful uncertainty quantification, but they have been largely overlooked by the Bayesian deep learning community. We benchmark several approaches to uncertainty estimation using a neural-network-based data generating process. Our results reveal the importance of evaluation beyond marginal predictions. Further, they reconcile sources of confusion in the field, such as why Bayesian deep learning approaches that generate accurate marginal predictions perform poorly in sequential decision tasks, how incorporating priors can be helpful, and what roles epistemic versus aleatoric uncertainty play when evaluating performance. We also present experiments on real-world challenge datasets, which show a high correlation with testbed results, and that the importance of evaluating joint predictive distributions carries over to real data. As part of this effort, we opensource The Neural Testbed, including all implementations from this paper.

READ FULL TEXT

page 6

page 7

research
11/26/2022

Looking at the posterior: on the origin of uncertainty in neural-network classification

Bayesian inference can quantify uncertainty in the predictions of neural...
research
02/28/2022

Evaluating High-Order Predictive Distributions in Deep Learning

Most work on supervised learning research has focused on marginal predic...
research
11/06/2020

Beyond Marginal Uncertainty: How Accurately can Bayesian Regression Models Estimate Posterior Predictive Correlations?

While uncertainty estimation is a well-studied topic in deep learning, m...
research
07/20/2021

Evaluating Probabilistic Inference in Deep Learning: Beyond Marginal Predictions

A fundamental challenge for any intelligent system is prediction: given ...
research
02/18/2023

Approximate Thompson Sampling via Epistemic Neural Networks

Thompson sampling (TS) is a popular heuristic for action selection, but ...
research
05/18/2022

Marginal and Joint Cross-Entropies Predictives for Online Bayesian Inference, Active Learning, and Active Sampling

Principled Bayesian deep learning (BDL) does not live up to its potentia...
research
07/19/2021

Epistemic Neural Networks

We introduce the epistemic neural network (ENN) as an interface for unce...

Please sign up or login with your details

Forgot password? Click here to reset