Adaptive Ensemble Learning of Spatiotemporal Processes with Calibrated Predictive Uncertainty: A Bayesian Nonparametric Approach

04/01/2019
by Jeremiah Zhe Liu, et al.

Ensemble learning is a mainstay in modern data science practice. Conventional ensemble algorithms assign to base models a set of deterministic, constant model weights that (1) do not fully account for individual models' varying accuracy across data subgroups, and (2) do not provide uncertainty estimates for the ensemble prediction. These shortcomings can yield predictions that are precise but biased, which can negatively impact the performance of the algorithm in real-world applications. In this work, we present an adaptive, probabilistic approach to ensemble learning using a transformed Gaussian process as a prior for the ensemble weights. Given input features, our method optimally combines base models based on their predictive accuracy in the feature space, and provides interpretable estimates of the uncertainty associated with both model selection, as reflected by the ensemble weights, and the overall ensemble predictions. Furthermore, to ensure that this quantification of the model uncertainty is accurate, we propose additional machinery to non-parametrically model the ensemble's predictive cumulative distribution function (CDF) so that it is consistent with the empirical distribution of the data. We apply the proposed method to data simulated from a nonlinear regression model, and use it to build a spatial prediction model, with associated prediction uncertainties, for fine particle levels in eastern Massachusetts, USA.
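To make the idea of input-dependent ensemble weights concrete, the following is a minimal sketch, not the authors' implementation. It assumes a softmax as the transformation applied to the Gaussian process draws (the abstract does not name the transformation), a squared-exponential kernel, one-dimensional inputs, and three toy base models. All function names are hypothetical, and the weights are sampled from the prior only, with no fitting to data.

```python
import numpy as np

def rbf_kernel(x, length_scale=1.0, variance=1.0):
    """Squared-exponential covariance matrix for 1-D inputs x (assumed kernel)."""
    diff = x[:, None] - x[None, :]
    return variance * np.exp(-0.5 * (diff / length_scale) ** 2)

def sample_adaptive_weights(x, n_models, rng, length_scale=1.0):
    """Draw one set of input-dependent ensemble weights from the prior.

    One latent function per base model is drawn from a GP prior; a softmax
    across models (an assumption of this sketch) then yields weights that are
    positive and sum to one at every input location.
    """
    n = x.shape[0]
    cov = rbf_kernel(x, length_scale) + 1e-6 * np.eye(n)  # jitter for stability
    chol = np.linalg.cholesky(cov)
    latent = chol @ rng.standard_normal((n, n_models))    # shape (n, n_models)
    latent = latent - latent.max(axis=1, keepdims=True)   # stabilise the softmax
    weights = np.exp(latent)
    return weights / weights.sum(axis=1, keepdims=True)

def ensemble_predict(base_preds, weights):
    """Combine base-model predictions with per-input weights."""
    return np.sum(weights * base_preds, axis=1)

rng = np.random.default_rng(0)
x = np.linspace(0.0, 5.0, 50)

# Three toy base models evaluated at x, shape (n, n_models).
base_preds = np.column_stack([np.sin(x), 0.5 * x, np.ones_like(x)])

# Each prior draw of the weights gives one ensemble prediction curve; the
# spread across draws reflects uncertainty about which model to trust where.
weight_draws = [sample_adaptive_weights(x, 3, rng) for _ in range(200)]
pred_draws = np.stack([ensemble_predict(base_preds, w) for w in weight_draws])

pred_mean = pred_draws.mean(axis=0)
pred_lower, pred_upper = np.percentile(pred_draws, [2.5, 97.5], axis=0)
```

Because the weights vary with `x`, a base model can dominate in one region of the feature space and be down-weighted in another. The interval between `pred_lower` and `pred_upper` here reflects prior uncertainty in the weights only; the calibration step described in the abstract is not part of this sketch.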

research
12/08/2018

Adaptive and Calibrated Ensemble Learning with Dependent Tail-free Process

Ensemble learning is a mainstay in modern data science practice. Convent...
research
11/09/2022

REDS: Random Ensemble Deep Spatial prediction

There has been a great deal of recent interest in the development of spa...
research
03/10/2022

Spatially-Varying Bayesian Predictive Synthesis for Flexible and Interpretable Spatial Prediction

Spatial data are characterized by their spatial dependence, which is oft...
research
03/30/2021

Scalable Statistical Inference of Photometric Redshift via Data Subsampling

Handling big data has largely been a major bottleneck in traditional sta...
research
11/11/2019

Accurate Uncertainty Estimation and Decomposition in Ensemble Learning

Ensemble learning is a standard approach to building machine learning sy...
research
07/11/2022

RRMSE Voting Regressor: A weighting function based improvement to ensemble regression

This paper describes the RRMSE (Relative Root Mean Square Error) based w...
research
05/23/2019

Hierarchical Multimodel Ensemble Estimates of Soil Water Retention with Global Coverage

A correct quantification of mass and energy exchange processes among lan...