Impact of Parameter Sparsity on Stochastic Gradient MCMC Methods for Bayesian Deep Learning

02/08/2022
by   Meet P. Vadera, et al.
0

Bayesian methods hold significant promise for improving the uncertainty quantification ability and robustness of deep neural network models. Recent research has seen the investigation of a number of approximate Bayesian inference methods for deep neural networks, building on both the variational Bayesian and Markov chain Monte Carlo (MCMC) frameworks. A fundamental issue with MCMC methods is that the improvements they enable are obtained at the expense of increased computation time and model storage costs. In this paper, we investigate the potential of sparse network structures to flexibly trade-off model storage costs and inference run time against predictive performance and uncertainty quantification ability. We use stochastic gradient MCMC methods as the core Bayesian inference method and consider a variety of approaches for selecting sparse network structures. Surprisingly, our results show that certain classes of randomly selected substructures can perform as well as substructures derived from state-of-the-art iterative pruning methods while drastically reducing model training times.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/24/2022

On the optimization and pruning for Bayesian deep learning

The goal of Bayesian deep learning is to provide uncertainty quantificat...
research
04/02/2023

Bayesian neural networks via MCMC: a Python-based tutorial

Bayesian inference provides a methodology for parameter estimation and u...
research
12/15/2022

Scalable Bayesian Uncertainty Quantification for Neural Network Potentials: Promise and Pitfalls

Neural network (NN) potentials promise highly accurate molecular dynamic...
research
02/07/2020

Assessing the Adversarial Robustness of Monte Carlo and Distillation Methods for Deep Bayesian Neural Network Classification

In this paper, we consider the problem of assessing the adversarial robu...
research
04/17/2021

Bayesian graph convolutional neural networks via tempered MCMC

Deep learning models, such as convolutional neural networks, have long b...
research
04/13/2021

Revisiting Bayesian Autoencoders with MCMC

Autoencoders gained popularity in the deep learning revolution given the...
research
11/13/2017

Uncertainty quantification for radio interferometric imaging: II. MAP estimation

Uncertainty quantification is a critical missing component in radio inte...

Please sign up or login with your details

Forgot password? Click here to reset