Isotropic SGD: a Practical Approach to Bayesian Posterior Sampling

06/09/2020
by   Giulio Franzese, et al.
0

In this work we define a unified mathematical framework to deepen our understanding of the role of stochastic gradient (SG) noise on the behavior of Markov chain Monte Carlo sampling (SGMCMC) algorithms. Our formulation unlocks the design of a novel, practical approach to posterior sampling, which makes the SG noise isotropic using a fixed learning rate that we determine analytically, and that requires weaker assumptions than existing algorithms. In contrast, the common traits of existing algorithms is to approximate the isotropy condition either by drowning the gradients in additive noise (annealing the learning rate) or by making restrictive assumptions on the noise covariance and the geometry of the loss landscape. Extensive experimental validations indicate that our proposal is competitive with the state-of-the-art on , while being much more practical to use.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
10/29/2015

Covariance-Controlled Adaptive Langevin Thermostat for Large-Scale Bayesian Sampling

Monte Carlo sampling for Bayesian posterior inference is a common approa...
research
05/22/2018

Langevin Markov Chain Monte Carlo with stochastic gradients

Monte Carlo sampling techniques have broad applications in machine learn...
research
04/13/2017

Stochastic Gradient Descent as Approximate Bayesian Inference

Stochastic Gradient Descent with a constant learning rate (constant SGD)...
research
04/27/2015

Fast Sampling for Bayesian Max-Margin Models

Bayesian max-margin models have shown superiority in various practical a...
research
11/04/2020

Direction Matters: On the Implicit Regularization Effect of Stochastic Gradient Descent with Moderate Learning Rate

Understanding the algorithmic regularization effect of stochastic gradie...
research
02/13/2020

Stochastic Approximate Gradient Descent via the Langevin Algorithm

We introduce a novel and efficient algorithm called the stochastic appro...
research
09/29/2021

Position-free Multiple-bounce Computations for Smith Microfacet BSDFs

Bidirectional Scattering Distribution Functions (BSDFs) encode how a mat...

Please sign up or login with your details

Forgot password? Click here to reset