We propose a new algorithm for variance reduction when estimating f(X_T)...
For V : ℝ^d → ℝ coercive, we study the convergence rate for the L^1-dista...
Training a very deep neural network is a challenging task, as the deeper...
Stochastic Gradient Descent Langevin Dynamics (SGLD) algorithms, which a...