research ∙ 01/28/2022
Interplay between depth of neural networks and locality of target functions
It has been recognized that heavily overparameterized deep neural networ...
research ∙ 05/20/2021
Logarithmic landscape and power-law escape rate of SGD
Stochastic gradient descent (SGD) undergoes complicated multiplicative n...
research ∙ 02/10/2021
On Minibatch Noise: Discrete-Time SGD, Overparametrization, and Bayes
The noise in stochastic gradient descent (SGD), caused by minibatch samp...
research ∙ 09/28/2020
Improved generalization by noise enhancement
Recent studies have demonstrated that noise in stochastic gradient desce...
research ∙ 05/26/2020