research ∙ 09/19/2023
On the different regimes of Stochastic Gradient Descent
Modern deep networks are trained with stochastic gradient descent (SGD) ...
research ∙ 01/31/2023
Dissecting the Effects of SGD Noise in Distinct Regimes of Deep Learning
Understanding when the noise in stochastic gradient descent (SGD) affect...
research ∙ 02/07/2022