research
          
      
      ∙
      09/19/2023
    On the different regimes of Stochastic Gradient Descent
Modern deep networks are trained with stochastic gradient descent (SGD) ...
          
            research
          
      
      ∙
      01/31/2023
    Dissecting the Effects of SGD Noise in Distinct Regimes of Deep Learning
Understanding when the noise in stochastic gradient descent (SGD) affect...
          
            research
          
      
      ∙
      02/07/2022
     
             
  
  
     
                             share
 share