research
          
      
      ∙
      02/08/2021
    Eliminating Sharp Minima from SGD with Truncated Heavy-tailed Noise
The empirical success of deep learning is often attributed to SGD's myst...
          
            research
          
      
      ∙
      10/31/2017
     
             
  
  
     
                             
                             share
 share