research ∙ 06/07/2023
Stochastic Collapse: How Gradient Noise Attracts SGD Dynamics Towards Simpler Subnetworks
In this work, we reveal a strong implicit bias of stochastic gradient de...
          
research ∙ 10/07/2022