We introduce Nuclear Co-Learned Representations (NuCLR), a deep learning...
A novel neural architecture was recently developed that enforces an exac...
We aim to understand grokking, a phenomenon where models generalize long...
The Lipschitz constant of the map between the input and output space
rep...