With the ever-growing model size and the limited availability of labeled...
While cross entropy (CE) is the most commonly used loss to train deep ne...
In this paper, we study the problem of recovering a low-rank matrix from...
When training deep neural networks for classification tasks, an intrigui...
We provide the first global optimization landscape analysis of
Neural Co...