research ∙ 05/31/2023
Mildly Overparameterized ReLU Networks Have a Favorable Loss Landscape
We study the loss landscape of two-layer mildly overparameterized ReLU n...
research ∙ 01/17/2023
Expected Gradients of Maxout Networks and Consequences to Parameter Initialization
We study the gradients of a maxout network with respect to inputs and pa...
research ∙ 07/01/2021