We develop a variant of the stochastic prox-linear method for minimizing...
As adaptive gradient methods are typically used for training
over-parame...
The problem of language grounding has attracted much attention in recent...
We consider stochastic second order methods for minimizing strongly-conv...