Flat minima generalize for low-rank matrix recovery

03/07/2022
by   Lijun Ding, et al.
0

Empirical evidence suggests that for a variety of overparameterized nonlinear models, most notably in neural network training, the growth of the loss around a minimizer strongly impacts its performance. Flat minima – those around which the loss grows slowly – appear to generalize well. This work takes a step towards understanding this phenomenon by focusing on the simplest class of overparameterized nonlinear models: those arising in low-rank matrix recovery. We analyze overparameterized matrix and bilinear sensing, robust PCA, covariance matrix estimation, and single hidden layer neural networks with quadratic activation functions. In all cases, we show that flat minima, measured by the trace of the Hessian, exactly recover the ground truth under standard statistical assumptions. For matrix completion, we establish weak recovery, although empirical evidence suggests exact recovery holds here as well. We complete the paper with synthetic experiments that illustrate our findings.

READ FULL TEXT

page 25

page 26

research
04/21/2021

Sharp Global Guarantees for Nonconvex Low-Rank Matrix Recovery in the Overparameterized Regime

We prove that it is possible for nonconvex low-rank matrix recovery to c...
research
05/23/2016

Global Optimality of Local Search for Low Rank Matrix Recovery

We show that there are no spurious local minima in the non-convex factor...
research
02/21/2023

On the Optimization Landscape of Burer-Monteiro Factorization: When do Global Solutions Correspond to Ground Truth?

In low-rank matrix recovery, the goal is to recover a low-rank matrix, g...
research
05/24/2023

On progressive sharpening, flat minima and generalisation

We present a new approach to understanding the relationship between loss...
research
08/31/2020

Low-rank matrix recovery with non-quadratic loss: projected gradient method and regularity projection oracle

Existing results for low-rank matrix recovery largely focus on quadratic...
research
10/28/2016

Dynamic matrix recovery from incomplete observations under an exact low-rank constraint

Low-rank matrix factorizations arise in a wide variety of applications -...
research
05/25/2018

How Much Restricted Isometry is Needed In Nonconvex Matrix Recovery?

When the linear measurements of an instance of low-rank matrix recovery ...

Please sign up or login with your details

Forgot password? Click here to reset