Symmetry, Saddle Points, and Global Geometry of Nonconvex Matrix Factorization

12/29/2016
by   Xingguo Li, et al.

We propose a general theory for studying the geometry of nonconvex objective functions with underlying symmetric structures. Specifically, we characterize the locations of stationary points and the null space of the associated Hessian matrices through the lens of invariant groups. As a major motivating example, we apply the proposed theory to characterize the global geometry of the low-rank matrix factorization problem. In particular, we illustrate how the rotational symmetry group gives rise to infinitely many non-isolated strict saddle points and equivalent global minima of the objective function. By explicitly identifying all stationary points, we divide the entire parameter space into three regions: (R_1) the region containing the neighborhoods of all strict saddle points, where the objective has negative curvature; (R_2) the region containing the neighborhoods of all global minima, where the objective enjoys strong convexity along certain directions; and (R_3) the complement of the above regions, where the gradient has sufficiently large magnitude. We further extend our result to the matrix sensing problem. This allows us to establish strong global convergence guarantees for popular iterative algorithms with arbitrary initial solutions.
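The role of rotational symmetry described above can be checked numerically on a small instance. The sketch below is illustrative only and is not taken from the paper: it assumes the symmetric positive semidefinite form of the objective, f(X) = 1/4 ||X X^T - M||_F^2, which the abstract does not spell out, and every name in it (`objective`, `gradient`, `random_rotation`, the sizes `n` and `r`) is hypothetical. It verifies that f(X Q) = f(X) for an orthogonal Q, that every rotated copy of a true factor attains the minimum, and that X = 0 is a stationary point with a direction of negative curvature.

```python
import numpy as np

def objective(X, M):
    """Assumed objective: f(X) = 1/4 * ||X X^T - M||_F^2."""
    residual = X @ X.T - M
    return 0.25 * np.sum(residual * residual)

def gradient(X, M):
    """Gradient of the assumed objective for symmetric M: (X X^T - M) X."""
    return (X @ X.T - M) @ X

def random_rotation(r, rng):
    """Random r x r orthogonal matrix from the QR factorization of a Gaussian matrix."""
    Q, _ = np.linalg.qr(rng.standard_normal((r, r)))
    return Q

rng = np.random.default_rng(0)
n, r = 6, 2                       # hypothetical problem sizes
U = rng.standard_normal((n, r))   # a ground-truth factor
M = U @ U.T                       # rank-r PSD target matrix
X = rng.standard_normal((n, r))   # an arbitrary point in parameter space
Q = random_rotation(r, rng)

# Rotational invariance: (XQ)(XQ)^T = X X^T, so the objective is unchanged.
invariance_gap = abs(objective(X, M) - objective(X @ Q, M))

# Every rotated copy U Q of the true factor is a global minimum (value ~ 0),
# so the minima form a continuous, non-isolated set.
min_value = objective(U @ Q, M)

# X = 0 is a stationary point: the gradient vanishes there.
grad_norm_at_zero = np.linalg.norm(gradient(np.zeros((n, r)), M))

# Negative curvature at X = 0: take a direction D whose first column is the
# top eigenvector of M and estimate the second directional derivative of f
# along D with a central finite difference.
eigvals, eigvecs = np.linalg.eigh(M)   # eigenvalues in ascending order
D = np.zeros((n, r))
D[:, 0] = eigvecs[:, -1]
h = 1e-3
zero = np.zeros((n, r))
curvature_at_zero = (
    objective(h * D, M) - 2.0 * objective(zero, M) + objective(-h * D, M)
) / h**2

print("invariance gap:", invariance_gap)
print("objective at rotated minimizer:", min_value)
print("gradient norm at zero:", grad_norm_at_zero)
print("directional curvature at zero:", curvature_at_zero)
```

Under these assumptions the curvature estimate at the origin is close to the negative of the largest eigenvalue of M, which is the kind of strict-saddle behavior the abstract attributes to region (R_1).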


research
02/27/2020

The Landscape of Matrix Factorization Revisited

We revisit the landscape of the simple matrix factorization problem. For...
research
03/08/2022

Noisy Low-rank Matrix Optimization: Geometry of Local Minima and Convergence Rate

This paper is concerned with low-rank matrix optimization, which has fou...
research
04/29/2022

Escaping Spurious Local Minima of Low-Rank Matrix Factorization Through Convex Lifting

This work proposes a rapid global solver for nonconvex low-rank matrix f...
research
02/24/2021

Noisy Gradient Descent Converges to Flat Minima for Nonconvex Matrix Factorization

Numerous empirical evidences have corroborated the importance of noise i...
research
12/25/2021

Over-Parametrized Matrix Factorization in the Presence of Spurious Stationary Points

Motivated by the emerging role of interpolating machines in signal proce...
research
02/13/2020

Fast Convergence for Langevin Diffusion with Matrix Manifold Structure

In this paper, we study the problem of sampling from distributions of th...
