Visualizing high-dimensional loss landscapes with Hessian directions

08/28/2022
by   Lucas Böttcher, et al.
7

Analyzing geometric properties of high-dimensional loss functions, such as local curvature and the existence of other optima around a certain point in loss space, can help provide a better understanding of the interplay between neural network structure, implementation attributes, and learning performance. In this work, we combine concepts from high-dimensional probability and differential geometry to study how curvature properties in lower-dimensional loss representations depend on those in the original loss space. We show that saddle points in the original space are rarely correctly identified as such in lower-dimensional representations if random projections are used. In such projections, the expected curvature in a lower-dimensional representation is proportional to the mean curvature in the original loss space. Hence, the mean curvature in the original loss space determines if saddle points appear, on average, as either minima, maxima, or almost flat regions. We use the connection between expected curvature and mean curvature (i.e., the normalized Hessian trace) to estimate the trace of Hessians without calculating the Hessian or Hessian-vector products as in Hutchinson's method. Because random projections are not able to correctly identify saddle information, we propose to study projections along Hessian directions that are associated with the largest and smallest principal curvatures. We connect our findings to the ongoing debate on loss landscape flatness and generalizability. Finally, we illustrate our method in numerical experiments on different image classifiers with up to about 7× 10^6 parameters.

READ FULL TEXT

page 12

page 14

page 15

research
10/14/2019

Emergent properties of the local geometry of neural loss landscapes

The local geometry of high dimensional neural network loss landscapes ca...
research
06/08/2022

Diffusion Curvature for Estimating Local Curvature in High Dimensional Data

We introduce a new intrinsic measure of local curvature on point-cloud d...
research
12/07/2020

A Deeper Look at the Hessian Eigenspectrum of Deep Neural Networks and its Applications to Regularization

Loss landscape analysis is extremely useful for a deeper understanding o...
research
05/08/2020

The critical locus of overparameterized neural networks

Many aspects of the geometry of loss functions in deep learning remain m...
research
07/20/2020

GarNet++: Improving Fast and Accurate Static3D Cloth Draping by Curvature Loss

In this paper, we tackle the problem of static 3D cloth draping on virtu...
research
06/12/2023

Unveiling the Hessian's Connection to the Decision Boundary

Understanding the properties of well-generalizing minima is at the heart...
research
03/22/2023

Convergence of Hessian estimator from random samples on a manifold

We provide a systematic convergence analysis of the Hessian operator est...

Please sign up or login with your details

Forgot password? Click here to reset