Piecewise Strong Convexity of Neural Networks

10/30/2018
by   Tristan Milne, et al.
0

We study the loss surface of a fully connected neural network with ReLU non-linearities, regularized with weight decay. We start by expressing the output of the network as a matrix determinant, which allows us to establish that the loss function is piecewise strongly convex on a bounded set where the training set error is below a threshold that we can estimate. This is used to prove that local minima of the loss function in this open set are isolated, and that every critical point below this error threshold is a local minimum, partially addressing an open problem given at the Conference on Learning Theory (COLT) 2015. Our results also give quantitative understanding of the improved performance if dropout is used as well as quantitative evidence that deeper networks are harder to train.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
05/23/2016

Deep Learning without Poor Local Minima

In this paper, we prove a conjecture published in 1989 and also partiall...
research
11/30/2014

The Loss Surfaces of Multilayer Networks

We study the connection between the highly non-convex loss function of a...
research
08/22/2023

A free from local minima algorithm for training regressive MLP neural networks

In this article an innovative method for training regressive MLP network...
research
02/25/2021

Spurious Local Minima Are Common for Deep Neural Networks with Piecewise Linear Activations

In this paper, it is shown theoretically that spurious local minima are ...
research
07/06/2018

The Goldilocks zone: Towards better understanding of neural network loss landscapes

We explore the loss landscape of fully-connected neural networks using r...
research
04/01/2022

Estimating the Jacobian matrix of an unknown multivariate function from sample values by means of a neural network

We describe, implement and test a novel method for training neural netwo...
research
06/28/2022

Diffeomorphic Registration using Sinkhorn Divergences

The diffeomorphic registration framework enables to define an optimal ma...

Please sign up or login with your details

Forgot password? Click here to reset