Shadowing Properties of Optimization Algorithms

11/12/2019
by   Antonio Orvieto, et al.
0

Ordinary differential equation (ODE) models of gradient-based optimization methods can provide insights into the dynamics of learning and inspire the design of new algorithms. Unfortunately, this thought-provoking perspective is weakened by the fact that, in the worst case, the error between the algorithm steps and its ODE approximation grows exponentially with the number of iterations. In an attempt to encourage the use of continuous-time methods in optimization, we show that, if some additional regularity on the objective is assumed, the ODE representations of Gradient Descent and Heavy-ball do not suffer from the aforementioned problem, once we allow for a small perturbation on the algorithm initial condition. In the dynamical systems literature, this phenomenon is called shadowing. Our analysis relies on the concept of hyperbolicity, as well as on tools from numerical analysis.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
04/15/2020

On Dissipative Symplectic Integration with Applications to Gradient-Based Optimization

Continuous-time dynamical systems have proved useful in providing concep...
research
05/17/2019

A Dynamical Systems Perspective on Nesterov Acceleration

We present a dynamical system framework for understanding Nesterov's acc...
research
12/27/2021

Last-Iterate Convergence of Saddle Point Optimizers via High-Resolution Differential Equations

Several widely-used first-order saddle point optimization methods yield ...
research
03/29/2023

Ordinary Differential Equation-based Sparse Signal Recovery

This study investigates the use of continuous-time dynamical systems for...
research
08/19/2020

SODEN: A Scalable Continuous-Time Survival Model through Ordinary Differential Equation Networks

In this paper, we propose a flexible model for survival analysis using n...
research
11/17/2022

Optimization on the symplectic Stiefel manifold: SR decomposition-based retraction and applications

Numerous problems in optics, quantum physics, stability analysis, and co...
research
09/07/2020

Efficient Projection Algorithms onto the Weighted l1 Ball

Projected gradient descent has been proved efficient in many optimizatio...

Please sign up or login with your details

Forgot password? Click here to reset