AGGLIO: Global Optimization for Locally Convex Functions

11/06/2021
by   Debojyoti Dey, et al.
0

This paper presents AGGLIO (Accelerated Graduated Generalized LInear-model Optimization), a stage-wise, graduated optimization technique that offers global convergence guarantees for non-convex optimization problems whose objectives offer only local convexity and may fail to be even quasi-convex at a global scale. In particular, this includes learning problems that utilize popular activation functions such as sigmoid, softplus and SiLU that yield non-convex training objectives. AGGLIO can be readily implemented using point as well as mini-batch SGD updates and offers provable convergence to the global optimum in general conditions. In experiments, AGGLIO outperformed several recently proposed optimization techniques for non-convex and locally convex objectives in terms of convergence rate as well as convergent accuracy. AGGLIO relies on a graduation technique for generalized linear models, as well as a novel proof strategy, both of which may be of independent interest.

READ FULL TEXT

page 1

page 2

page 3

page 4

research
03/19/2022

Convergence Error Analysis of Reflected Gradient Langevin Dynamics for Globally Optimizing Non-Convex Constrained Problems

Non-convex optimization problems have various important applications, wh...
research
07/08/2020

Non-parametric Models for Non-negative Functions

Linear models have shown great effectiveness and flexibility in many fie...
research
05/31/2019

Testing that a Local Optimum of the Likelihood is Globally Optimum using Reparameterized Embeddings

Many mathematical imaging problems are posed as non-convex optimization ...
research
05/19/2023

Accelerating Convergence in Global Non-Convex Optimization with Reversible Diffusion

Langevin Dynamics has been extensively employed in global non-convex opt...
research
09/09/2020

Meta-learning for Multi-variable Non-convex Optimization Problems: Iterating Non-optimums Makes Optimum Possible

In this paper, we aim to address the problem of solving a non-convex opt...
research
07/01/2016

Convergence Rate of Frank-Wolfe for Non-Convex Objectives

We give a simple proof that the Frank-Wolfe algorithm obtains a stationa...
research
05/17/2019

SSFN: Self Size-estimating Feed-forward Network and Low Complexity Design

We design a self size-estimating feed-forward network (SSFN) using a joi...

Please sign up or login with your details

Forgot password? Click here to reset