Related papers: Learning to be Global Optimizer

An Algebraically Converging Stochastic Gradient Descent Algorithm for Global Optimization

We propose a new gradient descent algorithm with added stochastic terms for finding the global optimizers of nonconvex optimization problems. A key component in the algorithm is the adaptive tuning of the randomness based on the value of…

Optimization and Control · Mathematics 2025-06-16 Björn Engquist , Kui Ren , Yunan Yang

Rover Descent: Learning to optimize by learning to navigate on prototypical loss surfaces

Learning to optimize - the idea that we can learn from data algorithms that optimize a numerical criterion - has recently been at the heart of a growing number of research efforts. One of the most challenging issues within this approach is…

Machine Learning · Computer Science 2018-02-21 Louis Faury , Flavian Vasile

On Graduated Optimization for Stochastic Non-Convex Problems

The graduated optimization approach, also known as the continuation method, is a popular heuristic to solving non-convex problems that has received renewed interest over the last decade. Despite its popularity, very little is known in terms…

Machine Learning · Computer Science 2015-07-28 Elad Hazan , Kfir Y. Levy , Shai Shalev-Shwartz

Finding Global Optima in Nonconvex Stochastic Semidefinite Optimization with Variance Reduction

There is a recent surge of interest in nonconvex reformulations via low-rank factorization for stochastic convex semidefinite optimization problem in the purpose of efficiency and scalability. Compared with the original convex formulations,…

Optimization and Control · Mathematics 2018-02-27 Jinshan Zeng , Ke Ma , Yuan Yao

A Convexity-dependent Two-Phase Training Algorithm for Deep Neural Networks

The key task of machine learning is to minimize the loss function that measures the model fit to the training data. The numerical methods to do this efficiently depend on the properties of the loss function. The most decisive among these…

Machine Learning · Computer Science 2025-10-31 Tomas Hrycej , Bernhard Bermeitinger , Massimo Pavone , Götz-Henrik Wiegand , Siegfried Handschuh

Global Optimality in Bivariate Gradient-based DAG Learning

Recently, a new class of non-convex optimization problems motivated by the statistical problem of learning an acyclic directed graphical model from data has attracted significant interest. While existing work uses standard first-order…

Machine Learning · Computer Science 2023-07-03 Chang Deng , Kevin Bello , Bryon Aragam , Pradeep Ravikumar

A Stochastic Gradient Descent Method for Globally Minimizing Nearly Convex Functions

This paper proposes a stochastic gradient descent method with an adaptive Gaussian noise term for the global minimization of nearly convex functions, which are nonconvex and possess multiple strict local minimizers. The noise term,…

Optimization and Control · Mathematics 2025-08-05 Chenglong Bao , Liang Chen , Weizhi Shao

Greedy Learning to Optimize with Convergence Guarantees

Learning to optimize is an approach that leverages training data to accelerate the solution of optimization problems. Many approaches use unrolling to parametrize the update step and learn optimal parameters. Although L2O has shown…

Optimization and Control · Mathematics 2025-07-15 Patrick Fahy , Mohammad Golbabaee , Matthias J. Ehrhardt

Explicit and Implicit Graduated Optimization in Deep Neural Networks

Graduated optimization is a global optimization technique that is used to minimize a multimodal nonconvex function by smoothing the objective function with noise and gradually refining the solution. This paper experimentally evaluates the…

Machine Learning · Computer Science 2024-12-17 Naoki Sato , Hideaki Iiduka

Random Scaling and Momentum for Non-smooth Non-convex Optimization

Training neural networks requires optimizing a loss function that may be highly irregular, and in particular neither convex nor smooth. Popular training algorithms are based on stochastic gradient descent with momentum (SGDM), for which…

Machine Learning · Computer Science 2026-03-17 Qinzi Zhang , Ashok Cutkosky

A globally convergent algorithm for nonconvex optimization based on block coordinate update

Nonconvex optimization problems arise in many areas of computational science and engineering and are (approximately) solved by a variety of algorithms. Existing algorithms usually only have local convergence or subsequence convergence of…

Optimization and Control · Mathematics 2015-08-21 Yangyang Xu , Wotao Yin

A Globally Convergent Algorithm for Neural Network Parameter Optimization Based on Difference-of-Convex Functions

We propose an algorithm for optimizing the parameters of single hidden layer neural networks. Specifically, we derive a blockwise difference-of-convex (DC) functions representation of the objective function. Based on the latter, we propose…

Machine Learning · Computer Science 2024-01-17 Daniel Tschernutter , Mathias Kraus , Stefan Feuerriegel

A ranking approach to global optimization

We consider the problem of maximizing an unknown function over a compact and convex set using as few observations as possible. We observe that the optimization of the function essentially relies on learning the induced bipartite ranking…

Machine Learning · Statistics 2017-03-08 Cédric Malherbe , Nicolas Vayatis

Learning to optimize with convergence guarantees using nonlinear system theory

The increasing reliance on numerical methods for controlling dynamical systems and training machine learning models underscores the need to devise algorithms that dependably and efficiently navigate complex optimization landscapes.…

Systems and Control · Electrical Eng. & Systems 2024-06-04 Andrea Martin , Luca Furieri

Guided Policy Search as Approximate Mirror Descent

Guided policy search algorithms can be used to optimize complex nonlinear policies, such as deep neural networks, without directly computing policy gradients in the high-dimensional parameter space. Instead, these methods use supervised…

Machine Learning · Computer Science 2016-07-18 William Montgomery , Sergey Levine

Super Gradient Descent: Global Optimization requires Global Gradient

Global minimization is a fundamental challenge in optimization, especially in machine learning, where finding the global minimum of a function directly impacts model performance and convergence. This article introduces a novel optimization…

Machine Learning · Computer Science 2024-10-31 Seifeddine Achour

Recent Theoretical Advances in Non-Convex Optimization

Motivated by recent increased interest in optimization algorithms for non-convex optimization in application to training deep neural networks and other optimization problems in data analysis, we give an overview of recent theoretical…

Optimization and Control · Mathematics 2021-11-29 Marina Danilova , Pavel Dvurechensky , Alexander Gasnikov , Eduard Gorbunov , Sergey Guminov , Dmitry Kamzolov , Innokentiy Shibaev

Faster Differentially Private Convex Optimization via Second-Order Methods

Differentially private (stochastic) gradient descent is the workhorse of DP private machine learning in both the convex and non-convex settings. Without privacy constraints, second-order methods, like Newton's method, converge faster than…

Machine Learning · Computer Science 2023-05-23 Arun Ganesh , Mahdi Haghifam , Thomas Steinke , Abhradeep Thakurta

An empirical analysis of the optimization of deep network loss surfaces

The success of deep neural networks hinges on our ability to accurately and efficiently optimize high-dimensional, non-convex functions. In this paper, we empirically investigate the loss functions of state-of-the-art networks, and how…

Machine Learning · Computer Science 2017-12-11 Daniel Jiwoong Im , Michael Tao , Kristin Branson

An Alternative Globalization Strategy for Unconstrained Optimization

We propose a new globalization strategy that can be used in unconstrained optimization algorithms to support rapid convergence from remote starting points. Our approach is based on using multiple points at each iteration to build a…

Optimization and Control · Mathematics 2017-05-16 Figen Öztoprak , Ş. İlker Birbil