Related papers: Comparative Analysis of Gradient-Based Optimizatio…

Do optimization methods in deep learning applications matter?

With advances in deep learning, exponential data growth and increasing model complexity, developing efficient optimization methods are attracting much research attention. Several implementations favor the use of Conjugate Gradient (CG) and…

Machine Learning · Computer Science 2020-03-02 Buse Melis Ozyildirim , Mariam Kiran

Comparison of Minimization Methods for Rosenbrock Functions

This paper gives an in-depth review of the most common iterative methods for unconstrained optimization using two functions that belong to a class of Rosenbrock functions as a performance test. This study covers the Steepest Gradient…

Optimization and Control · Mathematics 2021-04-26 Iyanuoluwa Emiola , Robson Adem

Adaptive Proximal Gradient Method for Convex Optimization

In this paper, we explore two fundamental first-order algorithms in convex optimization, namely, gradient descent (GD) and proximal gradient method (ProxGD). Our focus is on making these algorithms entirely adaptive by leveraging local…

Optimization and Control · Mathematics 2024-02-13 Yura Malitsky , Konstantin Mishchenko

qNBO: quasi-Newton Meets Bilevel Optimization

Bilevel optimization, addressing challenges in hierarchical learning tasks, has gained significant interest in machine learning. The practical implementation of the gradient descent method to bilevel optimization encounters computational…

Machine Learning · Computer Science 2025-02-04 Sheng Fang , Yong-Jin Liu , Wei Yao , Chengming Yu , Jin Zhang

Quasi-Newton Optimization Methods For Deep Learning Applications

Deep learning algorithms often require solving a highly non-linear and nonconvex unconstrained optimization problem. Methods for solving optimization problems in large-scale machine learning, such as deep learning and deep reinforcement…

Machine Learning · Computer Science 2019-09-06 Jacob Rafati , Roummel F. Marcia

An empirical analysis of the optimization of deep network loss surfaces

The success of deep neural networks hinges on our ability to accurately and efficiently optimize high-dimensional, non-convex functions. In this paper, we empirically investigate the loss functions of state-of-the-art networks, and how…

Machine Learning · Computer Science 2017-12-11 Daniel Jiwoong Im , Michael Tao , Kristin Branson

Optimization methods for achieving high diffraction efficiency with perfect electric conducting gratings

This work presents the implementation, numerical examples and experimental convergence study of first- and second-order optimization methods applied to one-dimensional periodic gratings. Through boundary integral equations and shape…

Optimization and Control · Mathematics 2020-11-04 Rubén Aylwin , Gerardo Silva-Oelker , Carlos Jerez-Hanckes , Patrick Fay

The conjugate gradient method with various viewpoints

Connections of the conjugate gradient (CG) method with other methods in computational mathematics are surveyed, including the connections with the conjugate direction method, the subspace optimization method and the quasi-Newton method BFGS…

Numerical Analysis · Mathematics 2019-12-17 Xuping Zhang , Jiefei Yang , Ziying Liu

Initial Placement for Fruchterman--Reingold Force Model With Coordinate Newton Direction

Graph drawing is a fundamental task in information visualization, with the Fruchterman--Reingold (FR) force model being one of the most popular choices. We can interpret this visualization task as a continuous optimization problem, which…

Computational Geometry · Computer Science 2025-03-04 Hiroki Hamaguchi , Naoki Marumo , Akiko Takeda

Fast convex optimization via time scale and averaging of the steepest descent

In a Hilbert setting, we develop a gradient-based dynamic approach for fast solving convex optimization problems. By applying time scaling, averaging, and perturbation techniques to the continuous steepest descent (SD), we obtain…

Optimization and Control · Mathematics 2023-05-05 Hedy Attouch , Radu Ioan Bot , Dang-Khoa Nguyen

Accelerated Gradient Methods for Networked Optimization

We develop multi-step gradient methods for network-constrained optimization of strongly convex functions with Lipschitz-continuous gradients. Given the topology of the underlying network and bounds on the Hessian of the objective function,…

Optimization and Control · Mathematics 2015-06-12 Euhanna Ghadimi , Iman Shames , Mikael Johansson

Gradient Descent with Provably Tuned Learning-rate Schedules

Gradient-based iterative optimization methods are the workhorse of modern machine learning. They crucially rely on careful tuning of parameters like learning rate and momentum. However, one typically sets them using heuristic approaches…

Machine Learning · Computer Science 2025-12-05 Dravyansh Sharma

Multilevel Bregman Proximal Gradient Descent

We present the Multilevel Bregman Proximal Gradient Descent (ML BPGD) method, a novel multilevel optimization framework tailored to constrained convex problems with relative Lipschitz smoothness. Our approach extends the classical…

Optimization and Control · Mathematics 2026-05-06 Yara Elshiaty , Stefania Petra

Gradient-type subspace iteration methods for the symmetric eigenvalue problem

This paper explores variants of the subspace iteration algorithm for computing approximate invariant subspaces. The standard subspace iteration approach is revisited and new variants that exploit gradient-type techniques combined with a…

Numerical Analysis · Mathematics 2024-05-14 Foivos Alimisis , Yousef Saad , Bart Vandereycken

A Gentle Introduction to Gradient-Based Optimization and Variational Inequalities for Machine Learning

The rapid progress in machine learning in recent years has been based on a highly productive connection to gradient-based optimization. Further progress hinges in part on a shift in focus from pattern recognition to decision-making and…

Machine Learning · Computer Science 2024-02-27 Neha S. Wadia , Yatin Dandi , Michael I. Jordan

On the Efficient Implementation of the Matrix Exponentiated Gradient Algorithm for Low-Rank Matrix Optimization

Convex optimization over the spectrahedron, i.e., the set of all real $n\times n$ positive semidefinite matrices with unit trace, has important applications in machine learning, signal processing and statistics, mainly as a convex…

Optimization and Control · Mathematics 2022-11-01 Dan Garber , Atara Kaplan

A Principle for Global Optimization with Gradients

This work demonstrates the utility of gradients for the global optimization of certain differentiable functions with many suboptimal local minima. To this end, a principle for generating search directions from non-local quadratic…

Optimization and Control · Mathematics 2023-08-21 Nils Müller

A Newton-Type Proximal Gradient Method for Nonlinear Multi-objective Optimization Problems

In this paper, a globally convergent Newton-type proximal gradient method is developed for composite multi-objective optimization problems where each objective function can be represented as the sum of a smooth function and a nonsmooth…

Optimization and Control · Mathematics 2024-10-25 Md Abu Talhamainuddin Ansary

Nonlinear conjugate gradient methods: worst-case convergence rates via computer-assisted analyses

We propose a computer-assisted approach to the analysis of the worst-case convergence of nonlinear conjugate gradient methods (NCGMs). Those methods are known for their generally good empirical performances for large-scale optimization,…

Optimization and Control · Mathematics 2024-09-20 Shuvomoy Das Gupta , Robert M. Freund , Xu Andy Sun , Adrien Taylor

Towards Differentiable Multilevel Optimization: A Gradient-Based Approach

Multilevel optimization has gained renewed interest in machine learning due to its promise in applications such as hyperparameter tuning and continual learning. However, existing methods struggle with the inherent difficulty of efficiently…

Machine Learning · Computer Science 2024-10-16 Yuntian Gu , Xuzheng Chen