Related papers: Subspace-based Approximate Hessian Method for Zero…

Hessian-Aware Zeroth-Order Optimization for Black-Box Adversarial Attack

Zeroth-order optimization is an important research topic in machine learning. In recent years, it has become a key tool in black-box adversarial attack to neural network based image classifiers. However, existing zeroth-order optimization…

Machine Learning · Computer Science 2019-03-21 Haishan Ye , Zhichao Huang , Cong Fang , Chris Junchi Li , Tong Zhang

Zeroth-Order Randomized Subspace Newton Methods

Zeroth-order methods have become important tools for solving problems where we have access only to function evaluations. However, the zeroth-order methods only using gradient approximations are $n$ times slower than classical first-order…

Optimization and Control · Mathematics 2022-02-10 Erik Berglund , Sarit Khirirat , Xiaoyu Wang

Query-Efficient Zeroth-Order Algorithms for Nonconvex Constrained Optimization

Zeroth-order optimization (ZO) has been a powerful framework for solving black-box problems, which estimates gradients using zeroth-order data to update variables iteratively. The practical applicability of ZO critically depends on the…

Optimization and Control · Mathematics 2026-03-03 Ruiyang Jin , Yuke Zhou , Yujie Tang , Jie Song , Siyang Gao

Zeroth-Order Optimization Finds Flat Minima

Zeroth-order methods are extensively used in machine learning applications where gradients are infeasible or expensive to compute, such as black-box attacks, reinforcement learning, and language model fine-tuning. Existing optimization…

Machine Learning · Computer Science 2025-11-12 Liang Zhang , Bingcong Li , Kiran Koshy Thekumparampil , Sewoong Oh , Michael Muehlebach , Niao He

Obtaining Lower Query Complexities through Lightweight Zeroth-Order Proximal Gradient Algorithms

Zeroth-order (ZO) optimization is one key technique for machine learning problems where gradient calculation is expensive or impossible. Several variance reduced ZO proximal algorithms have been proposed to speed up ZO optimization for…

Optimization and Control · Mathematics 2024-10-04 Bin Gu , Xiyuan Wei , Hualin Zhang , Yi Chang , Heng Huang

Zeroth-Order Hybrid Gradient Descent: Towards A Principled Black-Box Optimization Framework

In this work, we focus on the study of stochastic zeroth-order (ZO) optimization which does not require first-order gradient information and uses only function evaluations. The problem of ZO optimization has emerged in many recent machine…

Machine Learning · Statistics 2020-12-22 Pranay Sharma , Kaidi Xu , Sijia Liu , Pin-Yu Chen , Xue Lin , Pramod K. Varshney

A Zeroth-order Proximal Stochastic Gradient Method for Weakly Convex Stochastic Optimization

In this paper we analyze a zeroth-order proximal stochastic gradient method suitable for the minimization of weakly convex stochastic optimization problems. We consider nonsmooth and nonlinear stochastic composite problems, for which…

Optimization and Control · Mathematics 2025-04-21 Spyridon Pougkakiotis , Dionysios S. Kalogerias

Powering Up Zeroth-Order Training via Subspace Gradient Orthogonalization

Zeroth-order (ZO) optimization provides a gradient-free alternative to first-order (FO) methods by estimating gradients via finite differences of function evaluations, and has recently emerged as a memory-efficient paradigm for fine-tuning…

Machine Learning · Computer Science 2026-02-24 Yicheng Lang , Changsheng Wang , Yihua Zhang , Mingyi Hong , Zheng Zhang , Wotao Yin , Sijia Liu

A Zeroth-Order Proximal Algorithm for Consensus Optimization

This paper considers a consensus optimization problem, where all the nodes in a network, with access to the zeroth-order information of its local objective function only, attempt to cooperatively achieve a common minimizer of the sum of…

Optimization and Control · Mathematics 2024-06-17 Chengan Wang , Zichong Ou , Jie Lu

A Primer on Zeroth-Order Optimization in Signal Processing and Machine Learning

Zeroth-order (ZO) optimization is a subset of gradient-free optimization that emerges in many signal processing and machine learning applications. It is used for solving optimization problems similarly to gradient-based methods. However, it…

Machine Learning · Computer Science 2020-06-23 Sijia Liu , Pin-Yu Chen , Bhavya Kailkhura , Gaoyuan Zhang , Alfred Hero , Pramod K. Varshney

SASSHA: Sharpness-aware Adaptive Second-order Optimization with Stable Hessian Approximation

Approximate second-order optimization methods often exhibit poorer generalization compared to first-order approaches. In this work, we look into this issue through the lens of the loss landscape and find that existing second-order methods…

Machine Learning · Computer Science 2025-06-25 Dahun Shin , Dongyeop Lee , Jinseok Chung , Namhoon Lee

Gradient-Free Method for Heavily Constrained Nonconvex Optimization

Zeroth-order (ZO) method has been shown to be a powerful method for solving the optimization problem where explicit expression of the gradients is difficult or infeasible to obtain. Recently, due to the practical value of the constrained…

Optimization and Control · Mathematics 2024-09-04 Wanli Shi , Hongchang Gao , Bin Gu

Elucidating Subspace Perturbation in Zeroth-Order Optimization: Theory and Practice at Scale

Zeroth-order (ZO) optimization has emerged as a promising alternative to gradient-based backpropagation methods, particularly for black-box optimization and large language model (LLM) fine-tuning. However, ZO methods often suffer from slow…

Machine Learning · Computer Science 2025-05-26 Sihwan Park , Jihun Yun , SungYub Kim , Souvik Kundu , Eunho Yang

Zeroth-order Low-rank Hessian Estimation via Matrix Recovery

A zeroth-order Hessian estimator aims to recover the Hessian matrix of an objective function at any given point, using minimal finite-difference computations. This paper studies zeroth-order Hessian estimation for low-rank Hessians, from a…

Optimization and Control · Mathematics 2024-02-09 Tianyu Wang , Zicheng Wang , Jiajia Yu

ZO-JADE: Zeroth-order Curvature-Aware Multi-Agent Convex Optimization

In this work we address the problem of convex optimization in a multi-agent setting where the objective is to minimize the mean of local cost functions whose derivatives are not available (e.g. black-box models). Moreover agents can only…

Optimization and Control · Mathematics 2023-06-14 Alessio Maritan , Luca Schenato

Zeroth-order Nonconvex Stochastic Optimization: Handling Constraints, High-Dimensionality and Saddle-Points

In this paper, we propose and analyze zeroth-order stochastic approximation algorithms for nonconvex and convex optimization, with a focus on addressing constrained optimization, high-dimensional setting and saddle-point avoiding. To handle…

Optimization and Control · Mathematics 2019-01-16 Krishnakumar Balasubramanian , Saeed Ghadimi

Co-iterative augmented Hessian method for orbital optimization

Orbital optimization procedure is widely called in electronic structure simulation. To efficiently find the orbital optimization solution, we developed a new second order orbital optimization algorithm, co-iteration augmented Hessian (CIAH)…

Chemical Physics · Physics 2017-01-11 Qiming Sun

ZOBA: An Efficient Single-loop Zeroth-order Bilevel Optimization Algorithm

Bilevel optimization problems consist of minimizing a value function whose evaluation depends on the solution of an inner optimization problem. These problems are typically tackled using first-order methods that require computing the…

Optimization and Control · Mathematics 2026-01-30 Marco Rando , Samuel Vaiter

On the Convergence Theory for Hessian-Free Bilevel Algorithms

Bilevel optimization has arisen as a powerful tool in modern machine learning. However, due to the nested structure of bilevel optimization, even gradient-based methods require second-order derivative approximations via Jacobian- or/and…

Machine Learning · Computer Science 2022-06-07 Daouda Sow , Kaiyi Ji , Yingbin Liang

Zeroth-order Stochastic Cubic Newton Method Revisited

This paper studies stochastic minimization of a finite-sum loss $ F (\mathbf{x}) = \frac{1}{N} \sum_{\xi=1}^N f(\mathbf{x};\xi) $. In many real-world scenarios, the Hessian matrix of such objectives exhibits a low-rank structure on a batch…

Optimization and Control · Mathematics 2025-08-12 Yu Liu , Weibin Peng , Tianyu Wang , Jiajia Yu