Related papers: Why Does Adaptive Zeroth-Order Optimization Work?

Query-Efficient Zeroth-Order Algorithms for Nonconvex Constrained Optimization

Zeroth-order optimization (ZO) has been a powerful framework for solving black-box problems, which estimates gradients using zeroth-order data to update variables iteratively. The practical applicability of ZO critically depends on the…

Optimization and Control · Mathematics 2026-03-03 Ruiyang Jin , Yuke Zhou , Yujie Tang , Jie Song , Siyang Gao

Obtaining Lower Query Complexities through Lightweight Zeroth-Order Proximal Gradient Algorithms

Zeroth-order (ZO) optimization is one key technique for machine learning problems where gradient calculation is expensive or impossible. Several variance reduced ZO proximal algorithms have been proposed to speed up ZO optimization for…

Optimization and Control · Mathematics 2024-10-04 Bin Gu , Xiyuan Wei , Hualin Zhang , Yi Chang , Heng Huang

Zeroth-Order Regularized Optimization (ZORO): Approximately Sparse Gradients and Adaptive Sampling

We consider the problem of minimizing a high-dimensional objective function, which may include a regularization term, using (possibly noisy) evaluations of the function. Such optimization is also called derivative-free, zeroth-order, or…

Optimization and Control · Mathematics 2023-03-20 HanQin Cai , Daniel Mckenzie , Wotao Yin , Zhenliang Zhang

A Primer on Zeroth-Order Optimization in Signal Processing and Machine Learning

Zeroth-order (ZO) optimization is a subset of gradient-free optimization that emerges in many signal processing and machine learning applications. It is used for solving optimization problems similarly to gradient-based methods. However, it…

Machine Learning · Computer Science 2020-06-23 Sijia Liu , Pin-Yu Chen , Bhavya Kailkhura , Gaoyuan Zhang , Alfred Hero , Pramod K. Varshney

On Adaptivity in Zeroth-Order Optimization

We investigate the effectiveness of adaptive zeroth-order (ZO) optimization for memory-constrained fine-tuning of large language models (LLMs). Contrary to prior claims, we show that adaptive ZO methods such as ZO-Adam offer no convergence…

Machine Learning · Computer Science 2026-05-06 Hassan Dbouk , Nidham Gazagnadou , Matthias Reisser , Christos Louizos

Prior-Informed Zeroth-Order Optimization with Adaptive Direction Alignment for Memory-Efficient LLM Fine-Tuning

Fine-tuning large language models (LLMs) has achieved remarkable success across various NLP tasks, but the substantial memory overhead during backpropagation remains a critical bottleneck, especially as model scales grow. Zeroth-order (ZO)…

Computation and Language · Computer Science 2026-01-09 Feihu Jin , Shipeng Cen , Ying Tan

Lazy Queries Can Reduce Variance in Zeroth-order Optimization

A major challenge of applying zeroth-order (ZO) methods is the high query complexity, especially when queries are costly. We propose a novel gradient estimation technique for ZO methods based on adaptive lazy queries that we term as LAZO.…

Machine Learning · Computer Science 2022-06-16 Quan Xiao , Qing Ling , Tianyi Chen

Zeroth-Order Stochastic Variance Reduction for Nonconvex Optimization

As application demands for zeroth-order (gradient-free) optimization accelerate, the need for variance reduced and faster converging approaches is also intensifying. This paper addresses these challenges by presenting: a) a comprehensive…

Machine Learning · Computer Science 2018-06-08 Sijia Liu , Bhavya Kailkhura , Pin-Yu Chen , Paishun Ting , Shiyu Chang , Lisa Amini

Zeroth-Order Hard-Thresholding: Gradient Error vs. Expansivity

$\ell_0$ constrained optimization is prevalent in machine learning, particularly for high-dimensional problems, because it is a fundamental approach to achieve sparse learning. Hard-thresholding gradient descent is a dominant technique to…

Machine Learning · Computer Science 2024-03-19 William de Vazelhes , Hualin Zhang , Huimin Wu , Xiao-Tong Yuan , Bin Gu

Zeroth-Order Optimization at the Edge of Stability

Zeroth-order (ZO) methods are widely used when gradients are unavailable or prohibitively expensive, including black-box learning and memory-efficient fine-tuning of large models, yet their optimization dynamics in deep learning remain…

Machine Learning · Computer Science 2026-04-17 Minhak Song , Liang Zhang , Bingcong Li , Niao He , Michael Muehlebach , Sewoong Oh

Zeroth-Order Optimization is Secretly Single-Step Policy Optimization

Zeroth-Order Optimization (ZOO) provides powerful tools for optimizing functions where explicit gradients are unavailable or expensive to compute. However, the underlying mechanisms of popular ZOO methods, particularly those employing…

Machine Learning · Computer Science 2025-06-18 Junbin Qiu , Zhengpeng Xie , Xiangda Yan , Yongjie Yang , Yao Shu

Refining Adaptive Zeroth-Order Optimization at Ease

Recently, zeroth-order (ZO) optimization plays an essential role in scenarios where gradient information is inaccessible or unaffordable, such as black-box systems and resource-constrained environments. While existing adaptive methods such…

Machine Learning · Computer Science 2025-06-10 Yao Shu , Qixin Zhang , Kun He , Zhongxiang Dai

Explicit and Non-asymptotic Query Complexities of Rank-Based Zeroth-order Algorithm on Stochastic Smooth Functions

Zeroth-order (ZO) optimization with ordinal feedback has emerged as a fundamental problem in modern machine learning systems, particularly in human-in-the-loop settings such as reinforcement learning from human feedback, preference…

Optimization and Control · Mathematics 2025-12-23 Haishan Ye

Zeroth-Order Methods for Nonconvex Stochastic Problems with Decision-Dependent Distributions

In this study, we consider an optimization problem with uncertainty dependent on decision variables, which has recently attracted attention due to its importance in machine learning and pricing applications. In this problem, the gradient of…

Optimization and Control · Mathematics 2024-12-31 Yuya Hikima , Akiko Takeda

Powering Up Zeroth-Order Training via Subspace Gradient Orthogonalization

Zeroth-order (ZO) optimization provides a gradient-free alternative to first-order (FO) methods by estimating gradients via finite differences of function evaluations, and has recently emerged as a memory-efficient paradigm for fine-tuning…

Machine Learning · Computer Science 2026-02-24 Yicheng Lang , Changsheng Wang , Yihua Zhang , Mingyi Hong , Zheng Zhang , Wotao Yin , Sijia Liu

On the Optimal Construction of Unbiased Gradient Estimators for Zeroth-Order Optimization

Zeroth-order optimization (ZOO) is an important framework for stochastic optimization when gradients are unavailable or expensive to compute. A potential limitation of existing ZOO methods is the bias inherent in most gradient estimators…

Machine Learning · Computer Science 2025-10-24 Shaocong Ma , Heng Huang

An Empirical Evaluation of Zeroth-Order Optimization Methods on AI-driven Molecule Optimization

Molecule optimization is an important problem in chemical discovery and has been approached using many techniques, including generative modeling, reinforcement learning, genetic algorithms, and much more. Recent work has also applied…

Biomolecules · Quantitative Biology 2022-10-31 Elvin Lo , Pin-Yu Chen

Adaptive First-and Zeroth-order Methods for Weakly Convex Stochastic Optimization Problems

In this paper, we design and analyze a new family of adaptive subgradient methods for solving an important class of weakly convex (possibly nonsmooth) stochastic optimization problems. Adaptive methods that use exponential moving averages…

Optimization and Control · Mathematics 2020-05-26 Parvin Nazari , Davoud Ataee Tarzanagh , George Michailidis

Stochastic Zeroth-Order Optimization Under Heavy-Tailed Noise

We study stochastic zeroth-order (ZO) optimization of smooth nonconvex objectives under heavy-tailed sample-gradient noise. This regime is motivated by empirical evidence that gradient noise in modern machine learning can violate the…

Optimization and Control · Mathematics 2026-05-19 Taha El Bakkali , El Mahdi Chayti , Qiuyi Zhang , Imane Rahali , Omar Saadi

Zero-Order Optimization for LLM Fine-Tuning via Learnable Direction Sampling

Fine-tuning large pretrained language models (LLMs) is a cornerstone of modern NLP, yet its growing memory demands (driven by backpropagation and large optimizer States) limit deployment in resource-constrained settings. Zero-order (ZO)…

Machine Learning · Computer Science 2026-02-17 Valery Parfenov , Grigoriy Evseev , Andrey Veprikov , Nikolay Bushkov , Stanislav Moiseev , Aleksandr Beznosikov