Related papers: ZOBA: An Efficient Single-loop Zeroth-order Bileve…

A Single-Loop Gradient Algorithm for Pessimistic Bilevel Optimization via Smooth Approximation

Bilevel optimization has garnered significant attention in the machine learning community recently, particularly regarding the development of efficient numerical methods. While substantial progress has been made in developing efficient…

Optimization and Control · Mathematics 2026-02-04 Qichao Cao, Shangzhi Zeng, Jin Zhang

A Single-Loop Bilevel Deep Learning Method for Optimal Control of Obstacle Problems

Optimal control of obstacle problems arises in a wide range of applications and is computationally challenging due to its nonsmoothness, nonlinearity, and bilevel structure. Classical numerical approaches rely on mesh-based discretization…

Optimization and Control · Mathematics 2026-01-08 Yongcun Song, Shangzhi Zeng, Jin Zhang, Lvgang Zhang

A Fully Single Loop Algorithm for Bilevel Optimization without Hessian Inverse

In this paper, we propose a new Hessian inverse free Fully Single Loop Algorithm (FSLA) for bilevel optimization problems. Classic algorithms for bilevel optimization admit a double loop structure which is computationally expensive.…

Machine Learning · Computer Science 2021-12-13 Junyi Li, Bin Gu, Heng Huang

BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach

Bilevel optimization (BO) is useful for solving a variety of important machine learning problems including but not limited to hyperparameter optimization, meta-learning, continual learning, and reinforcement learning. Conventional BO…

Machine Learning · Computer Science 2022-09-20 Mao Ye, Bo Liu, Stephen Wright, Peter Stone, Qiang Liu

Subspace-based Approximate Hessian Method for Zeroth-Order Optimization

Zeroth-order optimization addresses problems where gradient information is inaccessible or impractical to compute. While most existing methods rely on first-order approximations, incorporating second-order (curvature) information can, in…

Machine Learning · Computer Science 2025-07-09 Dongyoon Kim, Sungjae Lee, Wonjin Lee, Kwang In Kim

Optimal Algorithms for Stochastic Bilevel Optimization under Relaxed Smoothness Conditions

Stochastic Bilevel optimization usually involves minimizing an upper-level (UL) function that is dependent on the arg-min of a strongly-convex lower-level (LL) function. Several algorithms utilize Neumann series to approximate certain…

Optimization and Control · Mathematics 2023-06-22 Xuxing Chen, Tesi Xiao, Krishnakumar Balasubramanian

On the Convergence Theory for Hessian-Free Bilevel Algorithms

Bilevel optimization has arisen as a powerful tool in modern machine learning. However, due to the nested structure of bilevel optimization, even gradient-based methods require second-order derivative approximations via Jacobian- or/and…

Machine Learning · Computer Science 2022-06-07 Daouda Sow, Kaiyi Ji, Yingbin Liang

Query-Efficient Zeroth-Order Algorithms for Nonconvex Constrained Optimization

Zeroth-order optimization (ZO) has been a powerful framework for solving black-box problems, which estimates gradients using zeroth-order data to update variables iteratively. The practical applicability of ZO critically depends on the…

Optimization and Control · Mathematics 2026-03-03 Ruiyang Jin, Yuke Zhou, Yujie Tang, Jie Song, Siyang Gao

Distributed Stochastic Bilevel Optimization: Improved Complexity and Heterogeneity Analysis

This paper considers solving a class of nonconvex-strongly-convex distributed stochastic bilevel optimization (DSBO) problems with personalized inner-level objectives. Most existing algorithms require computational loops for hypergradient…

Optimization and Control · Mathematics 2025-04-08 Youcheng Niu, Jinming Xu, Ying Sun, Yan Huang, Li Chai

A Single-Loop Algorithm for Decentralized Bilevel Optimization

Bilevel optimization has gained significant attention in recent years due to its broad applications in machine learning. This paper focuses on bilevel optimization in decentralized networks and proposes a novel single-loop algorithm for…

Optimization and Control · Mathematics 2024-04-24 Youran Dong, Shiqian Ma, Junfeng Yang, Chao Yin

An Inexact Conditional Gradient Method for Constrained Bilevel Optimization

Bilevel optimization is an important class of optimization problems where one optimization problem is nested within another. While various methods have emerged to address unconstrained general bilevel optimization problems, there has been a…

Optimization and Control · Mathematics 2024-03-15 Nazanin Abolfazli, Ruichen Jiang, Aryan Mokhtari, Erfan Yazdandoost Hamedani

A Single-Loop First-Order Algorithm for Linearly Constrained Bilevel Optimization

We study bilevel optimization problems where the lower-level problems are strongly convex and have coupled linear constraints. To overcome the potential non-smoothness of the hyper-objective and the computational challenges associated with…

Optimization and Control · Mathematics 2026-02-06 Wei Shen, Jiawei Zhang, Minhui Huang, Cong Shen

Incentive-Based Load Curtailment with Limited Information: A Bilevel Zeroth-Order Learning Approach

Incentive-based load curtailment unlocks critical demand-side flexibility but is hindered by the limited knowledge of private user parameters and the inherent nonsmoothness of responses due to physical device constraints. We address this…

Systems and Control · Electrical Eng. & Systems 2026-05-27 Zhisen Jiang, Florian Dörfler, Saverio Bolognani

SPABA: A Single-Loop and Probabilistic Stochastic Bilevel Algorithm Achieving Optimal Sample Complexity

While stochastic bilevel optimization methods have been extensively studied for addressing large-scale nested optimization problems in machine learning, it remains an open question whether the optimal complexity bounds for solving bilevel…

Optimization and Control · Mathematics 2026-03-24 Tianshu Chu, Dachuan Xu, Wei Yao, Jin Zhang

Double Momentum Method for Lower-Level Constrained Bilevel Optimization

Bilevel optimization (BO) has recently gained prominence in many machine learning applications due to its ability to capture the nested structure inherent in these problems. Recently, many hypergradient methods have been proposed as…

Optimization and Control · Mathematics 2024-09-04 Wanli Shi, Yi Chang, Bin Gu

Elucidating Subspace Perturbation in Zeroth-Order Optimization: Theory and Practice at Scale

Zeroth-order (ZO) optimization has emerged as a promising alternative to gradient-based backpropagation methods, particularly for black-box optimization and large language model (LLM) fine-tuning. However, ZO methods often suffer from slow…

Machine Learning · Computer Science 2025-05-26 Sihwan Park, Jihun Yun, SungYub Kim, Souvik Kundu, Eunho Yang

A Fully First-Order Method for Stochastic Bilevel Optimization

We consider stochastic unconstrained bilevel optimization problems when only the first-order gradient oracles are available. While numerous optimization methods have been proposed for tackling bilevel problems, existing methods either tend…

Optimization and Control · Mathematics 2023-01-27 Jeongyeol Kwon, Dohyun Kwon, Stephen Wright, Robert Nowak

A Primal-Dual Approach to Bilevel Optimization with Multiple Inner Minima

Bilevel optimization has found extensive applications in modern machine learning problems such as hyperparameter optimization, neural architecture search, meta-learning, etc. While bilevel problems with a unique inner minimal point (e.g.,…

Optimization and Control · Mathematics 2022-06-09 Daouda Sow, Kaiyi Ji, Ziwei Guan, Yingbin Liang

Faster Gradient Methods for Highly-Smooth Stochastic Bilevel Optimization

This paper studies the complexity of finding an $\epsilon$-stationary point for stochastic bilevel optimization when the upper-level problem is nonconvex and the lower-level problem is strongly convex. Recent work proposed the first-order…

Optimization and Control · Mathematics 2026-03-10 Lesi Chen, Junru Li, El Mahdi Chayti, Jingzhao Zhang

Second-Order Bilevel Optimization with Accelerated Convergence Rates

This paper studies second-order methods for nonconvex-strongly-convex bilevel optimization. We propose a novel fully second-order bilevel approximation method (FSBA) that achieves an iteration complexity of…

Optimization and Control · Mathematics 2026-05-08 Sheng Yang, Chengchang Liu, Lesi Chen, John C. S. Lui