Related papers: A Generic First-Order Algorithmic Framework for Bi…

A General Descent Aggregation Framework for Gradient-based Bi-level Optimization

In recent years, a variety of gradient-based methods have been developed to solve Bi-Level Optimization (BLO) problems in machine learning and computer vision areas. However, the theoretical correctness and practical effectiveness of these…

Machine Learning · Computer Science 2022-01-04 Risheng Liu , Pan Mu , Xiaoming Yuan , Shangzhi Zeng , Jin Zhang

ADMM-based Bilevel Descent Aggregation Algorithm for Sparse Hyperparameter Selection

It is widely acknowledged that hyperparameter selection plays a critical role in the effectiveness of sparse optimization problems. The bilevel optimization provides a robust framework for addressing this issue, but these existing methods…

Optimization and Control · Mathematics 2026-03-11 Yunhai Xiao , Anqi Liu , Peili Li , Yanyun Ding

Bridging Constraints and Stochasticity: A Fully First-Order Method for Stochastic Bilevel Optimization with Linear Constraints

This work provides the first finite-time convergence guarantees for linearly constrained stochastic bilevel optimization using only first-order methods, requiring solely gradient information without any Hessian computations or second-order…

Optimization and Control · Mathematics 2025-11-18 Cac Phan , Kai Wang

BOME! Bilevel Optimization Made Easy: A Simple First-Order Approach

Bilevel optimization (BO) is useful for solving a variety of important machine learning problems including but not limited to hyperparameter optimization, meta-learning, continual learning, and reinforcement learning. Conventional BO…

Machine Learning · Computer Science 2022-09-20 Mao Ye , Bo Liu , Stephen Wright , Peter Stone , Qiang Liu

Towards Extremely Fast Bilevel Optimization with Self-governed Convergence Guarantees

Gradient methods have become mainstream techniques for Bi-Level Optimization (BLO) in learning and vision fields. The validity of existing works heavily relies on solving a series of approximation subproblems with extraordinarily high…

Optimization and Control · Mathematics 2022-05-23 Risheng Liu , Xuan Liu , Wei Yao , Shangzhi Zeng , Jin Zhang

A Fully First-Order Method for Stochastic Bilevel Optimization

We consider stochastic unconstrained bilevel optimization problems when only the first-order gradient oracles are available. While numerous optimization methods have been proposed for tackling bilevel problems, existing methods either tend…

Optimization and Control · Mathematics 2023-01-27 Jeongyeol Kwon , Dohyun Kwon , Stephen Wright , Robert Nowak

Effective Bilevel Optimization via Minimax Reformulation

Bilevel optimization has found successful applications in various machine learning problems, including hyper-parameter optimization, data cleaning, and meta-learning. However, its huge computational cost presents a significant challenge for…

Machine Learning · Computer Science 2024-11-05 Xiaoyu Wang , Rui Pan , Renjie Pi , Jipeng Zhang

Beyond Value Functions: Single-Loop Bilevel Optimization under Flatness Conditions

Bilevel optimization, a hierarchical optimization paradigm, has gained significant attention in a wide range of practical applications, notably in the fine-tuning of generative models. However, due to the nested problem structure, most…

Optimization and Control · Mathematics 2025-07-29 Liuyuan Jiang , Quan Xiao , Lisha Chen , Tianyi Chen

Riemannian Bilevel Optimization with Gradient Aggregation

Bilevel optimization (BLO) offers a principled framework for hierarchical decision-making and has been widely applied in machine learning tasks such as hyperparameter optimization and meta-learning. While existing BLO methods are mostly…

Optimization and Control · Mathematics 2025-10-20 Zhuo Chen , Xinjian Xu , Shihui Ying , Tieyong Zeng

On the Stability and Generalization of First-order Bilevel Minimax Optimization

Bilevel optimization and bilevel minimax optimization have recently emerged as unifying frameworks for a range of machine-learning tasks, including hyperparameter optimization and reinforcement learning. The existing literature focuses on…

Machine Learning · Computer Science 2026-04-23 Xuelin Zhang , Peipei Yuan

Directional first order approach for a class of bilevel programs

In this paper, we study a class of bilevel optimization program (BP), where the feasible set of the lower level program is independent of the upper level variable. For bilevel programs it is known that the first order approach requires the…

Optimization and Control · Mathematics 2026-02-27 Kuang Bai , Wei Yao , Jane J. Ye , Jin Zhang

UFO-BLO: Unbiased First-Order Bilevel Optimization

Bilevel optimization (BLO) is a popular approach with many applications including hyperparameter optimization, neural architecture search, adversarial robustness and model-agnostic meta-learning. However, the approach suffers from time and…

Machine Learning · Computer Science 2021-06-08 Valerii Likhosherstov , Xingyou Song , Krzysztof Choromanski , Jared Davis , Adrian Weller

Accelerated Fully First-Order Methods for Bilevel and Minimax Optimization

We present in this paper novel accelerated fully first-order methods in \emph{Bilevel Optimization} (BLO). Firstly, for BLO under the assumption that the lower-level functions admit the typical strong convexity assumption, the…

Optimization and Control · Mathematics 2024-07-10 Chris Junchi Li

On Penalty Methods for Nonconvex Bilevel Optimization and First-Order Stochastic Approximation

In this work, we study first-order algorithms for solving Bilevel Optimization (BO) where the objective functions are smooth but possibly nonconvex in both levels and the variables are restricted to closed convex sets. As a first step, we…

Optimization and Control · Mathematics 2024-02-13 Jeongyeol Kwon , Dohyun Kwon , Stephen Wright , Robert Nowak

Biased Stochastic First-Order Methods for Conditional Stochastic Optimization and Applications in Meta Learning

Conditional stochastic optimization covers a variety of applications ranging from invariant learning and causal inference to meta-learning. However, constructing unbiased gradient estimators for such problems is challenging due to the…

Optimization and Control · Mathematics 2024-06-04 Yifan Hu , Siqi Zhang , Xin Chen , Niao He

Investigating Bi-Level Optimization for Learning and Vision from a Unified Perspective: A Survey and Beyond

Bi-Level Optimization (BLO) is originated from the area of economic game theory and then introduced into the optimization community. BLO is able to handle problems with a hierarchical structure, involving two levels of optimization tasks,…

Machine Learning · Computer Science 2021-09-29 Risheng Liu , Jiaxin Gao , Jin Zhang , Deyu Meng , Zhouchen Lin

Fine-grained Analysis of Stability and Generalization for Stochastic Bilevel Optimization

Stochastic bilevel optimization (SBO) has been integrated into many machine learning paradigms recently, including hyperparameter optimization, meta learning, and reinforcement learning. Along with the wide range of applications, there have…

Machine Learning · Computer Science 2026-04-07 Xuelin Zhang , Hong Chen , Bin Gu , Tieliang Gong , Feng Zheng

Debiasing a First-order Heuristic for Approximate Bi-level Optimization

Approximate bi-level optimization (ABLO) consists of (outer-level) optimization problems, involving numerical (inner-level) optimization loops. While ABLO has many applications across deep learning, it suffers from time and memory…

Machine Learning · Computer Science 2021-06-09 Valerii Likhosherstov , Xingyou Song , Krzysztof Choromanski , Jared Davis , Adrian Weller

First-Order Algorithms Without Lipschitz Gradient: A Sequential Local Optimization Approach

First-order algorithms have been popular for solving convex and non-convex optimization problems. A key assumption for the majority of these algorithms is that the gradient of the objective function is globally Lipschitz continuous, but…

Optimization and Control · Mathematics 2024-02-07 Junyu Zhang , Mingyi Hong

A Conditional Gradient-based Method for Simple Bilevel Optimization with Convex Lower-level Problem

In this paper, we study a class of bilevel optimization problems, also known as simple bilevel optimization, where we minimize a smooth objective function over the optimal solution set of another convex constrained optimization problem.…

Optimization and Control · Mathematics 2023-04-25 Ruichen Jiang , Nazanin Abolfazli , Aryan Mokhtari , Erfan Yazdandoost Hamedani