Related papers: Distributed Bilevel Optimization with Communicatio…

On the Convergence of Distributed Stochastic Bilevel Optimization Algorithms over a Network

Bilevel optimization has been applied to a wide variety of machine learning models, and numerous stochastic bilevel optimization algorithms have been developed in recent years. However, most existing algorithms restrict their focus on the…

Machine Learning · Computer Science 2023-03-28 Hongchang Gao , Bin Gu , My T. Thai

On the Communication Complexity of Decentralized Stochastic Bilevel Optimization

Stochastic bilevel optimization finds widespread applications in machine learning, including meta-learning, hyperparameter optimization, and neural architecture search. To extend stochastic bilevel optimization to distributed data, several…

Machine Learning · Computer Science 2026-05-26 Yihan Zhang , My T. Thai , Jie Wu , Hongchang Gao

Bilevel Distributed Optimization in Directed Networks

Motivated by emerging applications in wireless sensor networks and large-scale data processing, we consider distributed optimization over directed networks where the agents communicate their information locally to their neighbors to…

Optimization and Control · Mathematics 2021-03-22 Farzad Yousefian

Decentralized Bilevel Optimization: A Perspective from Transient Iteration Complexity

Stochastic bilevel optimization (SBO) is becoming increasingly essential in machine learning due to its versatility in handling nested structures. To address large-scale SBO, decentralized approaches have emerged as effective paradigms in…

Optimization and Control · Mathematics 2025-11-27 Boao Kong , Shuchen Zhu , Songtao Lu , Xinmeng Huang , Kun Yuan

Decentralized Gossip-Based Stochastic Bilevel Optimization over Communication Networks

Bilevel optimization have gained growing interests, with numerous applications found in meta learning, minimax games, reinforcement learning, and nested composition optimization. This paper studies the problem of distributed bilevel…

Machine Learning · Statistics 2022-06-23 Shuoguang Yang , Xuezhou Zhang , Mengdi Wang

On Linear Convergence of Distributed Stochastic Bilevel Optimization over Undirected Networks via Gradient Aggregation

Many large-scale constrained optimization problems can be formulated as bilevel distributed optimization tasks over undirected networks, where agents collaborate to minimize a global cost function while adhering to constraints, relying only…

Optimization and Control · Mathematics 2025-11-25 Ajay Tak , Mayank Baranwal

Decentralized Bilevel Optimization

Bilevel optimization has been successfully applied to many important machine learning problems. Algorithms for solving bilevel optimization have been studied under various settings. In this paper, we study the nonconvex-strongly-convex…

Optimization and Control · Mathematics 2022-06-14 Xuxing Chen , Minhui Huang , Shiqian Ma

Asynchronous Distributed Bilevel Optimization

Bilevel optimization plays an essential role in many machine learning tasks, ranging from hyperparameter optimization to meta-learning. Existing studies on bilevel optimization, however, focus on either centralized or synchronous…

Machine Learning · Computer Science 2023-02-27 Yang Jiao , Kai Yang , Tiancheng Wu , Dongjin Song , Chengtao Jian

Communication-efficient Variance-reduced Stochastic Gradient Descent

We consider the problem of communication efficient distributed optimization where multiple nodes exchange important algorithm information in every iteration to solve large problems. In particular, we focus on the stochastic variance-reduced…

Machine Learning · Computer Science 2020-03-16 Hossein S. Ghadikolaei , Sindri Magnusson

FAB: A First-Order AB-based Gradient Algorithm for Distributed Bilevel Optimization over Time-Varying Directed Graphs

Distributed optimization over time-varying directed graphs has shown promising performance in addressing challenges posed by complex communication constraints in real-world scenarios. In many practical settings, however, the direct…

Optimization and Control · Mathematics 2026-05-08 Yaoshuai Ma , Xiao Wang , Wei Yao , Jin Zhang

A Review of Bilevel Optimization: Methods, Emerging Applications, and Recent Advancements

This paper presents a comprehensive review of techniques proposed in the literature for solving bilevel optimization problems encountered in various real-life applications. Bilevel optimization is an appropriate choice for hierarchical…

Optimization and Control · Mathematics 2025-11-06 Dhaval Pujara , Ankur Sinha

Overcoming Lower-Level Constraints in Bilevel Optimization: A Novel Approach with Regularized Gap Functions

Constrained bilevel optimization tackles nested structures present in constrained learning tasks like constrained meta-learning, adversarial learning, and distributed bilevel optimization. However, existing bilevel optimization methods…

Optimization and Control · Mathematics 2024-06-05 Wei Yao , Haian Yin , Shangzhi Zeng , Jin Zhang

Accelerated Methods with Compressed Communications for Distributed Optimization Problems under Data Similarity

In recent years, as data and problem sizes have increased, distributed learning has become an essential tool for training high-performance models. However, the communication bottleneck, especially for high-dimensional data, is a challenge.…

Optimization and Control · Mathematics 2025-04-28 Dmitry Bylinkin , Aleksandr Beznosikov

Communication-Efficient Federated Bilevel Optimization with Local and Global Lower Level Problems

Bilevel Optimization has witnessed notable progress recently with new emerging efficient algorithms. However, its application in the Federated Learning setting remains relatively underexplored, and the impact of Federated Learning's…

Machine Learning · Computer Science 2024-02-28 Junyi Li , Feihu Huang , Heng Huang

A Penalty-Based Method for Communication-Efficient Decentralized Bilevel Programming

Bilevel programming has recently received attention in the literature due to its wide range of applications, including reinforcement learning and hyper-parameter optimization. However, it is widely assumed that the underlying bilevel…

Machine Learning · Computer Science 2024-10-11 Parvin Nazari , Ahmad Mousavi , Davoud Ataee Tarzanagh , George Michailidis

Bilevel learning

Bilevel learning refers to machine learning problems that can be formulated as bilevel optimization models, where decisions are organized in a hierarchical structure. This paradigm has recently gained considerable attention in machine…

Optimization and Control · Mathematics 2026-05-05 Riccardo Grazzi , Massimiliano Pontil , Saverio Salzo , Alain Zemkoho

Bilevel Learning via Inexact Stochastic Gradient Descent

Bilevel optimization is a central tool in machine learning for high-dimensional hyperparameter tuning. Its applications are vast; for instance, in imaging it can be used for learning data-adaptive regularizers and optimizing forward…

Optimization and Control · Mathematics 2025-11-11 Mohammad Sadegh Salehi , Subhadip Mukherjee , Lindon Roberts , Matthias J. Ehrhardt

Distributed Convex Optimization With Limited Communications

In this paper, a distributed convex optimization algorithm, termed \emph{distributed coordinate dual averaging} (DCDA) algorithm, is proposed. The DCDA algorithm addresses the scenario of a large distributed optimization problem with…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-10-31 Milind Rao , Stefano Rini , Andrea Goldsmith

Distributed Stochastic Bilevel Optimization: Improved Complexity and Heterogeneity Analysis

This paper consider solving a class of nonconvex-strongly-convex distributed stochastic bilevel optimization (DSBO) problems with personalized inner-level objectives. Most existing algorithms require computational loops for hypergradient…

Optimization and Control · Mathematics 2025-04-08 Youcheng Niu , Jinming Xu , Ying Sun , Yan Huang , Li Chai

Distributed learning with compressed gradients

Asynchronous computation and gradient compression have emerged as two key techniques for achieving scalability in distributed optimization for large-scale machine learning. This paper presents a unified analysis framework for distributed…

Optimization and Control · Mathematics 2018-11-30 Sarit Khirirat , Hamid Reza Feyzmahdavian , Mikael Johansson