Related papers: Far-HO: A Bilevel Programming Package for Hyperpar…

Bilevel Programming for Hyperparameter Optimization and Meta-Learning

We introduce a framework based on bilevel programming that unifies gradient-based hyperparameter optimization and meta-learning. We show that an approximate version of the bilevel problem can be solved by taking into explicit account the…

Machine Learning · Statistics 2018-07-04 Luca Franceschi , Paolo Frasconi , Saverio Salzo , Riccardo Grazzi , Massimilano Pontil

CPMLHO:Hyperparameter Tuning via Cutting Plane and Mixed-Level Optimization

The hyperparameter optimization of neural network can be expressed as a bilevel optimization problem. The bilevel optimization is used to automatically update the hyperparameter, and the gradient of the hyperparameter is the approximate…

Machine Learning · Computer Science 2022-12-14 Shuo Yang , Yang Jiao , Shaoyu Dou , Mana Zheng , Chen Zhu

Learning to Solve Constrained Bilevel Control Co-Design Problems

Learning to Optimize (L2O) is a subfield of machine learning (ML) in which ML models are trained to solve parametric optimization problems. The general goal is to learn a fast approximator of solutions to constrained optimization problems,…

Optimization and Control · Mathematics 2025-12-04 James Kotary , Himanshu Sharma , Ethan King , Draguna Vrabie , Ferdinando Fioretto , Jan Drgona

Investigating Bi-Level Optimization for Learning and Vision from a Unified Perspective: A Survey and Beyond

Bi-Level Optimization (BLO) is originated from the area of economic game theory and then introduced into the optimization community. BLO is able to handle problems with a hierarchical structure, involving two levels of optimization tasks,…

Machine Learning · Computer Science 2021-09-29 Risheng Liu , Jiaxin Gao , Jin Zhang , Deyu Meng , Zhouchen Lin

TransBO: Hyperparameter Optimization via Two-Phase Transfer Learning

With the extensive applications of machine learning models, automatic hyperparameter optimization (HPO) has become increasingly important. Motivated by the tuning behaviors of human experts, it is intuitive to leverage auxiliary knowledge…

Machine Learning · Computer Science 2022-06-07 Yang Li , Yu Shen , Huaijun Jiang , Wentao Zhang , Zhi Yang , Ce Zhang , Bin Cui

A survey on multi-objective hyperparameter optimization algorithms for Machine Learning

Hyperparameter optimization (HPO) is a necessary step to ensure the best possible performance of Machine Learning (ML) algorithms. Several methods have been developed to perform HPO; most of these are focused on optimizing one performance…

Machine Learning · Computer Science 2022-11-16 Alejandro Morales-Hernández , Inneke Van Nieuwenhuyse , Sebastian Rojas Gonzalez

A Stochastic Approach to Bi-Level Optimization for Hyperparameter Optimization and Meta Learning

We tackle the general differentiable meta learning problem that is ubiquitous in modern deep learning, including hyperparameter optimization, loss function learning, few-shot learning, invariance learning and more. These problems are often…

Machine Learning · Computer Science 2024-10-15 Minyoung Kim , Timothy M. Hospedales

Improving Hyperparameter Optimization by Planning Ahead

Hyperparameter optimization (HPO) is generally treated as a bi-level optimization problem that involves fitting a (probabilistic) surrogate model to a set of observed hyperparameter responses, e.g. validation loss, and consequently…

Machine Learning · Computer Science 2021-10-18 Hadi S. Jomaa , Jonas Falkner , Lars Schmidt-Thieme

A Bridge Between Hyperparameter Optimization and Learning-to-learn

We consider a class of a nested optimization problems involving inner and outer objectives. We observe that by taking into explicit account the optimization dynamics for the inner objective it is possible to derive a general framework that…

Machine Learning · Statistics 2019-08-22 Luca Franceschi , Michele Donini , Paolo Frasconi , Massimiliano Pontil

Efficient Curvature-Aware Hypergradient Approximation for Bilevel Optimization

Bilevel optimization is a powerful tool for many machine learning problems, such as hyperparameter optimization and meta-learning. Estimating hypergradients (also known as implicit gradients) is crucial for developing gradient-based methods…

Optimization and Control · Mathematics 2025-05-06 Youran Dong , Junfeng Yang , Wei Yao , Jin Zhang

Analyzing Inexact Hypergradients for Bilevel Learning

Estimating hyperparameters has been a long-standing problem in machine learning. We consider the case where the task at hand is modeled as the solution to an optimization problem. Here the exact gradient with respect to the hyperparameters…

Optimization and Control · Mathematics 2023-11-16 Matthias J. Ehrhardt , Lindon Roberts

SHADHO: Massively Scalable Hardware-Aware Distributed Hyperparameter Optimization

Computer vision is experiencing an AI renaissance, in which machine learning models are expediting important breakthroughs in academic research and commercial applications. Effectively training these models, however, is not trivial due in…

Machine Learning · Computer Science 2018-01-23 Jeff Kinnison , Nathaniel Kremer-Herman , Douglas Thain , Walter Scheirer

Hyper-Parameter Optimization: A Review of Algorithms and Applications

Since deep neural networks were developed, they have made huge contributions to everyday lives. Machine learning provides more rational advice than humans are capable of in almost every aspect of daily life. However, despite this…

Machine Learning · Computer Science 2020-03-13 Tong Yu , Hong Zhu

On Implicit Bias in Overparameterized Bilevel Optimization

Many problems in machine learning involve bilevel optimization (BLO), including hyperparameter optimization, meta-learning, and dataset distillation. Bilevel problems consist of two nested sub-problems, called the outer and inner problems,…

Machine Learning · Computer Science 2022-12-29 Paul Vicol , Jonathan Lorraine , Fabian Pedregosa , David Duvenaud , Roger Grosse

Improved Bilevel Model: Fast and Optimal Algorithm with Theoretical Guarantee

Due to the hierarchical structure of many machine learning problems, bilevel programming is becoming more and more important recently, however, the complicated correlation between the inner and outer problem makes it extremely challenging…

Machine Learning · Computer Science 2020-09-03 Junyi Li , Bin Gu , Heng Huang

Learning to Optimize Tensor Programs

We introduce a learning-based framework to optimize tensor programs for deep learning workloads. Efficient implementations of tensor operators, such as matrix multiplication and high dimensional convolution, are key enablers of effective…

Machine Learning · Computer Science 2019-01-10 Tianqi Chen , Lianmin Zheng , Eddie Yan , Ziheng Jiang , Thierry Moreau , Luis Ceze , Carlos Guestrin , Arvind Krishnamurthy

HYPPO: A Surrogate-Based Multi-Level Parallelism Tool for Hyperparameter Optimization

We present a new software, HYPPO, that enables the automatic tuning of hyperparameters of various deep learning (DL) models. Unlike other hyperparameter optimization (HPO) methods, HYPPO uses adaptive surrogate models and directly accounts…

Machine Learning · Computer Science 2022-01-03 Vincent Dumont , Casey Garner , Anuradha Trivedi , Chelsea Jones , Vidya Ganapati , Juliane Mueller , Talita Perciano , Mariam Kiran , Marc Day

Understanding the Generalization of Bilevel Programming in Hyperparameter Optimization: A Tale of Bias-Variance Decomposition

Gradient-based hyperparameter optimization (HPO) have emerged recently, leveraging bilevel programming techniques to optimize hyperparameter by estimating hypergradient w.r.t. validation loss. Nevertheless, previous theoretical works mainly…

Machine Learning · Computer Science 2026-02-23 Yubo Zhou , Jun Shu , Junmin Liu , Deyu Meng

Implicit Bilevel Optimization: Differentiating through Bilevel Optimization Programming

Bilevel Optimization Programming is used to model complex and conflicting interactions between agents, for example in Robust AI or Privacy-preserving AI. Integrating bilevel mathematical programming within deep learning is thus an essential…

Machine Learning · Computer Science 2023-03-01 Francesco Alesiani

UFO-BLO: Unbiased First-Order Bilevel Optimization

Bilevel optimization (BLO) is a popular approach with many applications including hyperparameter optimization, neural architecture search, adversarial robustness and model-agnostic meta-learning. However, the approach suffers from time and…

Machine Learning · Computer Science 2021-06-08 Valerii Likhosherstov , Xingyou Song , Krzysztof Choromanski , Jared Davis , Adrian Weller