Related papers: A Linear Programming Enhanced Genetic Algorithm fo…

A Gradient-based Bilevel Optimization Approach for Tuning Hyperparameters in Machine Learning

Hyperparameter tuning is an active area of research in machine learning, where the aim is to identify the optimal hyperparameters that provide the best performance on the validation set. Hyperparameter tuning is often achieved using naive…

Machine Learning · Computer Science 2020-07-23 Ankur Sinha , Tanmay Khandait , Raja Mohanty

Bilevel Programming for Hyperparameter Optimization and Meta-Learning

We introduce a framework based on bilevel programming that unifies gradient-based hyperparameter optimization and meta-learning. We show that an approximate version of the bilevel problem can be solved by taking into explicit account the…

Machine Learning · Statistics 2018-07-04 Luca Franceschi , Paolo Frasconi , Saverio Salzo , Riccardo Grazzi , Massimilano Pontil

Hyperparameter Tuning Through Pessimistic Bilevel Optimization

Automated hyperparameter search in machine learning, especially for deep learning models, is typically formulated as a bilevel optimization problem, with hyperparameter values determined by the upper level and the model learning achieved by…

Machine Learning · Computer Science 2024-12-06 Meltem Apaydin Ustun , Liang Xu , Bo Zeng , Xiaoning Qian

A Globally Convergent Gradient-based Bilevel Hyperparameter Optimization Method

Hyperparameter optimization in machine learning is often achieved using naive techniques that only lead to an approximate set of hyperparameters. Although techniques such as Bayesian optimization perform an intelligent search on a given…

Machine Learning · Computer Science 2023-06-21 Ankur Sinha , Satender Gunwal , Shivam Kumar

Bilevel Programming Approach for Image Restoration Problems with Automatically Hyperparameter Selection

In optimization-based image restoration models, the correct selection of hyperparameters is crucial for achieving superior performance. However, current research typically involves manual tuning of these hyperparameters, which is highly…

Optimization and Control · Mathematics 2026-04-03 Hang Xie , Xuewen Li , Peili Li , Qiuyu Wang

Search Algorithms for Automated Hyper-Parameter Tuning

Machine learning is a powerful method for modeling in different fields such as education. Its capability to accurately predict students' success makes it an ideal tool for decision-making tasks related to higher education. The accuracy of…

Machine Learning · Computer Science 2021-05-03 Leila Zahedi , Farid Ghareh Mohammadi , Shabnam Rezapour , Matthew W. Ohland , M. Hadi Amini

CPMLHO:Hyperparameter Tuning via Cutting Plane and Mixed-Level Optimization

The hyperparameter optimization of neural network can be expressed as a bilevel optimization problem. The bilevel optimization is used to automatically update the hyperparameter, and the gradient of the hyperparameter is the approximate…

Machine Learning · Computer Science 2022-12-14 Shuo Yang , Yang Jiao , Shaoyu Dou , Mana Zheng , Chen Zhu

Lower-level Duality Based Reformulation and Majorization Minimization Algorithm for Hyperparameter Optimization

Hyperparameter tuning is an important task of machine learning, which can be formulated as a bilevel program (BLP). However, most existing algorithms are not applicable for BLP with non-smooth lower-level problems. To address this, we…

Optimization and Control · Mathematics 2024-03-04 He Chen , Haochen Xu , Rujun Jiang , Anthony Man-Cho So

Efficient Hyperparameter Optimization in Deep Learning Using a Variable Length Genetic Algorithm

Convolutional Neural Networks (CNN) have gained great success in many artificial intelligence tasks. However, finding a good set of hyperparameters for a CNN remains a challenging task. It usually takes an expert with deep knowledge, and…

Neural and Evolutionary Computing · Computer Science 2020-06-25 Xueli Xiao , Ming Yan , Sunitha Basodi , Chunyan Ji , Yi Pan

Improved Bilevel Model: Fast and Optimal Algorithm with Theoretical Guarantee

Due to the hierarchical structure of many machine learning problems, bilevel programming is becoming more and more important recently, however, the complicated correlation between the inner and outer problem makes it extremely challenging…

Machine Learning · Computer Science 2020-09-03 Junyi Li , Bin Gu , Heng Huang

Optimizing Hyperparameters in CNNs using Bilevel Programming in Time Series Data

Hyperparameter optimization has remained a central topic within the machine learning community due to its ability to produce state-of-the-art results. With the recent interest growing in the usage of CNNs for time series prediction, we…

Machine Learning · Computer Science 2021-01-20 Taniya Seth , Pranab K. Muhuri

Hyperparameter Optimization in Machine Learning

Hyperparameters are configuration variables controlling the behavior of machine learning algorithms. They are ubiquitous in machine learning and artificial intelligence and the choice of their values determines the effectiveness of systems…

Machine Learning · Statistics 2025-11-26 Luca Franceschi , Michele Donini , Valerio Perrone , Aaron Klein , Cédric Archambeau , Matthias Seeger , Massimiliano Pontil , Paolo Frasconi

Hyperparameter Tuning for Deep Reinforcement Learning Applications

Reinforcement learning (RL) applications, where an agent can simply learn optimal behaviors by interacting with the environment, are quickly gaining tremendous success in a wide variety of applications from controlling simple pendulums to…

Machine Learning · Computer Science 2022-01-28 Mariam Kiran , Melis Ozyildirim

Bilevel optimization for learning hyperparameters: Application to solving PDEs and inverse problems with Gaussian processes

Methods for solving scientific computing and inference problems, such as kernel- and neural network-based approaches for partial differential equations (PDEs), inverse problems, and supervised learning tasks, depend crucially on the choice…

Machine Learning · Statistics 2025-10-08 Nicholas H. Nelsen , Houman Owhadi , Andrew M. Stuart , Xianjin Yang , Zongren Zou

Scalable Gradient-Based Tuning of Continuous Regularization Hyperparameters

Hyperparameter selection generally relies on running multiple full training trials, with selection based on validation set performance. We propose a gradient-based approach for locally adjusting hyperparameters during training of the model.…

Machine Learning · Computer Science 2016-06-20 Jelena Luketina , Mathias Berglund , Klaus Greff , Tapani Raiko

On tuning deep learning models: a data mining perspective

Deep learning algorithms vary depending on the underlying connection mechanism of nodes of them. They have various hyperparameters that are either set via specific algorithms or randomly chosen. Meanwhile, hyperparameters of deep learning…

Machine Learning · Computer Science 2020-11-20 M. M. Ozturk

A Multi-level procedure for enhancing accuracy of machine learning algorithms

We propose a multi-level method to increase the accuracy of machine learning algorithms for approximating observables in scientific computing, particularly those that arise in systems modeled by differential equations. The algorithm relies…

Numerical Analysis · Mathematics 2020-07-06 Kjetil O. Lye , Siddhartha Mishra , Roberto Molinaro

Learning Data Augmentation with Online Bilevel Optimization for Image Classification

Data augmentation is a key practice in machine learning for improving generalization performance. However, finding the best data augmentation hyperparameters requires domain knowledge or a computationally demanding search. We address this…

Computer Vision and Pattern Recognition · Computer Science 2020-11-11 Saypraseuth Mounsaveng , Issam Laradji , Ismail Ben Ayed , David Vazquez , Marco Pedersoli

Deep Genetic Network

Optimizing a neural network's performance is a tedious and time taking process, this iterative process does not have any defined solution which can work for all the problems. Optimization can be roughly categorized into - Architecture and…

Machine Learning · Computer Science 2019-12-16 Siddhartha Dhar Choudhury , Shashank Pandey , Kunal Mehrotra

Hyperparameter Search in Machine Learning

We introduce the hyperparameter search problem in the field of machine learning and discuss its main challenges from an optimization perspective. Machine learning methods attempt to build models that capture some element of interest based…

Machine Learning · Computer Science 2015-04-07 Marc Claesen , Bart De Moor