Related papers: Tune As You Scale: Hyperparameter Optimization For…

Neural Parameter Search for Slimmer Fine-Tuned Models and Better Transfer

Foundation models and their checkpoints have significantly advanced deep learning, boosting performance across various applications. However, fine-tuned models often struggle outside their specific domains and exhibit considerable…

Machine Learning · Computer Science 2025-05-27 Guodong Du , Zitao Fang , Jing Li , Junlin Li , Runhua Jiang , Shuyang Yu , Yifei Guo , Yangneng Chen , Sim Kuan Goh , Ho-Kin Tang , Daojing He , Honghai Liu , Min Zhang

Tuning the Tuner: Introducing Hyperparameter Optimization for Auto-Tuning

Automatic performance tuning (auto-tuning) is widely used to optimize performance-critical applications across many scientific domains by finding the best program variant among many choices. Efficient optimization algorithms are crucial for…

Machine Learning · Computer Science 2025-10-10 Floris-Jan Willemsen , Rob V. van Nieuwpoort , Ben van Werkhoven

Autotune: A Derivative-free Optimization Framework for Hyperparameter Tuning

Machine learning applications often require hyperparameter tuning. The hyperparameters usually drive both the efficiency of the model training process and the resulting model quality. For hyperparameter tuning, machine learning algorithms…

Machine Learning · Computer Science 2018-08-06 Patrick Koch , Oleg Golovidov , Steven Gardner , Brett Wujek , Joshua Griffin , Yan Xu

Automatic Setting of DNN Hyper-Parameters by Mixing Bayesian Optimization and Tuning Rules

Deep learning techniques play an increasingly important role in industrial and research environments due to their outstanding results. However, the large number of hyper-parameters to be set may lead to errors if they are set manually. The…

Machine Learning · Computer Science 2020-06-04 Michele Fraccaroli , Evelina Lamma , Fabrizio Riguzzi

HyperTuner: A Cross-Layer Multi-Objective Hyperparameter Auto-Tuning Framework for Data Analytic Services

Hyper-parameters optimization (HPO) is vital for machine learning models. Besides model accuracy, other tuning intentions such as model training time and energy consumption are also worthy of attention from data analytic service providers.…

Machine Learning · Computer Science 2023-04-21 Hui Dou , Shanshan Zhu , Yiwen Zhang , Pengfei Chen , Zibin Zheng

Bayesian Optimization for auto-tuning GPU kernels

Finding optimal parameter configurations for tunable GPU kernels is a non-trivial exercise for large search spaces, even when automated. This poses an optimization task on a non-convex search space, using an expensive to evaluate function…

Machine Learning · Computer Science 2021-12-01 Floris-Jan Willemsen , Rob van Nieuwpoort , Ben van Werkhoven

Rethinking Pareto Frontier for Performance Evaluation of Deep Neural Networks

Performance optimization of deep learning models is conducted either manually or through automatic architecture search, or a combination of both. On the other hand, their performance strongly depends on the target hardware and how…

Machine Learning · Computer Science 2022-09-23 Vahid Partovi Nia , Alireza Ghaffari , Mahdi Zolnouri , Yvon Savaria

Hyper-parameter Tuning under a Budget Constraint

We study a budgeted hyper-parameter tuning problem, where we optimize the tuning result under a hard resource constraint. We propose to solve it as a sequential decision making problem, such that we can use the partial training progress of…

Machine Learning · Computer Science 2019-02-05 Zhiyun Lu , Chao-Kai Chiang , Fei Sha

Transfer Learning based Search Space Design for Hyperparameter Tuning

The tuning of hyperparameters becomes increasingly important as machine learning (ML) models have been extensively applied in data mining applications. Among various approaches, Bayesian optimization (BO) is a successful methodology to tune…

Machine Learning · Computer Science 2022-06-07 Yang Li , Yu Shen , Huaijun Jiang , Tianyi Bai , Wentao Zhang , Ce Zhang , Bin Cui

On tuning deep learning models: a data mining perspective

Deep learning algorithms vary depending on the underlying connection mechanism of nodes of them. They have various hyperparameters that are either set via specific algorithms or randomly chosen. Meanwhile, hyperparameters of deep learning…

Machine Learning · Computer Science 2020-11-20 M. M. Ozturk

Uncovering Energy-Efficient Practices in Deep Learning Training: Preliminary Steps Towards Green AI

Modern AI practices all strive towards the same goal: better results. In the context of deep learning, the term "results" often refers to the achieved accuracy on a competitive problem set. In this paper, we adopt an idea from the emerging…

Machine Learning · Computer Science 2023-03-27 Tim Yarally , Luís Cruz , Daniel Feitosa , June Sallou , Arie van Deursen

When Losses Align: Gradient-Based Composite Loss Weighting for Efficient Pretraining

Modern deep models are often pretrained on large-scale data with missing labels using composite objectives, where the relative weights of multiple loss terms act as hyperparameters. Tuning these weights with random search or Bayesian…

Machine Learning · Computer Science 2026-05-11 Ivan Karpukhin , Andrey Savchenko

Predicting the utility of search spaces for black-box optimization: a simple, budget-aware approach

Black box optimization requires specifying a search space to explore for solutions, e.g. a d-dimensional compact space, and this choice is critical for getting the best results at a reasonable budget. Unfortunately, determining a high…

Machine Learning · Computer Science 2021-12-20 Setareh Ariafar , Justin Gilmer , Zachary Nado , Jasper Snoek , Rodolphe Jenatton , George E. Dahl

Hyper-Tune: Towards Efficient Hyper-parameter Tuning at Scale

The ever-growing demand and complexity of machine learning are putting pressure on hyper-parameter tuning systems: while the evaluation cost of models continues to increase, the scalability of state-of-the-arts starts to become a crucial…

Machine Learning · Computer Science 2022-01-19 Yang Li , Yu Shen , Huaijun Jiang , Wentao Zhang , Jixiang Li , Ji Liu , Ce Zhang , Bin Cui

Weighting Is Worth the Wait: Bayesian Optimization with Importance Sampling

Many contemporary machine learning models require extensive tuning of hyperparameters to perform well. A variety of methods, such as Bayesian optimization, have been developed to automate and expedite this process. However, tuning remains…

Machine Learning · Computer Science 2020-02-25 Setareh Ariafar , Zelda Mariet , Ehsan Elhamifar , Dana Brooks , Jennifer Dy , Jasper Snoek

MetaTune: Meta-Learning Based Cost Model for Fast and Efficient Auto-tuning Frameworks

Deep learning compiler frameworks are gaining ground as a more portable back-end for deep learning applications on increasingly diverse hardware. However, they face the daunting challenge of matching performance offered by hand-tuned…

Machine Learning · Computer Science 2021-02-10 Jaehun Ryu , Hyojin Sung

Fast Hyperparameter Tuning using Bayesian Optimization with Directional Derivatives

In this paper we develop a Bayesian optimization based hyperparameter tuning framework inspired by statistical learning theory for classifiers. We utilize two key facts from PAC learning theory; the generalization bound will be higher for a…

Machine Learning · Computer Science 2019-02-08 Tinu Theckel Joy , Santu Rana , Sunil Gupta , Svetha Venkatesh

Efficient Hyperparameter Tuning for Large Scale Kernel Ridge Regression

Kernel methods provide a principled approach to nonparametric learning. While their basic implementations scale poorly to large problems, recent advances showed that approximate solvers can efficiently handle massive datasets. A shortcoming…

Machine Learning · Computer Science 2022-01-19 Giacomo Meanti , Luigi Carratino , Ernesto De Vito , Lorenzo Rosasco

Simple and Effective Gradient-Based Tuning of Sequence-to-Sequence Models

Recent trends towards training ever-larger language models have substantially improved machine learning performance across linguistic tasks. However, the huge cost of training larger models can make tuning them prohibitively expensive,…

Computation and Language · Computer Science 2022-09-13 Jared Lichtarge , Chris Alberti , Shankar Kumar

Bayesian Optimization for Iterative Learning

The performance of deep (reinforcement) learning systems crucially depends on the choice of hyperparameters. Their tuning is notoriously expensive, typically requiring an iterative training process to run for numerous steps to convergence.…

Machine Learning · Computer Science 2021-01-19 Vu Nguyen , Sebastian Schulze , Michael A Osborne