English
Related papers

Related papers: Tune As You Scale: Hyperparameter Optimization For…

200 papers

Foundation models and their checkpoints have significantly advanced deep learning, boosting performance across various applications. However, fine-tuned models often struggle outside their specific domains and exhibit considerable…

Automatic performance tuning (auto-tuning) is widely used to optimize performance-critical applications across many scientific domains by finding the best program variant among many choices. Efficient optimization algorithms are crucial for…

Machine Learning · Computer Science 2025-10-10 Floris-Jan Willemsen , Rob V. van Nieuwpoort , Ben van Werkhoven

Machine learning applications often require hyperparameter tuning. The hyperparameters usually drive both the efficiency of the model training process and the resulting model quality. For hyperparameter tuning, machine learning algorithms…

Machine Learning · Computer Science 2018-08-06 Patrick Koch , Oleg Golovidov , Steven Gardner , Brett Wujek , Joshua Griffin , Yan Xu

Deep learning techniques play an increasingly important role in industrial and research environments due to their outstanding results. However, the large number of hyper-parameters to be set may lead to errors if they are set manually. The…

Machine Learning · Computer Science 2020-06-04 Michele Fraccaroli , Evelina Lamma , Fabrizio Riguzzi

Hyper-parameters optimization (HPO) is vital for machine learning models. Besides model accuracy, other tuning intentions such as model training time and energy consumption are also worthy of attention from data analytic service providers.…

Machine Learning · Computer Science 2023-04-21 Hui Dou , Shanshan Zhu , Yiwen Zhang , Pengfei Chen , Zibin Zheng

Finding optimal parameter configurations for tunable GPU kernels is a non-trivial exercise for large search spaces, even when automated. This poses an optimization task on a non-convex search space, using an expensive to evaluate function…

Machine Learning · Computer Science 2021-12-01 Floris-Jan Willemsen , Rob van Nieuwpoort , Ben van Werkhoven

Performance optimization of deep learning models is conducted either manually or through automatic architecture search, or a combination of both. On the other hand, their performance strongly depends on the target hardware and how…

Machine Learning · Computer Science 2022-09-23 Vahid Partovi Nia , Alireza Ghaffari , Mahdi Zolnouri , Yvon Savaria

We study a budgeted hyper-parameter tuning problem, where we optimize the tuning result under a hard resource constraint. We propose to solve it as a sequential decision making problem, such that we can use the partial training progress of…

Machine Learning · Computer Science 2019-02-05 Zhiyun Lu , Chao-Kai Chiang , Fei Sha

The tuning of hyperparameters becomes increasingly important as machine learning (ML) models have been extensively applied in data mining applications. Among various approaches, Bayesian optimization (BO) is a successful methodology to tune…

Machine Learning · Computer Science 2022-06-07 Yang Li , Yu Shen , Huaijun Jiang , Tianyi Bai , Wentao Zhang , Ce Zhang , Bin Cui

Deep learning algorithms vary depending on the underlying connection mechanism of nodes of them. They have various hyperparameters that are either set via specific algorithms or randomly chosen. Meanwhile, hyperparameters of deep learning…

Machine Learning · Computer Science 2020-11-20 M. M. Ozturk

Modern AI practices all strive towards the same goal: better results. In the context of deep learning, the term "results" often refers to the achieved accuracy on a competitive problem set. In this paper, we adopt an idea from the emerging…

Machine Learning · Computer Science 2023-03-27 Tim Yarally , Luís Cruz , Daniel Feitosa , June Sallou , Arie van Deursen

Modern deep models are often pretrained on large-scale data with missing labels using composite objectives, where the relative weights of multiple loss terms act as hyperparameters. Tuning these weights with random search or Bayesian…

Machine Learning · Computer Science 2026-05-11 Ivan Karpukhin , Andrey Savchenko

Black box optimization requires specifying a search space to explore for solutions, e.g. a d-dimensional compact space, and this choice is critical for getting the best results at a reasonable budget. Unfortunately, determining a high…

Machine Learning · Computer Science 2021-12-20 Setareh Ariafar , Justin Gilmer , Zachary Nado , Jasper Snoek , Rodolphe Jenatton , George E. Dahl

The ever-growing demand and complexity of machine learning are putting pressure on hyper-parameter tuning systems: while the evaluation cost of models continues to increase, the scalability of state-of-the-arts starts to become a crucial…

Machine Learning · Computer Science 2022-01-19 Yang Li , Yu Shen , Huaijun Jiang , Wentao Zhang , Jixiang Li , Ji Liu , Ce Zhang , Bin Cui

Many contemporary machine learning models require extensive tuning of hyperparameters to perform well. A variety of methods, such as Bayesian optimization, have been developed to automate and expedite this process. However, tuning remains…

Machine Learning · Computer Science 2020-02-25 Setareh Ariafar , Zelda Mariet , Ehsan Elhamifar , Dana Brooks , Jennifer Dy , Jasper Snoek

Deep learning compiler frameworks are gaining ground as a more portable back-end for deep learning applications on increasingly diverse hardware. However, they face the daunting challenge of matching performance offered by hand-tuned…

Machine Learning · Computer Science 2021-02-10 Jaehun Ryu , Hyojin Sung

In this paper we develop a Bayesian optimization based hyperparameter tuning framework inspired by statistical learning theory for classifiers. We utilize two key facts from PAC learning theory; the generalization bound will be higher for a…

Machine Learning · Computer Science 2019-02-08 Tinu Theckel Joy , Santu Rana , Sunil Gupta , Svetha Venkatesh

Kernel methods provide a principled approach to nonparametric learning. While their basic implementations scale poorly to large problems, recent advances showed that approximate solvers can efficiently handle massive datasets. A shortcoming…

Machine Learning · Computer Science 2022-01-19 Giacomo Meanti , Luigi Carratino , Ernesto De Vito , Lorenzo Rosasco

Recent trends towards training ever-larger language models have substantially improved machine learning performance across linguistic tasks. However, the huge cost of training larger models can make tuning them prohibitively expensive,…

Computation and Language · Computer Science 2022-09-13 Jared Lichtarge , Chris Alberti , Shankar Kumar

The performance of deep (reinforcement) learning systems crucially depends on the choice of hyperparameters. Their tuning is notoriously expensive, typically requiring an iterative training process to run for numerous steps to convergence.…

Machine Learning · Computer Science 2021-01-19 Vu Nguyen , Sebastian Schulze , Michael A Osborne
‹ Prev 1 2 3 10 Next ›