Related papers: Efficient Hyperparameter Tuning with Dynamic Accur…

Adaptive Hyperparameter Optimization for Continual Learning Scenarios

Hyperparameter selection in continual learning scenarios is a challenging and underexplored aspect, especially in practical non-stationary environments. Traditional approaches, such as grid searches with held-out validation data from all…

Machine Learning · Computer Science 2024-06-21 Rudy Semola, Julio Hurtado, Vincenzo Lomonaco, Davide Bacciu

Stochastic Hyperparameter Optimization through Hypernetworks

Machine learning models are often tuned by nesting optimization of model weights inside the optimization of hyperparameters. We give a method to collapse this nested optimization into joint stochastic optimization of weights and…

Machine Learning · Computer Science 2018-03-09 Jonathan Lorraine, David Duvenaud

Autotune: A Derivative-free Optimization Framework for Hyperparameter Tuning

Machine learning applications often require hyperparameter tuning. The hyperparameters usually drive both the efficiency of the model training process and the resulting model quality. For hyperparameter tuning, machine learning algorithms…

Machine Learning · Computer Science 2018-08-06 Patrick Koch, Oleg Golovidov, Steven Gardner, Brett Wujek, Joshua Griffin, Yan Xu

Scalable One-Pass Optimisation of High-Dimensional Weight-Update Hyperparameters by Implicit Differentiation

Machine learning training methods depend plentifully and intricately on hyperparameters, motivating automated strategies for their optimisation. Many existing algorithms restart training for each new hyperparameter choice, at considerable…

Machine Learning · Computer Science 2022-04-22 Ross M. Clarke, Elre T. Oldewage, José Miguel Hernández-Lobato

Derivative-Free Optimization via Adaptive Sampling Strategies

In this paper, we present a novel derivative-free optimization framework for solving unconstrained stochastic optimization problems. Many problems in fields ranging from simulation optimization to reinforcement learning involve settings…

Optimization and Control · Mathematics 2024-04-19 Raghu Bollapragada, Cem Karamanli, Stefan M. Wild

On Hyperparameter Optimization of Machine Learning Algorithms: Theory and Practice

Machine learning algorithms have been used widely in various applications and areas. To fit a machine learning model into different problems, its hyper-parameters must be tuned. Selecting the best hyper-parameter configuration for machine…

Machine Learning · Computer Science 2022-10-06 Li Yang, Abdallah Shami

Learning to Optimize Multi-Objective Alignment Through Dynamic Reward Weighting

Prior work in multi-objective reinforcement learning typically uses linear reward scalarization with fixed weights, which provably fails to capture non-convex Pareto fronts and thus yields suboptimal results. This limitation becomes…

Machine Learning · Computer Science 2026-04-01 Yining Lu, Zilong Wang, Shiyang Li, Xin Liu, Changlong Yu, Qingyu Yin, Zhan Shi, Zixuan Zhang, Meng Jiang

Adaptive Optimization Algorithms for Machine Learning

Machine learning assumes a pivotal role in our data-driven world. The increasing scale of models and datasets necessitates quick and reliable algorithms for model training. This dissertation investigates adaptivity in machine learning…

Machine Learning · Computer Science 2023-11-20 Slavomír Hanzely

How to "DODGE" Complex Software Analytics?

Machine learning techniques applied to software engineering tasks can be improved by hyperparameter optimization, i.e., automatic tools that find good settings for a learner's control parameters. We show that such hyperparameter…

Software Engineering · Computer Science 2019-12-03 Amritanshu Agrawal, Wei Fu, Di Chen, Xipeng Shen, Tim Menzies

Hyperparameter optimization with approximate gradient

Most models in machine learning contain at least one hyperparameter to control for model complexity. Choosing an appropriate set of hyperparameters is both crucial in terms of model accuracy and computationally challenging. In this work we…

Machine Learning · Statistics 2022-11-22 Fabian Pedregosa

Improving Pre-Trained Weights Through Meta-Heuristics Fine-Tuning

Machine Learning algorithms have been extensively researched throughout the last decade, leading to unprecedented advances in a broad range of applications, such as image classification and reconstruction, object recognition, and text…

Artificial Intelligence · Computer Science 2022-12-20 Gustavo H. de Rosa, Mateus Roder, João Paulo Papa, Claudio F. G. dos Santos

EXACT: How to Train Your Accuracy

Classification tasks are usually evaluated in terms of accuracy. However, accuracy is discontinuous and cannot be directly optimized using gradient ascent. Popular methods minimize cross-entropy, hinge loss, or other surrogate losses, which…

Machine Learning · Computer Science 2024-07-25 Ivan Karpukhin, Stanislav Dereka, Sergey Kolesnikov

HyperNOMAD: Hyperparameter optimization of deep neural networks using mesh adaptive direct search

The performance of deep neural networks is highly sensitive to the choice of the hyperparameters that define the structure of the network and the learning process. When facing a new application, tuning a deep neural network is a tedious and…

Machine Learning · Computer Science 2019-07-04 Dounia Lakhmiri, Sébastien Le Digabel, Christophe Tribes

Optimization and Supervised Machine Learning Methods for Fitting Numerical Physics Models without Derivatives

We address the calibration of a computationally expensive nuclear physics model for which derivative information with respect to the fit parameters is not readily available. Of particular interest is the performance of optimization-based…

Nuclear Theory · Physics 2020-12-16 Raghu Bollapragada, Matt Menickelly, Witold Nazarewicz, Jared O'Neal, Paul-Gerhard Reinhard, Stefan M. Wild

Derivative-Free Methods for Policy Optimization: Guarantees for Linear Quadratic Systems

We study derivative-free methods for policy optimization over the class of linear policies. We focus on characterizing the convergence rate of these methods when applied to linear-quadratic systems, and study various settings of driving…

Machine Learning · Computer Science 2020-05-19 Dhruv Malik, Ashwin Pananjady, Kush Bhatia, Koulik Khamaru, Peter L. Bartlett, Martin J. Wainwright

Inexact Derivative-Free Optimization for Bilevel Learning

Variational regularization techniques are dominant in the field of mathematical imaging. A drawback of these techniques is that they are dependent on a number of parameters which have to be set by the user. A by now common strategy to…

Optimization and Control · Mathematics 2020-12-10 Matthias J. Ehrhardt, Lindon Roberts

Fast Hyperparameter Tuning using Bayesian Optimization with Directional Derivatives

In this paper we develop a Bayesian optimization based hyperparameter tuning framework inspired by statistical learning theory for classifiers. We utilize two key facts from PAC learning theory; the generalization bound will be higher for a…

Machine Learning · Computer Science 2019-02-08 Tinu Theckel Joy, Santu Rana, Sunil Gupta, Svetha Venkatesh

Gradient-based Hyperparameter Optimization through Reversible Learning

Tuning hyperparameters of learning algorithms is hard because gradients are usually unavailable. We compute exact gradients of cross-validation performance with respect to all hyperparameters by chaining derivatives backwards through the…

Machine Learning · Statistics 2015-04-03 Dougal Maclaurin, David Duvenaud, Ryan P. Adams

Solver-Free Decision-Focused Learning for Linear Optimization Problems

Mathematical optimization is a fundamental tool for decision-making in a wide range of applications. However, in many real-world scenarios, the parameters of the optimization problem are not known a priori and must be predicted from…

Machine Learning · Computer Science 2025-11-13 Senne Berden, Ali İrfan Mahmutoğulları, Dimos Tsouros, Tias Guns

Training Image Derivatives: Increased Accuracy and Universal Robustness

Derivative training is an established method that can significantly increase the accuracy of neural networks in certain low-dimensional tasks. In this paper, we extend this improvement to an illustrative image analysis problem:…

Machine Learning · Computer Science 2025-02-04 Vsevolod I. Avrutskiy