Related papers: Improving Fast Minimum-Norm Attacks with Hyperpara…

HO-FMN: Hyperparameter Optimization for Fast Minimum-Norm Attacks

Gradient-based attacks are a primary tool to evaluate robustness of machine-learning models. However, many attacks tend to provide overly-optimistic evaluations as they use fixed loss functions, optimizers, step-size schedulers, and default…

Machine Learning · Computer Science 2025-11-27 Raffaele Mura, Giuseppe Floris, Luca Scionis, Giorgio Piras, Maura Pintor, Ambra Demontis, Giorgio Giacinto, Battista Biggio, Fabio Roli

Fast Minimum-norm Adversarial Attacks through Adaptive Norm Constraints

Evaluating adversarial robustness amounts to finding the minimum perturbation needed to have an input sample misclassified. The inherent complexity of the underlying optimization requires current gradient-based attacks to be carefully…

Machine Learning · Computer Science 2021-11-22 Maura Pintor, Fabio Roli, Wieland Brendel, Battista Biggio

On Hyperparameter Optimization of Machine Learning Algorithms: Theory and Practice

Machine learning algorithms have been used widely in various applications and areas. To fit a machine learning model into different problems, its hyper-parameters must be tuned. Selecting the best hyper-parameter configuration for machine…

Machine Learning · Computer Science 2022-10-06 Li Yang, Abdallah Shami

HyperJump: Accelerating HyperBand via Risk Modelling

In the literature on hyper-parameter tuning, a number of recent solutions rely on low-fidelity observations (e.g., training with sub-sampled datasets) in order to efficiently identify promising configurations to be then tested via…

Machine Learning · Computer Science 2022-12-05 Pedro Mendes, Maria Casimiro, Paolo Romano, David Garlan

Hyperband: A Novel Bandit-Based Approach to Hyperparameter Optimization

Performance of machine learning algorithms depends critically on identifying a good set of hyperparameters. While recent approaches use Bayesian optimization to adaptively select configurations, we focus on speeding up random search through…

Machine Learning · Computer Science 2018-06-20 Lisha Li, Kevin Jamieson, Giulia DeSalvo, Afshin Rostamizadeh, Ameet Talwalkar

Behavior of Hyper-Parameters for Selected Machine Learning Algorithms: An Empirical Investigation

Hyper-parameters (HPs) are an important part of machine learning (ML) model development and can greatly influence performance. This paper studies their behavior for three algorithms: Extreme Gradient Boosting (XGB), Random Forest (RF), and…

Machine Learning · Computer Science 2022-11-17 Anwesha Bhattacharyya, Joel Vaughan, Vijayan N. Nair

Hyperparameter Optimization: A Spectral Approach

We give a simple, fast algorithm for hyperparameter optimization inspired by techniques from the analysis of Boolean functions. We focus on the high-dimensional regime where the canonical example is training a neural network with a large…

Machine Learning · Computer Science 2018-01-23 Elad Hazan, Adam Klivans, Yang Yuan

Hyperparameter Optimization: Foundations, Algorithms, Best Practices and Open Challenges

Most machine learning algorithms are configured by one or several hyperparameters that must be carefully chosen and often considerably impact performance. To avoid a time consuming and unreproducible manual trial-and-error process to find…

Machine Learning · Statistics 2021-11-29 Bernd Bischl, Martin Binder, Michel Lang, Tobias Pielok, Jakob Richter, Stefan Coors, Janek Thomas, Theresa Ullmann, Marc Becker, Anne-Laure Boulesteix, Difan Deng, Marius Lindauer

MetaOptimize: A Framework for Optimizing Step Sizes and Other Meta-parameters

We address the challenge of optimizing meta-parameters (hyperparameters) in machine learning, a key factor for efficient training and high model performance. Rather than relying on expensive meta-parameter search methods, we introduce…

Machine Learning · Computer Science 2025-07-10 Arsalan Sharifnassab, Saber Salehkaleybar, Richard Sutton

Hyperparameter Optimization in Machine Learning

Hyperparameters are configuration variables controlling the behavior of machine learning algorithms. They are ubiquitous in machine learning and artificial intelligence and the choice of their values determines the effectiveness of systems…

Machine Learning · Statistics 2025-11-26 Luca Franceschi, Michele Donini, Valerio Perrone, Aaron Klein, Cédric Archambeau, Matthias Seeger, Massimiliano Pontil, Paolo Frasconi

Hyperparameter Optimisation with Early Termination of Poor Performers

It is typical for a machine learning system to have numerous hyperparameters that affect its learning rate and prediction quality. Finding a good combination of the hyperparameters is, however, a challenging job. This is mainly because…

Machine Learning · Computer Science 2019-08-08 Dobromir Marinov, Daniel Karapetyan

Hyper-Parameter Optimization: A Review of Algorithms and Applications

Since deep neural networks were developed, they have made huge contributions to everyday lives. Machine learning provides more rational advice than humans are capable of in almost every aspect of daily life. However, despite this…

Machine Learning · Computer Science 2020-03-13 Tong Yu, Hong Zhu

Learning Acceleration Algorithms for Fast Parametric Convex Optimization with Certified Robustness

We develop a machine-learning framework to learn hyperparameter sequences for accelerated first-order methods (e.g., the step size and momentum sequences in accelerated gradient descent) to quickly solve parametric convex optimization…

Optimization and Control · Mathematics 2025-10-07 Rajiv Sambharya, Jinho Bok, Nikolai Matni, George Pappas

A Method for Evaluating Hyperparameter Sensitivity in Reinforcement Learning

The performance of modern reinforcement learning algorithms critically relies on tuning ever-increasing numbers of hyperparameters. Often, small changes in a hyperparameter can lead to drastic changes in performance, and different…

Machine Learning · Computer Science 2025-02-05 Jacob Adkins, Michael Bowling, Adam White

HyperTime: Hyperparameter Optimization for Combating Temporal Distribution Shifts

In this work, we propose a hyperparameter optimization method named HyperTime to find hyperparameters robust to potential temporal distribution shifts in the unseen test data. Our work is motivated by an important observation that it…

Machine Learning · Computer Science 2023-05-31 Shaokun Zhang, Yiran Wu, Zhonghua Zheng, Qingyun Wu, Chi Wang

Adaptive Hyperparameter Optimization for Continual Learning Scenarios

Hyperparameter selection in continual learning scenarios is a challenging and underexplored aspect, especially in practical non-stationary environments. Traditional approaches, such as grid searches with held-out validation data from all…

Machine Learning · Computer Science 2024-06-21 Rudy Semola, Julio Hurtado, Vincenzo Lomonaco, Davide Bacciu

Using Large Language Models for Hyperparameter Optimization

This paper explores the use of foundational large language models (LLMs) in hyperparameter optimization (HPO). Hyperparameters are critical in determining the effectiveness of machine learning models, yet their optimization often relies on…

Machine Learning · Computer Science 2024-11-12 Michael R. Zhang, Nishkrit Desai, Juhan Bae, Jonathan Lorraine, Jimmy Ba

Online hyperparameter optimization by real-time recurrent learning

Conventional hyperparameter optimization methods are computationally intensive and hard to generalize to scenarios that require dynamically adapting hyperparameters, such as life-long learning. Here, we propose an online hyperparameter…

Machine Learning · Computer Science 2021-04-09 Daniel Jiwoong Im, Cristina Savin, Kyunghyun Cho

Exploring the Optimized Value of Each Hyperparameter in Various Gradient Descent Algorithms

In the recent years, various gradient descent algorithms including the methods of gradient descent, gradient descent with momentum, adaptive gradient (AdaGrad), root-mean-square propagation (RMSProp) and adaptive moment estimation (Adam)…

Machine Learning · Computer Science 2024-09-19 Abel C. H. Chen

DC and SA: Robust and Efficient Hyperparameter Optimization of Multi-subnetwork Deep Learning Models

We present two novel hyperparameter optimization strategies for optimization of deep learning models with a modular architecture constructed of multiple subnetworks. As complex networks with multiple subnetworks become more frequently…

Machine Learning · Computer Science 2022-02-25 Alex H. Treacher, Albert Montillo