Related papers: Deep Neural Network Hyperparameter Optimization wi…

On tuning deep learning models: a data mining perspective

Deep learning algorithms vary depending on the underlying connection mechanism of their nodes. They have various hyperparameters that are either set via specific algorithms or randomly chosen. Meanwhile, hyperparameters of deep learning…

Machine Learning · Computer Science 2020-11-20 M. M. Ozturk

Hyperparameter Optimization: A Spectral Approach

We give a simple, fast algorithm for hyperparameter optimization inspired by techniques from the analysis of Boolean functions. We focus on the high-dimensional regime where the canonical example is training a neural network with a large…

Machine Learning · Computer Science 2018-01-23 Elad Hazan, Adam Klivans, Yang Yuan

QATM: Quality-Aware Template Matching For Deep Learning

Finding a template in a search image is one of the core problems in many computer vision tasks, such as semantic image alignment, image-to-GPS verification, etc. We propose a novel quality-aware template matching method, QATM, which is not only used…

Computer Vision and Pattern Recognition · Computer Science 2019-04-11 Jiaxin Cheng, Yue Wu, Wael Abd-Almageed, Premkumar Natarajan

Hyper-Parameter Optimization: A Review of Algorithms and Applications

Since deep neural networks were developed, they have made huge contributions to everyday lives. Machine learning provides more rational advice than humans are capable of in almost every aspect of daily life. However, despite this…

Machine Learning · Computer Science 2020-03-13 Tong Yu, Hong Zhu

AUTOMATA: Gradient Based Data Subset Selection for Compute-Efficient Hyper-parameter Tuning

Deep neural networks have seen great success in recent years; however, training a deep model is often challenging as its performance heavily depends on the hyper-parameters used. In addition, finding the optimal hyper-parameter…

Machine Learning · Computer Science 2022-03-17 Krishnateja Killamsetty, Guttu Sai Abhishek, Aakriti, Alexandre V. Evfimievski, Lucian Popa, Ganesh Ramakrishnan, Rishabh Iyer

Hyperparameter Optimization: Foundations, Algorithms, Best Practices and Open Challenges

Most machine learning algorithms are configured by one or several hyperparameters that must be carefully chosen and often considerably impact performance. To avoid a time-consuming and unreproducible manual trial-and-error process to find…

Machine Learning · Statistics 2021-11-29 Bernd Bischl, Martin Binder, Michel Lang, Tobias Pielok, Jakob Richter, Stefan Coors, Janek Thomas, Theresa Ullmann, Marc Becker, Anne-Laure Boulesteix, Difan Deng, Marius Lindauer

Orthogonal Deep Neural Networks

In this paper, we introduce the algorithms of Orthogonal Deep Neural Networks (OrthDNNs) to connect with recent interest of spectrally regularized deep learning methods. OrthDNNs are theoretically motivated by generalization analysis of…

Machine Learning · Computer Science 2019-10-16 Kui Jia, Shuai Li, Yuxin Wen, Tongliang Liu, Dacheng Tao

Automatic Setting of DNN Hyper-Parameters by Mixing Bayesian Optimization and Tuning Rules

Deep learning techniques play an increasingly important role in industrial and research environments due to their outstanding results. However, the large number of hyper-parameters to be set may lead to errors if they are set manually. The…

Machine Learning · Computer Science 2020-06-04 Michele Fraccaroli, Evelina Lamma, Fabrizio Riguzzi

Combination of Hyperband and Bayesian Optimization for Hyperparameter Optimization in Deep Learning

Deep learning has achieved impressive results on many problems. However, it requires a high degree of expertise or a lot of experience to tune the hyperparameters well, and such a manual tuning process is likely to be biased. Moreover, it is…

Computer Vision and Pattern Recognition · Computer Science 2018-01-08 Jiazhuo Wang, Jason Xu, Xuejun Wang

Cross-Entropy Optimization for Hyperparameter Optimization in Stochastic Gradient-based Approaches to Train Deep Neural Networks

In this paper, we present a cross-entropy optimization method for hyperparameter optimization in stochastic gradient-based approaches to train deep neural networks. The value of a hyperparameter of a learning algorithm often has great…

Machine Learning · Computer Science 2024-09-17 Kevin Li, Fulu Li

Tuning structure learning algorithms with out-of-sample and resampling strategies

One of the challenges practitioners face when applying structure learning algorithms to their data involves determining a set of hyperparameters; otherwise, a set of hyperparameter defaults is assumed. The optimal hyperparameter…

Machine Learning · Computer Science 2023-06-27 Kiattikun Chobtham, Anthony C. Constantinou

Hyperparameter Optimization in Machine Learning

Hyperparameters are configuration variables controlling the behavior of machine learning algorithms. They are ubiquitous in machine learning and artificial intelligence and the choice of their values determines the effectiveness of systems…

Machine Learning · Statistics 2025-11-26 Luca Franceschi, Michele Donini, Valerio Perrone, Aaron Klein, Cédric Archambeau, Matthias Seeger, Massimiliano Pontil, Paolo Frasconi

Online hyperparameter optimization by real-time recurrent learning

Conventional hyperparameter optimization methods are computationally intensive and hard to generalize to scenarios that require dynamically adapting hyperparameters, such as life-long learning. Here, we propose an online hyperparameter…

Machine Learning · Computer Science 2021-04-09 Daniel Jiwoong Im, Cristina Savin, Kyunghyun Cho

GADAM: Genetic-Evolutionary ADAM for Deep Neural Network Optimization

Deep neural network learning can be formulated as a non-convex optimization problem. Existing optimization algorithms, e.g., Adam, can learn the models fast, but may get stuck in local optima easily. In this paper, we introduce a novel…

Machine Learning · Computer Science 2019-03-12 Jiawei Zhang, Fisher B. Gouza

OATS: Outlier-Aware Pruning Through Sparse and Low Rank Decomposition

The recent paradigm shift to large-scale foundation models has brought about a new era for deep learning that, while it has found great success in practice, has also been plagued by prohibitively expensive costs in terms of high memory…

Machine Learning · Computer Science 2025-05-21 Stephen Zhang, Vardan Papyan

Efficient Hyperparameter Optimization in Deep Learning Using a Variable Length Genetic Algorithm

Convolutional Neural Networks (CNN) have gained great success in many artificial intelligence tasks. However, finding a good set of hyperparameters for a CNN remains a challenging task. It usually takes an expert with deep knowledge, and…

Neural and Evolutionary Computing · Computer Science 2020-06-25 Xueli Xiao, Ming Yan, Sunitha Basodi, Chunyan Ji, Yi Pan

On Hyperparameter Optimization of Machine Learning Algorithms: Theory and Practice

Machine learning algorithms have been used widely in various applications and areas. To fit a machine learning model into different problems, its hyper-parameters must be tuned. Selecting the best hyper-parameter configuration for machine…

Machine Learning · Computer Science 2022-10-06 Li Yang, Abdallah Shami

Practical recommendations for gradient-based training of deep architectures

Learning algorithms related to artificial neural networks and in particular for Deep Learning may seem to involve many bells and whistles, called hyper-parameters. This chapter is meant as a practical guide with recommendations for some of…

Machine Learning · Computer Science 2012-09-18 Yoshua Bengio

Tuning the Tuner: Introducing Hyperparameter Optimization for Auto-Tuning

Automatic performance tuning (auto-tuning) is widely used to optimize performance-critical applications across many scientific domains by finding the best program variant among many choices. Efficient optimization algorithms are crucial for…

Machine Learning · Computer Science 2025-10-10 Floris-Jan Willemsen, Rob V. van Nieuwpoort, Ben van Werkhoven

Complexity-Aware Training of Deep Neural Networks for Optimal Structure Discovery

We propose a novel algorithm for combined unit and layer pruning of deep neural networks that functions during training and without requiring a pre-trained network to apply. Our algorithm optimally trades off learning accuracy and pruning…

Machine Learning · Computer Science 2025-07-17 Valentin Frank Ingmar Guenter, Athanasios Sideris