Related papers: Stochastic Hyperparameter Optimization through Hyp…

Stochastic Maximum Likelihood Optimization via Hypernetworks

This work explores maximum likelihood optimization of neural networks through hypernetworks. A hypernetwork initializes the weights of another network, which in turn can be employed for typical functional tasks such as regression and…

Machine Learning · Statistics 2018-01-15 Abdul-Saboor Sheikh, Kashif Rasul, Andreas Merentitis, Urs Bergmann

On Hyperparameter Optimization of Machine Learning Algorithms: Theory and Practice

Machine learning algorithms have been used widely in various applications and areas. To fit a machine learning model into different problems, its hyper-parameters must be tuned. Selecting the best hyper-parameter configuration for machine…

Machine Learning · Computer Science 2022-10-06 Li Yang, Abdallah Shami

Self-Tuning Networks: Bilevel Optimization of Hyperparameters using Structured Best-Response Functions

Hyperparameter optimization can be formulated as a bilevel optimization problem, where the optimal parameters on the training set depend on the hyperparameters. We aim to adapt regularization hyperparameters for neural networks by fitting…

Machine Learning · Computer Science 2019-03-08 Matthew MacKay, Paul Vicol, Jon Lorraine, David Duvenaud, Roger Grosse

Training Deep Neural Networks by optimizing over nonlocal paths in hyperparameter space

Hyperparameter optimization is both a practical issue and an interesting theoretical problem in training of deep architectures. Despite many recent advances the most commonly used methods almost universally involve training multiple and…

Machine Learning · Computer Science 2019-09-10 Vlad Pushkarov, Jonathan Efroni, Mykola Maksymenko, Maciej Koch-Janusz

Optimizing Ensemble Weights and Hyperparameters of Machine Learning Models for Regression Problems

Aggregating multiple learners through an ensemble of models aims to make better predictions by capturing the underlying distribution of the data more accurately. Different ensembling methods, such as bagging, boosting, and stacking/blending,…

Machine Learning · Statistics 2020-11-03 Mohsen Shahhosseini, Guiping Hu, Hieu Pham

Online hyperparameter optimization by real-time recurrent learning

Conventional hyperparameter optimization methods are computationally intensive and hard to generalize to scenarios that require dynamically adapting hyperparameters, such as life-long learning. Here, we propose an online hyperparameter…

Machine Learning · Computer Science 2021-04-09 Daniel Jiwoong Im, Cristina Savin, Kyunghyun Cho

Improving Pre-Trained Weights Through Meta-Heuristics Fine-Tuning

Machine Learning algorithms have been extensively researched throughout the last decade, leading to unprecedented advances in a broad range of applications, such as image classification and reconstruction, object recognition, and text…

Artificial Intelligence · Computer Science 2022-12-20 Gustavo H. de Rosa, Mateus Roder, João Paulo Papa, Claudio F. G. dos Santos

Deep Genetic Network

Optimizing a neural network's performance is a tedious and time-consuming process; this iterative process has no defined solution that works for all problems. Optimization can be roughly categorized into - Architecture and…

Machine Learning · Computer Science 2019-12-16 Siddhartha Dhar Choudhury, Shashank Pandey, Kunal Mehrotra

Improving Hyperparameter Optimization with Checkpointed Model Weights

When training deep learning models, the performance depends largely on the selected hyperparameters. However, hyperparameter optimization (HPO) is often one of the most expensive parts of model design. Classical HPO methods treat this as a…

Machine Learning · Computer Science 2024-06-28 Nikhil Mehta, Jonathan Lorraine, Steve Masson, Ramanathan Arunachalam, Zaid Pervaiz Bhat, James Lucas, Arun George Zachariah

On Hyper-parameter Tuning for Stochastic Optimization Algorithms

This paper proposes the first-ever algorithmic framework for tuning hyper-parameters of stochastic optimization algorithms based on reinforcement learning. Hyper-parameters impose significant influences on the performance of stochastic…

Machine Learning · Computer Science 2020-03-11 Haotian Zhang, Jianyong Sun, Zongben Xu

Scalable Nested Optimization for Deep Learning

Gradient-based optimization has been critical to the success of machine learning, updating a single set of parameters to minimize a single loss. A growing number of applications rely on a generalization of this, where we have a bilevel or…

Machine Learning · Computer Science 2024-07-02 Jonathan Lorraine

Automatic Setting of DNN Hyper-Parameters by Mixing Bayesian Optimization and Tuning Rules

Deep learning techniques play an increasingly important role in industrial and research environments due to their outstanding results. However, the large number of hyper-parameters to be set may lead to errors if they are set manually. The…

Machine Learning · Computer Science 2020-06-04 Michele Fraccaroli, Evelina Lamma, Fabrizio Riguzzi

Dynamic Optimization of Neural Network Structures Using Probabilistic Modeling

Deep neural networks (DNNs) are powerful machine learning models and have succeeded in various artificial intelligence tasks. Although various architectures and modules for the DNNs have been proposed, selecting and designing the…

Neural and Evolutionary Computing · Computer Science 2018-01-24 Shinichi Shirakawa, Yasushi Iwata, Youhei Akimoto

Efficient Hyperparameter Tuning with Dynamic Accuracy Derivative-Free Optimization

Many machine learning solutions are framed as optimization problems which rely on good hyperparameters. Algorithms for tuning these hyperparameters usually assume access to exact solutions to the underlying learning problem, which is…

Machine Learning · Computer Science 2020-11-09 Matthias J. Ehrhardt, Lindon Roberts

Optimizing Neural Networks through Activation Function Discovery and Automatic Weight Initialization

Automated machine learning (AutoML) methods improve upon existing models by optimizing various aspects of their design. While present methods focus on hyperparameters and neural network topologies, other aspects of neural network design can…

Machine Learning · Computer Science 2023-04-10 Garrett Bingham

Cross-Entropy Optimization for Hyperparameter Optimization in Stochastic Gradient-based Approaches to Train Deep Neural Networks

In this paper, we present a cross-entropy optimization method for hyperparameter optimization in stochastic gradient-based approaches to train deep neural networks. The value of a hyperparameter of a learning algorithm often has great…

Machine Learning · Computer Science 2024-09-17 Kevin Li, Fulu Li

Hyperparameter Optimization through Neural Network Partitioning

Well-tuned hyperparameters are crucial for obtaining good generalization behavior in neural networks. They can enforce appropriate inductive biases, regularize the model and improve performance -- especially in the presence of limited data.…

Machine Learning · Computer Science 2023-05-01 Bruno Mlodozeniec, Matthias Reisser, Christos Louizos

Neural Networks Designing Neural Networks: Multi-Objective Hyper-Parameter Optimization

Artificial neural networks have gone through a recent rise in popularity, achieving state-of-the-art results in various fields, including image classification, speech recognition, and automated control. Both the performance and…

Neural and Evolutionary Computing · Computer Science 2016-11-08 Sean C. Smithson, Guang Yang, Warren J. Gross, Brett H. Meyer

Stochastic Layer-Wise Precision in Deep Neural Networks

Low precision weights, activations, and gradients have been proposed as a way to improve the computational efficiency and memory footprint of deep neural networks. Recently, low precision networks have even been shown to be more robust to…

Machine Learning · Computer Science 2018-07-04 Griffin Lacey, Graham W. Taylor, Shawki Areibi

Hyperparameter Optimization in Binary Communication Networks for Neuromorphic Deployment

Training neural networks for neuromorphic deployment is non-trivial. There have been a variety of approaches proposed to adapt back-propagation or back-propagation-like algorithms appropriate for training. Considering that these networks…

Neural and Evolutionary Computing · Computer Science 2020-05-11 Maryam Parsa, Catherine D. Schuman, Prasanna Date, Derek C. Rose, Bill Kay, J. Parker Mitchell, Steven R. Young, Ryan Dellana, William Severa, Thomas E. Potok, Kaushik Roy