English
Related papers

Related papers: A Generalized Framework for Population Based Train…

200 papers

Neural networks dominate the modern machine learning landscape, but their training and success still suffer from sensitivity to empirical choices of hyperparameters such as model architecture, loss function, and optimisation algorithm. In…

Hyperparameter optimization plays a key role in the machine learning domain. Its significance is especially pronounced in reinforcement learning (RL), where agents continuously interact with and adapt to their environments, requiring…

Machine Learning · Computer Science 2024-04-24 Hui Bai , Ran Cheng

The successful training of neural networks typically involves careful and time consuming hyperparameter tuning. Population Based Training (PBT) has recently been proposed to automate this process. PBT trains a population of neural networks…

Neural and Evolutionary Computing · Computer Science 2021-09-29 Valentin Dalibard , Max Jaderberg

Population-based learning paradigms, including evolutionary strategies, Population-Based Training (PBT), and recent model-merging methods, combine fast within-model optimisation with slower population-level adaptation. Despite their…

Machine Learning · Computer Science 2026-03-26 Giacomo Borghi , Hyesung Im , Lorenzo Pareschi

Reinforcement Learning's high sensitivity to hyperparameters is a source of instability and inefficiency, creating significant challenges for practitioners. Hyperparameter Optimization (HPO) algorithms have been developed to address this…

Machine Learning · Computer Science 2025-07-18 Waël Doulazmi , Auguste Lehuger , Marin Toromanoff , Valentin Charraut , Thibault Buhet , Fabien Moutarde

Reinforcement learning (RL) offers the potential for training generally capable agents that can interact autonomously in the real world. However, one key limitation is the brittleness of RL algorithms to core hyperparameters and network…

Machine Learning · Computer Science 2022-07-20 Xingchen Wan , Cong Lu , Jack Parker-Holder , Philip J. Ball , Vu Nguyen , Binxin Ru , Michael A. Osborne

Batch normalization (BN) is a popular and ubiquitous method in deep learning that has been shown to decrease training time and improve generalization performance of neural networks. Despite its success, BN is not theoretically well…

Machine Learning · Computer Science 2022-01-21 Susanna Lange , Kyle Helfrich , Qiang Ye

Metalearning of deep neural network (DNN) architectures and hyperparameters has become an increasingly important area of research. At the same time, network regularization has been recognized as a crucial dimension to effective training of…

Neural and Evolutionary Computing · Computer Science 2021-07-22 Jason Liang , Santiago Gonzalez , Hormoz Shahrzad , Risto Miikkulainen

Training populations of agents has demonstrated great promise in Reinforcement Learning for stabilizing training, improving exploration and asymptotic performance, and generating a diverse set of solutions. However, population-based…

Machine Learning · Computer Science 2022-06-20 Arthur Flajolet , Claire Bizon Monroc , Karim Beguir , Thomas Pierrot

Many of the recent triumphs in machine learning are dependent on well-tuned hyperparameters. This is particularly prominent in reinforcement learning (RL) where a small change in the configuration can lead to failure. Despite the importance…

Machine Learning · Computer Science 2021-06-07 Jack Parker-Holder , Vu Nguyen , Stephen Roberts

HyperParameter Optimization (HPO) aims at finding the best HyperParameters (HPs) of learning models, such as neural networks, in the fastest and most efficient way possible. Most recent HPO algorithms try to optimize HPs regardless of the…

Machine Learning · Computer Science 2023-04-11 Antoine Scardigli , Paul Fournier , Matteo Vilucchio , David Naccache

The tuning of hyperparameters in reinforcement learning (RL) is critical, as these parameters significantly impact an agent's performance and learning efficiency. Dynamic adjustment of hyperparameters during the training process can…

Machine Learning · Computer Science 2024-09-05 Felix Pfeiffer , Shahram Eivazi

AlphaZero has been very successful in many games. Unfortunately, it still consumes a huge amount of computing resources, the majority of which is spent in self-play. Hyperparameter tuning exacerbates the training cost since each…

Artificial Intelligence · Computer Science 2020-03-16 Ti-Rong Wu , Ting-Han Wei , I-Chen Wu

This work presents a novel training technique for deep neural networks that makes use of additional data from a distribution that is different from that of the original input data. This technique aims to reduce overfitting and improve the…

Computer Vision and Pattern Recognition · Computer Science 2020-06-09 Pravendra Singh , Pratik Mazumder , Vinay P. Namboodiri

The rapid advancement of deep learning models often attributes to their ability to leverage massive training data. In contrast, such privilege has not yet fully benefited 3D deep learning, mainly due to the limited availability of…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Xiaoyang Wu , Zhuotao Tian , Xin Wen , Bohao Peng , Xihui Liu , Kaicheng Yu , Hengshuang Zhao

The inductive bias of a neural network is largely determined by the architecture and the training algorithm. To achieve good generalization, how to effectively train a neural network is of great importance. We propose a novel orthogonal…

Machine Learning · Computer Science 2021-06-08 Weiyang Liu , Rongmei Lin , Zhen Liu , James M. Rehg , Liam Paull , Li Xiong , Le Song , Adrian Weller

PAC-Bayesian is an analysis framework where the training error can be expressed as the weighted average of the hypotheses in the posterior distribution whilst incorporating the prior knowledge. In addition to being a pure generalization…

Machine Learning · Computer Science 2022-02-07 Wei Huang , Chunrui Liu , Yilan Chen , Tianyu Liu , Richard Yi Da Xu

We introduce Predictive Batch Scheduling (PBS), a novel training optimization technique that accelerates language model convergence by dynamically prioritizing high-loss samples during batch construction. Unlike curriculum learning…

Artificial Intelligence · Computer Science 2026-02-20 Sumedh Rasal

We present a rapid design methodology that combines automated hyper-parameter tuning with semi-supervised training to build highly accurate and robust models for voice commands classification. Proposed approach allows quick evaluation of…

Audio and Speech Processing · Electrical Eng. & Systems 2019-05-13 Oguz H. Elibol , Gokce Keskin , Anil Thomas

Meta-learning models, or models that learn to learn, have been a long-desired target for their ability to quickly solve new tasks. Traditional meta-learning methods can require expensive inner and outer loops, thus there is demand for…

Neural and Evolutionary Computing · Computer Science 2021-03-12 Kevin Frans , Olaf Witkowski
‹ Prev 1 2 3 10 Next ›