Related papers: Generalized Population-Based Training for Hyperpar…

Population Based Training of Neural Networks

Neural networks dominate the modern machine learning landscape, but their training and success still suffer from sensitivity to empirical choices of hyperparameters such as model architecture, loss function, and optimisation algorithm. In…

Machine Learning · Computer Science 2017-11-29 Max Jaderberg, Valentin Dalibard, Simon Osindero, Wojciech M. Czarnecki, Jeff Donahue, Ali Razavi, Oriol Vinyals, Tim Green, Iain Dunning, Karen Simonyan, Chrisantha Fernando, Koray Kavukcuoglu

Bayesian Generational Population-Based Training

Reinforcement learning (RL) offers the potential for training generally capable agents that can interact autonomously in the real world. However, one key limitation is the brittleness of RL algorithms to core hyperparameters and network…

Machine Learning · Computer Science 2022-07-20 Xingchen Wan, Cong Lu, Jack Parker-Holder, Philip J. Ball, Vu Nguyen, Binxin Ru, Michael A. Osborne

A Generalized Framework for Population Based Training

Population Based Training (PBT) is a recent approach that jointly optimizes neural network weights and hyperparameters which periodically copies weights of the best performers and mutates hyperparameters during training. Previous PBT…

Artificial Intelligence · Computer Science 2022-02-01 Ang Li, Aleksandra Spyra, Sagi Perel, Valentin Dalibard, Max Jaderberg, Chenjie Gu, David Budden, Tim Harley, Pramod Gupta

Multiple-Frequencies Population-Based Training

Reinforcement Learning's high sensitivity to hyperparameters is a source of instability and inefficiency, creating significant challenges for practitioners. Hyperparameter Optimization (HPO) algorithms have been developed to address this…

Machine Learning · Computer Science 2025-07-18 Waël Doulazmi, Auguste Lehuger, Marin Toromanoff, Valentin Charraut, Thibault Buhet, Fabien Moutarde

Genealogical Population-Based Training for Hyperparameter Optimization

HyperParameter Optimization (HPO) aims at finding the best HyperParameters (HPs) of learning models, such as neural networks, in the fastest and most efficient way possible. Most recent HPO algorithms try to optimize HPs regardless of the…

Machine Learning · Computer Science 2023-04-11 Antoine Scardigli, Paul Fournier, Matteo Vilucchio, David Naccache

Multi-Objective Population Based Training

Population Based Training (PBT) is an efficient hyperparameter optimization algorithm. PBT is a single-objective algorithm, but many real-world hyperparameter optimization problems involve two or more conflicting objectives. In this work,…

Machine Learning · Computer Science 2023-06-05 Arkadiy Dushatskiy, Alexander Chebykin, Tanja Alderliesten, Peter A. N. Bosman

Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits

Many of the recent triumphs in machine learning are dependent on well-tuned hyperparameters. This is particularly prominent in reinforcement learning (RL) where a small change in the configuration can lead to failure. Despite the importance…

Machine Learning · Computer Science 2021-06-07 Jack Parker-Holder, Vu Nguyen, Stephen Roberts

Faster Improvement Rate Population Based Training

The successful training of neural networks typically involves careful and time consuming hyperparameter tuning. Population Based Training (PBT) has recently been proposed to automate this process. PBT trains a population of neural networks…

Neural and Evolutionary Computing · Computer Science 2021-09-29 Valentin Dalibard, Max Jaderberg

Two-Time-Scale Learning Dynamics: A Population View of Neural Network Training

Population-based learning paradigms, including evolutionary strategies, Population-Based Training (PBT), and recent model-merging methods, combine fast within-model optimisation with slower population-level adaptation. Despite their…

Machine Learning · Computer Science 2026-03-26 Giacomo Borghi, Hyesung Im, Lorenzo Pareschi

Simultaneous Training of First- and Second-Order Optimizers in Population-Based Reinforcement Learning

The tuning of hyperparameters in reinforcement learning (RL) is critical, as these parameters significantly impact an agent's performance and learning efficiency. Dynamic adjustment of hyperparameters during the training process can…

Machine Learning · Computer Science 2024-09-05 Felix Pfeiffer, Shahram Eivazi

Benchmarking Population-Based Reinforcement Learning across Robotic Tasks with GPU-Accelerated Simulation

In recent years, deep reinforcement learning (RL) has shown its effectiveness in solving complex continuous control tasks. However, this comes at the cost of an enormous amount of experience required for training, exacerbated by the…

Robotics · Computer Science 2025-11-19 Asad Ali Shahid, Yashraj Narang, Vincenzo Petrone, Enrico Ferrentino, Ankur Handa, Dieter Fox, Marco Pavone, Loris Roveda

Tuning Mixed Input Hyperparameters on the Fly for Efficient Population Based AutoRL

Despite a series of recent successes in reinforcement learning (RL), many RL algorithms remain sensitive to hyperparameters. As such, there has recently been interest in the field of AutoRL, which seeks to automate design decisions to…

Machine Learning · Computer Science 2021-07-01 Jack Parker-Holder, Vu Nguyen, Shaan Desai, Stephen Roberts

Fast Population-Based Reinforcement Learning on a Single Machine

Training populations of agents has demonstrated great promise in Reinforcement Learning for stabilizing training, improving exploration and asymptotic performance, and generating a diverse set of solutions. However, population-based…

Machine Learning · Computer Science 2022-06-20 Arthur Flajolet, Claire Bizon Monroc, Karim Beguir, Thomas Pierrot

Population-Based Evolution Optimizes a Meta-Learning Objective

Meta-learning models, or models that learn to learn, have been a long-desired target for their ability to quickly solve new tasks. Traditional meta-learning methods can require expensive inner and outer loops, thus there is demand for…

Neural and Evolutionary Computing · Computer Science 2021-03-12 Kevin Frans, Olaf Witkowski

Regularized Evolutionary Population-Based Training

Metalearning of deep neural network (DNN) architectures and hyperparameters has become an increasingly important area of research. At the same time, network regularization has been recognized as a crucial dimension to effective training of…

Neural and Evolutionary Computing · Computer Science 2021-07-22 Jason Liang, Santiago Gonzalez, Hormoz Shahrzad, Risto Miikkulainen

Accelerating and Improving AlphaZero Using Population Based Training

AlphaZero has been very successful in many games. Unfortunately, it still consumes a huge amount of computing resources, the majority of which is spent in self-play. Hyperparameter tuning exacerbates the training cost since each…

Artificial Intelligence · Computer Science 2020-03-16 Ti-Rong Wu, Ting-Han Wei, I-Chen Wu

Hyperparameter Tuning for Deep Reinforcement Learning Applications

Reinforcement learning (RL) applications, where an agent can simply learn optimal behaviors by interacting with the environment, are quickly gaining tremendous success in a wide variety of applications from controlling simple pendulums to…

Machine Learning · Computer Science 2022-01-28 Mariam Kiran, Melis Ozyildirim

Genetic Programming with Reinforcement Learning Trained Transformer for Real-World Dynamic Scheduling Problems

Dynamic scheduling in real-world environments often struggles to adapt to unforeseen disruptions, making traditional static scheduling methods and human-designed heuristics inadequate. This paper introduces an innovative approach that…

Artificial Intelligence · Computer Science 2025-08-06 Xinan Chen, Rong Qu, Jing Dong, Ruibin Bai, Yaochu Jin

Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing

Deep Reinforcement Learning (RL) is proven powerful for decision making in simulated environments. However, training deep RL model is challenging in real world applications such as production-scale health-care or recommender systems because…

Machine Learning · Computer Science 2020-02-14 Ge Liu, Rui Wu, Heng-Tze Cheng, Jing Wang, Jayden Ooi, Lihong Li, Ang Li, Wai Lok Sibon Li, Craig Boutilier, Ed Chi

PB$^2$: Preference Space Exploration via Population-Based Methods in Preference-Based Reinforcement Learning

Preference-based reinforcement learning (PbRL) has emerged as a promising approach for learning behaviors from human feedback without predefined reward functions. However, current PbRL methods face a critical challenge in effectively…

Artificial Intelligence · Computer Science 2025-06-17 Brahim Driss, Alex Davey, Riad Akrour