Related papers: Bayesian Generational Population-Based Training

Generalized Population-Based Training for Hyperparameter Optimization in Reinforcement Learning

Hyperparameter optimization plays a key role in the machine learning domain. Its significance is especially pronounced in reinforcement learning (RL), where agents continuously interact with and adapt to their environments, requiring…

Machine Learning · Computer Science 2024-04-24 Hui Bai, Ran Cheng

Tuning Mixed Input Hyperparameters on the Fly for Efficient Population Based AutoRL

Despite a series of recent successes in reinforcement learning (RL), many RL algorithms remain sensitive to hyperparameters. As such, there has recently been interest in the field of AutoRL, which seeks to automate design decisions to…

Machine Learning · Computer Science 2021-07-01 Jack Parker-Holder, Vu Nguyen, Shaan Desai, Stephen Roberts

Population Based Training of Neural Networks

Neural networks dominate the modern machine learning landscape, but their training and success still suffer from sensitivity to empirical choices of hyperparameters such as model architecture, loss function, and optimisation algorithm. In…

Machine Learning · Computer Science 2017-11-29 Max Jaderberg, Valentin Dalibard, Simon Osindero, Wojciech M. Czarnecki, Jeff Donahue, Ali Razavi, Oriol Vinyals, Tim Green, Iain Dunning, Karen Simonyan, Chrisantha Fernando, Koray Kavukcuoglu

Provably Efficient Online Hyperparameter Optimization with Population-Based Bandits

Many of the recent triumphs in machine learning are dependent on well-tuned hyperparameters. This is particularly prominent in reinforcement learning (RL) where a small change in the configuration can lead to failure. Despite the importance…

Machine Learning · Computer Science 2021-06-07 Jack Parker-Holder, Vu Nguyen, Stephen Roberts

Benchmarking Population-Based Reinforcement Learning across Robotic Tasks with GPU-Accelerated Simulation

In recent years, deep reinforcement learning (RL) has shown its effectiveness in solving complex continuous control tasks. However, this comes at the cost of an enormous amount of experience required for training, exacerbated by the…

Robotics · Computer Science 2025-11-19 Asad Ali Shahid, Yashraj Narang, Vincenzo Petrone, Enrico Ferrentino, Ankur Handa, Dieter Fox, Marco Pavone, Loris Roveda

Automatic tuning of hyper-parameters of reinforcement learning algorithms using Bayesian optimization with behavioral cloning

Optimal setting of several hyper-parameters in machine learning algorithms is key to making the most of available data. To this end, several methods such as evolutionary strategies, random search, Bayesian optimization and heuristic rules of…

Machine Learning · Computer Science 2021-12-16 Juan Cruz Barsce, Jorge A. Palombarini, Ernesto C. Martínez

On-Robot Bayesian Reinforcement Learning for POMDPs

Robot learning is often difficult due to the expense of gathering data. The need for large amounts of data can, and should, be tackled with effective algorithms and leveraging expert information on robot dynamics. Bayesian reinforcement…

Robotics · Computer Science 2023-07-25 Hai Nguyen, Sammie Katt, Yuchen Xiao, Christopher Amato

Hyperparameter Tuning for Deep Reinforcement Learning Applications

Reinforcement learning (RL) applications, where an agent can simply learn optimal behaviors by interacting with the environment, are quickly gaining tremendous success in a wide variety of applications from controlling simple pendulums to…

Machine Learning · Computer Science 2022-01-28 Mariam Kiran, Melis Ozyildirim

Two-Time-Scale Learning Dynamics: A Population View of Neural Network Training

Population-based learning paradigms, including evolutionary strategies, Population-Based Training (PBT), and recent model-merging methods, combine fast within-model optimisation with slower population-level adaptation. Despite their…

Machine Learning · Computer Science 2026-03-26 Giacomo Borghi, Hyesung Im, Lorenzo Pareschi

Simultaneous Training of First- and Second-Order Optimizers in Population-Based Reinforcement Learning

The tuning of hyperparameters in reinforcement learning (RL) is critical, as these parameters significantly impact an agent's performance and learning efficiency. Dynamic adjustment of hyperparameters during the training process can…

Machine Learning · Computer Science 2024-09-05 Felix Pfeiffer, Shahram Eivazi

Multiple-Frequencies Population-Based Training

Reinforcement Learning's high sensitivity to hyperparameters is a source of instability and inefficiency, creating significant challenges for practitioners. Hyperparameter Optimization (HPO) algorithms have been developed to address this…

Machine Learning · Computer Science 2025-07-18 Waël Doulazmi, Auguste Lehuger, Marin Toromanoff, Valentin Charraut, Thibault Buhet, Fabien Moutarde

A Generalized Framework for Population Based Training

Population Based Training (PBT) is a recent approach that jointly optimizes neural network weights and hyperparameters by periodically copying the weights of the best performers and mutating hyperparameters during training. Previous PBT…

Artificial Intelligence · Computer Science 2022-02-01 Ang Li, Aleksandra Spyra, Sagi Perel, Valentin Dalibard, Max Jaderberg, Chenjie Gu, David Budden, Tim Harley, Pramod Gupta

Fast Population-Based Reinforcement Learning on a Single Machine

Training populations of agents has demonstrated great promise in Reinforcement Learning for stabilizing training, improving exploration and asymptotic performance, and generating a diverse set of solutions. However, population-based…

Machine Learning · Computer Science 2022-06-20 Arthur Flajolet, Claire Bizon Monroc, Karim Beguir, Thomas Pierrot

Faster Improvement Rate Population Based Training

The successful training of neural networks typically involves careful and time consuming hyperparameter tuning. Population Based Training (PBT) has recently been proposed to automate this process. PBT trains a population of neural networks…

Neural and Evolutionary Computing · Computer Science 2021-09-29 Valentin Dalibard, Max Jaderberg

A Hierarchical Two-tier Approach to Hyper-parameter Optimization in Reinforcement Learning

Optimization of hyper-parameters in reinforcement learning (RL) algorithms is a key task, because they determine how the agent will learn its policy by interacting with its environment, and thus what data is gathered. In this work, an…

Machine Learning · Computer Science 2019-09-19 Juan Cruz Barsce, Jorge A. Palombarini, Ernesto Martínez

Active Exploration in Bayesian Model-based Reinforcement Learning for Robot Manipulation

Efficiently tackling multiple tasks within complex environments, such as those found in robot manipulation, remains an ongoing challenge in robotics and an opportunity for data-driven solutions, such as reinforcement learning (RL).…

Robotics · Computer Science 2024-04-03 Carlos Plou, Ana C. Murillo, Ruben Martinez-Cantin

Advances in Preference-based Reinforcement Learning: A Review

Reinforcement Learning (RL) algorithms suffer from the dependency on accurately engineered reward functions to properly guide the learning agents to do the required tasks. Preference-based reinforcement learning (PbRL) addresses that by…

Artificial Intelligence · Computer Science 2024-08-23 Youssef Abdelkareem, Shady Shehata, Fakhri Karray

Near-Optimal BRL using Optimistic Local Transitions

Model-based Bayesian Reinforcement Learning (BRL) allows a sound formalization of the problem of acting optimally while facing an unknown environment, i.e., avoiding the exploration-exploitation dilemma. However, algorithms explicitly…

Artificial Intelligence · Computer Science 2012-06-22 Mauricio Araya, Olivier Buffet, Vincent Thomas

Generalized Bayesian deep reinforcement learning

Bayesian reinforcement learning (BRL) is a method that merges principles from Bayesian statistics and reinforcement learning to make optimal decisions in uncertain environments. As a model-based RL method, it has two key components: (1)…

Machine Learning · Statistics 2025-06-03 Shreya Sinha Roy, Richard G. Everitt, Christian P. Robert, Ritabrata Dutta

Supplementing Gradient-Based Reinforcement Learning with Simple Evolutionary Ideas

We present a simple, sample-efficient algorithm for introducing large but directed learning steps in reinforcement learning (RL), through the use of evolutionary operators. The methodology uses a population of RL agents training with a…

Neural and Evolutionary Computing · Computer Science 2023-05-15 Harshad Khadilkar