Related papers: Level-Based Analysis of the Population-Based Incre…

Level-Based Analysis of the Univariate Marginal Distribution Algorithm

Estimation of Distribution Algorithms (EDAs) are stochastic heuristics that search for optimal solutions by learning and sampling from probabilistic models. Despite their popularity in real-world applications, there is little rigorous…

Neural and Evolutionary Computing · Computer Science 2018-07-27 Duc-Cuong Dang , Per Kristian Lehre , Phan Trung Hai Nguyen

A Simplified Run Time Analysis of the Univariate Marginal Distribution Algorithm on LeadingOnes

With elementary means, we prove a stronger run time guarantee for the univariate marginal distribution algorithm (UMDA) optimizing the LeadingOnes benchmark function in the desirable regime with low genetic drift. If the population size is…

Neural and Evolutionary Computing · Computer Science 2020-04-13 Benjamin Doerr , Martin Krejca

On the Optimal Convergence Probability of Univariate Estimation of Distribution Algorithms

In this paper, we obtain bounds on the probability of convergence to the optimal solution for the compact Genetic Algorithm (cGA) and the Population Based Incremental Learning (PBIL). We also give a sufficient condition for convergence of…

Neural and Evolutionary Computing · Computer Science 2010-09-14 Reza Rastegar

On the Limitations of the Univariate Marginal Distribution Algorithm to Deception and Where Bivariate EDAs might help

We introduce a new benchmark problem called Deceptive Leading Blocks (DLB) to rigorously study the runtime of the Univariate Marginal Distribution Algorithm (UMDA) in the presence of epistasis and deception. We show that simple Evolutionary…

Neural and Evolutionary Computing · Computer Science 2019-07-30 Per Kristian Lehre , Phan Trung Hai Nguyen

Population-Based Evolution Optimizes a Meta-Learning Objective

Meta-learning models, or models that learn to learn, have been a long-desired target for their ability to quickly solve new tasks. Traditional meta-learning methods can require expensive inner and outer loops, thus there is demand for…

Neural and Evolutionary Computing · Computer Science 2021-03-12 Kevin Frans , Olaf Witkowski

The Univariate Marginal Distribution Algorithm Copes Well With Deception and Epistasis

In their recent work, Lehre and Nguyen (FOGA 2019) show that the univariate marginal distribution algorithm (UMDA) needs time exponential in the parent populations size to optimize the DeceptiveLeadingBlocks (DLB) problem. They conclude…

Neural and Evolutionary Computing · Computer Science 2022-04-28 Benjamin Doerr , Martin S. Krejca

Population Based Augmentation: Efficient Learning of Augmentation Policy Schedules

A key challenge in leveraging data augmentation for neural network training is choosing an effective augmentation policy from a large search space of candidate operations. Properly chosen augmentation policies can lead to significant…

Computer Vision and Pattern Recognition · Computer Science 2019-05-15 Daniel Ho , Eric Liang , Ion Stoica , Pieter Abbeel , Xi Chen

Level-based Analysis of Genetic Algorithms and other Search Processes

Understanding how the time-complexity of evolutionary algorithms (EAs) depend on their parameter settings and characteristics of fitness landscapes is a fundamental problem in evolutionary computation. Most rigorous results were derived…

Neural and Evolutionary Computing · Computer Science 2016-10-28 Dogan Corus , Duc-Cuong Dang , Anton V. Eremeev , Per Kristian Lehre

Population Based Training of Neural Networks

Neural networks dominate the modern machine learning landscape, but their training and success still suffer from sensitivity to empirical choices of hyperparameters such as model architecture, loss function, and optimisation algorithm. In…

Machine Learning · Computer Science 2017-11-29 Max Jaderberg , Valentin Dalibard , Simon Osindero , Wojciech M. Czarnecki , Jeff Donahue , Ali Razavi , Oriol Vinyals , Tim Green , Iain Dunning , Karen Simonyan , Chrisantha Fernando , Koray Kavukcuoglu

A Two-Timescale Primal-Dual Framework for Reinforcement Learning via Online Dual Variable Guidance

We study reinforcement learning by combining recent advances in regularized linear programming formulations with the classical theory of stochastic approximation. Motivated by the challenge of designing algorithms that leverage off-policy…

Optimization and Control · Mathematics 2026-04-15 Axel Friedrich Wolter , Tobias Sutter

Self-Supervised Primal-Dual Learning for Constrained Optimization

This paper studies how to train machine-learning models that directly approximate the optimal solutions of constrained optimization problems. This is an empirical risk minimization under constraints, which is challenging as training must…

Machine Learning · Computer Science 2022-11-24 Seonho Park , Pascal Van Hentenryck

Online Bayesian Imbalanced Learning with Bregman-Calibrated Deep Networks

Class imbalance remains a fundamental challenge in machine learning, where standard classifiers exhibit severe performance degradation in minority classes. Although existing approaches address imbalance through resampling or cost-sensitive…

Machine Learning · Computer Science 2026-02-10 Zahir Alsulaimawi

Bivariate Estimation-of-Distribution Algorithms Can Find an Exponential Number of Optima

Finding a large set of optima in a multimodal optimization landscape is a challenging task. Classical population-based evolutionary algorithms typically converge only to a single solution. While this can be counteracted by applying niching…

Neural and Evolutionary Computing · Computer Science 2023-10-10 Benjamin Doerr , Martin S. Krejca

Policy learning "without" overlap: Pessimism and generalized empirical Bernstein's inequality

This paper studies offline policy learning, which aims at utilizing observations collected a priori (from either fixed or adaptively evolving behavior policies) to learn an optimal individualized decision rule that achieves the best overall…

Machine Learning · Computer Science 2025-06-06 Ying Jin , Zhimei Ren , Zhuoran Yang , Zhaoran Wang

Improved Runtime Bounds for the Univariate Marginal Distribution Algorithm via Anti-Concentration

Unlike traditional evolutionary algorithms which produce offspring via genetic operators, Estimation of Distribution Algorithms (EDAs) sample solutions from probabilistic models which are learned from selected individuals. It is hoped that…

Neural and Evolutionary Computing · Computer Science 2018-02-05 Per Kristian Lehre , Phan Trung Hai Nguyen

Evolving Neural Networks in Reinforcement Learning by means of UMDAc

Neural networks are gaining popularity in the reinforcement learning field due to the vast number of successfully solved complex benchmark problems. In fact, artificial intelligence algorithms are, in some cases, able to overcome human…

Neural and Evolutionary Computing · Computer Science 2019-04-25 Mikel Malagon , Josu Ceberio

Bayesian Generational Population-Based Training

Reinforcement learning (RL) offers the potential for training generally capable agents that can interact autonomously in the real world. However, one key limitation is the brittleness of RL algorithms to core hyperparameters and network…

Machine Learning · Computer Science 2022-07-20 Xingchen Wan , Cong Lu , Jack Parker-Holder , Philip J. Ball , Vu Nguyen , Binxin Ru , Michael A. Osborne

REBEL: Reinforcement Learning via Regressing Relative Rewards

While originally developed for continuous control problems, Proximal Policy Optimization (PPO) has emerged as the work-horse of a variety of reinforcement learning (RL) applications, including the fine-tuning of generative models.…

Machine Learning · Computer Science 2024-12-11 Zhaolin Gao , Jonathan D. Chang , Wenhao Zhan , Owen Oertell , Gokul Swamy , Kianté Brantley , Thorsten Joachims , J. Andrew Bagnell , Jason D. Lee , Wen Sun

Runtime Analysis of the Univariate Marginal Distribution Algorithm under Low Selective Pressure and Prior Noise

We perform a rigorous runtime analysis for the Univariate Marginal Distribution Algorithm on the LeadingOnes function, a well-known benchmark function in the theory community of evolutionary computation with a high correlation between…

Neural and Evolutionary Computing · Computer Science 2019-04-22 Per Kristian Lehre , Phan Trung Hai Nguyen

Efficient Data-Dependent Learnability

The predictive normalized maximum likelihood (pNML) approach has recently been proposed as the min-max optimal solution to the batch learning problem where both the training set and the test data feature are individuals, known sequences.…

Machine Learning · Computer Science 2020-11-23 Yaniv Fogel , Tal Shapira , Meir Feder