Related papers: Analytical Benchmark Problems for Multifidelity Op…

A Literature Survey of Benchmark Functions For Global Optimization Problems

Test functions are important to validate and compare the performance of optimization algorithms. There have been many test or benchmark functions reported in the literature; however, there is no standard list or set of benchmark functions.…

Artificial Intelligence · Computer Science 2013-08-20 Momin Jamil , Xin-She Yang

Multi-Fidelity Methods for Optimization: A Survey

Real-world black-box optimization often involves time-consuming or costly experiments and simulations. Multi-fidelity optimization (MFO) stands out as a cost-effective strategy that balances high-fidelity accuracy with computational…

Machine Learning · Computer Science 2024-02-16 Ke Li , Fan Li

Best practices for comparing optimization algorithms

Comparing, or benchmarking, of optimization algorithms is a complicated task that involves many subtle considerations to yield a fair and unbiased evaluation. In this paper, we systematically review the benchmarking process of optimization…

Optimization and Control · Mathematics 2017-09-26 Vahid Beiranvand , Warren Hare , Yves Lucet

HPOBench: A Collection of Reproducible Multi-Fidelity Benchmark Problems for HPO

To achieve peak predictive performance, hyperparameter optimization (HPO) is a crucial component of machine learning and its applications. Over the last years, the number of efficient algorithms and tools for HPO grew substantially. At the…

Machine Learning · Computer Science 2022-10-07 Katharina Eggensperger , Philipp Müller , Neeratyoy Mallik , Matthias Feurer , René Sass , Aaron Klein , Noor Awad , Marius Lindauer , Frank Hutter

Guidelines for benchmarking of optimization approaches for fitting mathematical models

Insufficient performance of optimization approaches for fitting of mathematical models is still a major bottleneck in systems biology. In this manuscript, the reasons and methodological challenges are summarized as well as their impact in…

Performance · Computer Science 2019-07-09 Clemens Kreutz

Benchmarking Simulation-Based Inference

Recent advances in probabilistic modelling have led to a large number of simulation-based inference algorithms which do not require numerical evaluation of likelihoods. However, a public benchmark with appropriate performance metrics for…

Machine Learning · Statistics 2021-04-12 Jan-Matthis Lueckmann , Jan Boelts , David S. Greenberg , Pedro J. Gonçalves , Jakob H. Macke

Benchmarking in Optimization: Best Practice and Open Issues

This survey compiles ideas and recommendations from more than a dozen researchers with different backgrounds and from different institutes around the world. Promoting best practice in benchmarking is its main goal. The article discusses…

Neural and Evolutionary Computing · Computer Science 2020-12-18 Thomas Bartz-Beielstein , Carola Doerr , Daan van den Berg , Jakob Bossek , Sowmya Chandrasekaran , Tome Eftimov , Andreas Fischbach , Pascal Kerschke , William La Cava , Manuel Lopez-Ibanez , Katherine M. Malan , Jason H. Moore , Boris Naujoks , Patryk Orzechowski , Vanessa Volz , Markus Wagner , Thomas Weise

Multi-fidelity Machine Learning for Uncertainty Quantification and Optimization

In system analysis and design optimization, multiple computational models are typically available to represent a given physical system. These models can be broadly classified as high-fidelity models, which provide highly accurate…

Machine Learning · Computer Science 2024-11-01 Ruda Zhang , Negin Alemazkoor

Benchmarking for Metaheuristic Black-Box Optimization: Perspectives and Open Challenges

Research on new optimization algorithms is often funded based on the motivation that such algorithms might improve the capabilities to deal with real-world and industrially relevant optimization challenges. Besides a huge variety of…

Neural and Evolutionary Computing · Computer Science 2020-07-02 Ramses Sala , Ralf Müller

Evaluating the Performance of Reinforcement Learning Algorithms

Performance evaluations are critical for quantifying algorithmic advances in reinforcement learning. Recent reproducibility analyses have shown that reported performance results are often inconsistent and difficult to replicate. In this…

Machine Learning · Computer Science 2020-08-14 Scott M. Jordan , Yash Chandak , Daniel Cohen , Mengxue Zhang , Philip S. Thomas

A Statistical Analysis for Per-Instance Evaluation of Stochastic Optimizers: Avoiding Unreliable Conclusions

A key trait of stochastic optimizers is that multiple runs of the same optimizer in attempting to solve the same problem can produce different results. As a result, their performance is evaluated over several repeats, or runs, on the…

Machine Learning · Computer Science 2026-05-18 Moslem Noori , Elisabetta Valiante , Thomas Van Vaerenbergh , Masoud Mohseni , Ignacio Rozada

Randomness as Reference: Benchmark Metric for Optimization in Engineering

Benchmarking optimization algorithms is fundamental for the advancement of computational intelligence. However, widely adopted artificial test suites exhibit limited correspondence with the diversity and complexity of real-world engineering…

Computational Engineering, Finance, and Science · Computer Science 2026-04-17 Stefan Ivić , Siniša Družeta , Luka Grbčić

Review of multi-fidelity models

Multi-fidelity models provide a framework for integrating computational models of varying complexity, allowing for accurate predictions while optimizing computational resources. These models are especially beneficial when acquiring…

Applications · Statistics 2024-05-14 M. Giselle Fernández-Godino

SELECTOR: Selecting a Representative Benchmark Suite for Reproducible Statistical Comparison

Fair algorithm evaluation is conditioned on the existence of high-quality benchmark datasets that are non-redundant and are representative of typical optimization scenarios. In this paper, we evaluate three heuristics for selecting diverse…

Neural and Evolutionary Computing · Computer Science 2022-04-26 Gjorgjina Cenikj , Ryan Dieter Lang , Andries Petrus Engelbrecht , Carola Doerr , Peter Korošec , Tome Eftimov

A survey of benchmarking frameworks for reinforcement learning

Reinforcement learning has recently experienced increased prominence in the machine learning community. There are many approaches to solving reinforcement learning problems with new techniques developed constantly. When solving problems…

Machine Learning · Computer Science 2020-12-14 Belinda Stapelberg , Katherine M. Malan

Test Problems in Optimization

Test functions are important to validate new optimization algorithms and to compare the performance of various algorithms. There are many test functions in the literature, but there is no standard list or set of test functions one has to…

Optimization and Control · Mathematics 2010-08-04 Xin-She Yang

Using Affine Combinations of BBOB Problems for Performance Assessment

Benchmarking plays a major role in the development and analysis of optimization algorithms. As such, the way in which the used benchmark problems are defined significantly affects the insights that can be gained from any given benchmark…

Neural and Evolutionary Computing · Computer Science 2023-03-09 Diederick Vermetten , Furong Ye , Carola Doerr

Benchmark^2: Systematic Evaluation of LLM Benchmarks

The rapid proliferation of benchmarks for evaluating large language models (LLMs) has created an urgent need for systematic methods to assess benchmark quality itself. We propose Benchmark^2, a comprehensive framework comprising three…

Computation and Language · Computer Science 2026-01-08 Qi Qian , Chengsong Huang , Jingwen Xu , Changze Lv , Muling Wu , Wenhao Liu , Xiaohua Wang , Zhenghua Wang , Zisu Huang , Muzhao Tian , Jianhan Xu , Kun Hu , He-Da Wang , Yao Hu , Xuanjing Huang , Xiaoqing Zheng

A Stratified Analysis of Bayesian Optimization Methods

Empirical analysis serves as an important complement to theoretical analysis for studying practical Bayesian optimization. Often empirical insights expose strengths and weaknesses inaccessible to theoretical analysis. We define two metrics…

Machine Learning · Computer Science 2016-04-01 Ian Dewancker , Michael McCourt , Scott Clark , Patrick Hayes , Alexandra Johnson , George Ke

Block-Bench: A Framework for Controllable and Transparent Discrete Optimization Benchmarking

We present a novel approach for constructing discrete optimization benchmarks that enables fine-grained control over problem properties, and such benchmarks can facilitate analyzing discrete algorithm behaviors. We build benchmark problems…

Neural and Evolutionary Computing · Computer Science 2026-04-09 Furong Ye , Frank Neumann , Thomas Bäck , Niki van Stein