Related papers: Benchmarking Simulation-Based Inference

Benchmarking Neural Network Training Algorithms

Training algorithms, broadly construed, are an essential part of every deep learning pipeline. Training algorithm improvements that speed up training across a wide variety of workloads (e.g., better update rules, tuning protocols, learning…

Machine Learning · Computer Science 2025-06-19 George E. Dahl, Frank Schneider, Zachary Nado, Naman Agarwal, Chandramouli Shama Sastry, Philipp Hennig, Sourabh Medapati, Runa Eschenhagen, Priya Kasimbeg, Daniel Suo, Juhan Bae, Justin Gilmer, Abel L. Peirson, Bilal Khan, Rohan Anil, Mike Rabbat, Shankar Krishnan, Daniel Snider, Ehsan Amid, Kongtao Chen, Chris J. Maddison, Rakshith Vasudev, Michal Badura, Ankush Garg, Peter Mattson

Quantifying the Multi-Scale Performance of Network Inference Algorithms

Graphical models are widely used to study complex multivariate biological systems. Network inference algorithms aim to reverse-engineer such models from noisy experimental data. It is common to assess such algorithms using techniques from…

Methodology · Statistics 2014-03-03 Chris J. Oates, Richard Amos, Simon E. F. Spencer

SELECTOR: Selecting a Representative Benchmark Suite for Reproducible Statistical Comparison

Fair algorithm evaluation is conditioned on the existence of high-quality benchmark datasets that are non-redundant and are representative of typical optimization scenarios. In this paper, we evaluate three heuristics for selecting diverse…

Neural and Evolutionary Computing · Computer Science 2022-04-26 Gjorgjina Cenikj, Ryan Dieter Lang, Andries Petrus Engelbrecht, Carola Doerr, Peter Korošec, Tome Eftimov

MoleculeNet: A Benchmark for Molecular Machine Learning

Molecular machine learning has been maturing rapidly over the last few years. Improved methods and the presence of larger datasets have enabled machine learning algorithms to make increasingly accurate predictions about molecular…

Machine Learning · Computer Science 2018-10-29 Zhenqin Wu, Bharath Ramsundar, Evan N. Feinberg, Joseph Gomes, Caleb Geniesse, Aneesh S. Pappu, Karl Leswing, Vijay Pande

PMLB: A Large Benchmark Suite for Machine Learning Evaluation and Comparison

The selection, development, or comparison of machine learning methods in data mining can be a difficult task based on the target problem and goals of a particular study. Numerous publicly available real-world and simulated benchmark…

Machine Learning · Computer Science 2017-03-03 Randal S. Olson, William La Cava, Patryk Orzechowski, Ryan J. Urbanowicz, Jason H. Moore

Absolute Ranking: An Essential Normalization for Benchmarking Optimization Algorithms

Evaluating performance across optimization algorithms on many problems presents a complex challenge due to the diversity of numerical scales involved. Traditional data processing methods, such as hypothesis testing and Bayesian inference,…

Optimization and Control · Mathematics 2024-09-10 Yunpeng Jinng, Qunfeng Liu

From Variability to Stability: Advancing RecSys Benchmarking Practices

In the rapidly evolving domain of Recommender Systems (RecSys), new algorithms frequently claim state-of-the-art performance based on evaluations over a limited set of arbitrarily selected datasets. However, this approach may fail to…

Information Retrieval · Computer Science 2024-08-28 Valeriy Shevchenko, Nikita Belousov, Alexey Vasilev, Vladimir Zholobov, Artyom Sosedka, Natalia Semenova, Anna Volodkevich, Andrey Savchenko, Alexey Zaytsev

Accounting for Variance in Machine Learning Benchmarks

Strong empirical evidence that one machine-learning algorithm A outperforms another one B ideally calls for multiple trials optimizing the learning pipeline over sources of variation such as data sampling, data augmentation, parameter…

Machine Learning · Computer Science 2021-03-05 Xavier Bouthillier, Pierre Delaunay, Mirko Bronzi, Assya Trofimov, Brennan Nichyporuk, Justin Szeto, Naz Sepah, Edward Raff, Kanika Madan, Vikram Voleti, Samira Ebrahimi Kahou, Vincent Michalski, Dmitriy Serdyuk, Tal Arbel, Chris Pal, Gaël Varoquaux, Pascal Vincent

Benchmarking TinyML Systems: Challenges and Direction

Recent advancements in ultra-low-power machine learning (TinyML) hardware promise to unlock an entirely new class of smart applications. However, continued progress is limited by the lack of a widely accepted benchmark for these systems.…

Performance · Computer Science 2021-02-02 Colby R. Banbury, Vijay Janapa Reddi, Max Lam, William Fu, Amin Fazel, Jeremy Holleman, Xinyuan Huang, Robert Hurtado, David Kanter, Anton Lokhmotov, David Patterson, Danilo Pau, Jae-sun Seo, Jeff Sieracki, Urmish Thakker, Marian Verhelst, Poonam Yadav

Simulation-Based Inference: A Practical Guide

A central challenge in many areas of science and engineering is to identify model parameters that are consistent with prior knowledge and empirical data. Bayesian inference offers a principled framework for this task, but can be…

Machine Learning · Statistics 2025-08-19 Michael Deistler, Jan Boelts, Peter Steinbach, Guy Moss, Thomas Moreau, Manuel Gloeckler, Pedro L. C. Rodrigues, Julia Linhart, Janne K. Lappalainen, Benjamin Kurt Miller, Pedro J. Gonçalves, Jan-Matthis Lueckmann, Cornelius Schröder, Jakob H. Macke

Benchmarking Evolutionary Algorithms For Single Objective Real-valued Constrained Optimization - A Critical Review

Benchmarking plays an important role in the development of novel search algorithms as well as for the assessment and comparison of contemporary algorithmic ideas. This paper presents common principles that need to be taken into account when…

Neural and Evolutionary Computing · Computer Science 2018-10-08 Michael Hellwig, Hans-Georg Beyer

Benchmarking for Metaheuristic Black-Box Optimization: Perspectives and Open Challenges

Research on new optimization algorithms is often funded based on the motivation that such algorithms might improve the capabilities to deal with real-world and industrially relevant optimization challenges. Besides a huge variety of…

Neural and Evolutionary Computing · Computer Science 2020-07-02 Ramses Sala, Ralf Müller

An Extensible Benchmarking Infrastructure for Motion Planning Algorithms

Sampling-based planning algorithms are the most common probabilistically complete algorithms and are widely used on many robot platforms. Within this class of algorithms, many variants have been proposed over the last 20 years, yet there is…

Robotics · Computer Science 2015-08-11 Mark Moll, Ioan A. Sucan, Lydia E. Kavraki

Best practices for comparing optimization algorithms

Comparing, or benchmarking, of optimization algorithms is a complicated task that involves many subtle considerations to yield a fair and unbiased evaluation. In this paper, we systematically review the benchmarking process of optimization…

Optimization and Control · Mathematics 2017-09-26 Vahid Beiranvand, Warren Hare, Yves Lucet

Deprecating Benchmarks: Criteria and Framework

As frontier artificial intelligence (AI) models rapidly advance, benchmarks are integral to comparing different models and measuring their progress in different task-specific domains. However, there is a lack of guidance on when and how…

Computers and Society · Computer Science 2025-07-10 Ayrton San Joaquin, Rokas Gipiškis, Leon Staufer, Ariel Gil

Benchmarking Physical Performance of Neural Inference Circuits

Numerous neural network circuits and architectures are presently under active research for application to artificial intelligence and machine learning. Their physical performance metrics (area, time, energy) are estimated. Various types of…

Emerging Technologies · Computer Science 2019-07-15 Dmitri E. Nikonov, Ian A. Young

Truncated Marginal Neural Ratio Estimation

Parametric stochastic simulators are ubiquitous in science, often featuring high-dimensional input parameters and/or an intractable likelihood. Performing Bayesian parameter inference in this context can be challenging. We present a neural…

Machine Learning · Statistics 2021-10-27 Benjamin Kurt Miller, Alex Cole, Patrick Forré, Gilles Louppe, Christoph Weniger

Simulation benchmarks for low-pressure plasmas: capacitive discharges

Benchmarking is generally accepted as an important element in demonstrating the correctness of computer simulations. In the modern sense, a benchmark is a computer simulation result that has evidence of correctness, is accompanied by…

Plasma Physics · Physics 2015-06-12 M. M. Turner, A. Derzsi, Z. Donko, D. Eremin, S. J. Kelly, T. Lafleur, T. Mussenbrock

Optimization-based Calibration of Simulation Input Models

Studies on simulation input uncertainty are often built on the availability of input data. In this paper, we investigate an inverse problem where, given only the availability of output data, we nonparametrically calibrate the input models and…

Optimization and Control · Mathematics 2018-01-09 Aleksandrina Goeva, Henry Lam, Huajie Qian, Bo Zhang

Vote'n'Rank: Revision of Benchmarking with Social Choice Theory

The development of state-of-the-art systems in different applied areas of machine learning (ML) is driven by benchmarks, which have shaped the paradigm of evaluating generalisation capabilities from multiple perspectives. Although the…

Machine Learning · Computer Science 2023-10-04 Mark Rofin, Vladislav Mikhailov, Mikhail Florinskiy, Andrey Kravchenko, Elena Tutubalina, Tatiana Shavrina, Daniel Karabekyan, Ekaterina Artemova