Related papers: Benchmarking Simulation-Based Inference

A PAC Approach to Application-Specific Algorithm Selection

The best algorithm for a computational problem generally depends on the "relevant inputs," a concept that depends on the application domain and often defies formal articulation. While there is a large literature on empirical approaches to…

Machine Learning · Computer Science 2016-09-06 Rishi Gupta , Tim Roughgarden

Benchmarking Feature-based Algorithm Selection Systems for Black-box Numerical Optimization

Feature-based algorithm selection aims to automatically find the best one from a portfolio of optimization algorithms on an unseen problem based on its landscape features. Feature-based algorithm selection has recently received attention in…

Neural and Evolutionary Computing · Computer Science 2022-04-27 Ryoji Tanabe

Measuring the accuracy of likelihood-free inference

Complex scientific models where the likelihood cannot be evaluated present a challenge for statistical inference. Over the past two decades, a wide range of algorithms have been proposed for learning parameters in computationally feasible…

Computation · Statistics 2021-12-16 Aden Forrow , Ruth E. Baker

Guidelines for benchmarking of optimization approaches for fitting mathematical models

Insufficient performance of optimization approaches for fitting of mathematical models is still a major bottleneck in systems biology. In this manuscript, the reasons and methodological challenges are summarized as well as their impact in…

Performance · Computer Science 2019-07-09 Clemens Kreutz

LLM-Inference-Bench: Inference Benchmarking of Large Language Models on AI Accelerators

Large Language Models (LLMs) have propelled groundbreaking advancements across several domains and are commonly used for text generation applications. However, the computational demands of these complex models pose significant challenges,…

Machine Learning · Computer Science 2024-11-04 Krishna Teja Chitty-Venkata , Siddhisanket Raskar , Bharat Kale , Farah Ferdaus , Aditya Tanikanti , Ken Raffenetti , Valerie Taylor , Murali Emani , Venkatram Vishwanath

Aligning Language Model Benchmarks with Pairwise Preferences

Language model benchmarks are pervasive and computationally-efficient proxies for real-world performance. However, many recent works find that benchmarks often fail to predict real utility. Towards bridging this gap, we introduce benchmark…

Artificial Intelligence · Computer Science 2026-05-28 Marco Gutierrez , Xinyi Leng , Hannah Cyberey , Jonathan Richard Schwarz , Ahmed Alaa , Thomas Hartvigsen

Defining Standard Strategies for Quantum Benchmarks

As quantum computers grow in size and scope, a question of great importance is how best to benchmark performance. Here we define a set of characteristics that any benchmark should follow -- randomized, well-defined, holistic, device…

Quantum Physics · Physics 2023-03-06 Mirko Amico , Helena Zhang , Petar Jurcevic , Lev S. Bishop , Paul Nation , Andrew Wack , David C. McKay

Benchmarking Simulated Robotic Manipulation through a Real World Dataset

We present a benchmark to facilitate simulated manipulation; an attempt to overcome the obstacles of physical benchmarks through the distribution of a real world, ground truth dataset. Users are given various simulated manipulation tasks…

Robotics · Computer Science 2019-11-28 Jack Collins , Jessie McVicar , David Wedlock , Ross Brown , David Howard , Jürgen Leitner

Network Tomography Based on Additive Metrics

Inference of the network structure (e.g., routing topology) and dynamics (e.g., link performance) is an essential component in many network design and management tasks. In this paper we propose a new, general framework for analyzing and…

Networking and Internet Architecture · Computer Science 2019-11-13 Jian Ni , Sekhar Tatikonda

LemmaBench: A Live, Research-Level Benchmark to Evaluate LLM Capabilities in Mathematics

We present a new approach for benchmarking Large Language Model (LLM) capabilities on research-level mathematics. Existing benchmarks largely rely on static, hand-curated sets of contest or textbook-style problems as proxies for…

Artificial Intelligence · Computer Science 2026-03-02 Antoine Peyronnet , Fabian Gloeckle , Amaury Hayat

Performance comparison of State-of-the-art Missing Value Imputation Algorithms on Some Bench mark Datasets

Decision making from data involves identifying a set of attributes that contribute to effective decision making through computational intelligence. The presence of missing values greatly influences the selection of right set of attributes…

Machine Learning · Computer Science 2013-07-23 M. Naresh Kumar

Essential guidelines for computational method benchmarking

In computational biology and other sciences, researchers are frequently faced with a choice between several computational methods for performing data analyses. Benchmarking studies aim to rigorously compare the performance of different…

Quantitative Methods · Quantitative Biology 2019-06-04 Lukas M. Weber , Wouter Saelens , Robrecht Cannoodt , Charlotte Soneson , Alexander Hapfelmeier , Paul P. Gardner , Anne-Laure Boulesteix , Yvan Saeys , Mark D. Robinson

A Computationally Efficient Approach to Fully Bayesian Benchmarking

In small area estimation, it is sometimes necessary to use model-based methods to produce estimates in areas with little or no data. In official statistics, we often require that some aggregate of small area estimates agree with a national…

Methodology · Statistics 2023-01-31 Taylor Okonek , Jon Wakefield

Baseline Results for Selected Nonlinear System Identification Benchmarks

Nonlinear system identification remains an important open challenge across research and academia. Large numbers of novel approaches are seen published each year, each presenting improvements or extensions to existing methods. It is natural,…

Systems and Control · Electrical Eng. & Systems 2024-08-28 Max D. Champneys , Gerben I. Beintema , Roland Tóth , Maarten Schoukens , Timothy J. Rogers

Moving Beyond Downstream Task Accuracy for Information Retrieval Benchmarking

Neural information retrieval (IR) systems have progressed rapidly in recent years, in large part due to the release of publicly available benchmarking tasks. Unfortunately, some dimensions of this progress are illusory: the majority of the…

Information Retrieval · Computer Science 2022-12-05 Keshav Santhanam , Jon Saad-Falcon , Martin Franz , Omar Khattab , Avirup Sil , Radu Florian , Md Arafat Sultan , Salim Roukos , Matei Zaharia , Christopher Potts

Scalable Full-Stack Benchmarks for Quantum Computers

Quantum processors are now able to run quantum circuits that are infeasible to simulate classically, creating a need for benchmarks that assess a quantum processor's rate of errors when running these circuits. Here, we introduce a general…

Quantum Physics · Physics 2025-10-29 Jordan Hines , Timothy Proctor

Do we Need Dozens of Methods for Real World Missing Value Imputation?

Missing values pose a persistent challenge in modern data science. Consequently, there is an ever-growing number of publications introducing new imputation methods in various fields. While many studies compare imputation approaches, they…

Computation · Statistics 2025-11-10 Krystyna Grzesiak , Christophe Muller , Julie Josse , Jeffrey Näf

A Benchmarking Framework for Model Datasets

Empirical and LLM-based research in model-driven engineering increasingly relies on datasets of software models, for instance, to train or evaluate machine learning techniques for modeling support. These datasets have a significant impact…

Software Engineering · Computer Science 2026-03-06 Philipp-Lorenz Glaser , Lola Burgueño , Dominik Bork

Ten New Benchmarks for Optimization

Benchmarks are used for testing new optimization algorithms and their variants to evaluate their performance. Most existing benchmarks are smooth functions. This chapter introduces ten new benchmarks with different properties, including…

Optimization and Control · Mathematics 2023-09-06 Xin-She Yang

Efficient Multifidelity Likelihood-Free Bayesian Inference with Adaptive Computational Resource Allocation

Likelihood-free Bayesian inference algorithms are popular methods for calibrating the parameters of complex, stochastic models, required when the likelihood of the observed data is intractable. These algorithms characteristically rely…

Computation · Statistics 2021-12-23 Thomas P Prescott , David J Warne , Ruth E Baker