Related papers: Towards a Statistical Methodology to Evaluate Prog…

A Statistical Analysis for Per-Instance Evaluation of Stochastic Optimizers: Avoiding Unreliable Conclusions

A key trait of stochastic optimizers is that multiple runs of the same optimizer in attempting to solve the same problem can produce different results. As a result, their performance is evaluated over several repeats, or runs, on the…

Machine Learning · Computer Science 2026-05-18 Moslem Noori , Elisabetta Valiante , Thomas Van Vaerenbergh , Masoud Mohseni , Ignacio Rozada

Analysis of Systems' Performance in Natural Language Processing Competitions

Collaborative competitions have gained popularity in the scientific and technological fields. These competitions involve defining tasks, selecting evaluation scores, and devising result verification methods. In the standard scenario,…

Machine Learning · Computer Science 2024-08-22 Sergio Nava-Muñoz , Mario Graff , Hugo Jair Escalante

"Can I Implement Your Algorithm?": A Model for Reproducible Research Software

The reproduction and replication of novel results has become a major issue for a number of scientific disciplines. In computer science and related computational disciplines such as systems biology, the issues closely revolve around the…

Software Engineering · Computer Science 2014-09-17 Tom Crick , Benjamin A. Hall , Samin Ishtiaq

Program Analysis of Probabilistic Programs

Probabilistic programming is a growing area that strives to make statistical analysis more accessible, by separating probabilistic modelling from probabilistic inference. In practice this decoupling is difficult. No single inference…

Programming Languages · Computer Science 2022-04-15 Maria I. Gorinova

Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)

One of the challenges in machine learning research is to ensure that presented and published results are sound and reliable. Reproducibility, that is obtaining similar results as presented in a paper or talk, using the same code and data…

Machine Learning · Computer Science 2021-01-01 Joelle Pineau , Philippe Vincent-Lamarre , Koustuv Sinha , Vincent Larivière , Alina Beygelzimer , Florence d'Alché-Buc , Emily Fox , Hugo Larochelle

Best Practices for Replicability, Reproducibility and Reusability of Computer-Based Experiments Exemplified by Model Reduction Software

Over the recent years the importance of numerical experiments has gradually been more recognized. Nonetheless, sufficient documentation of how computational results have been obtained is often not available. Especially in the scientific…

Mathematical Software · Computer Science 2021-05-10 Jörg Fehr , Jan Heiland , Christian Himpe , Jens Saak

Quantifying Performance Changes with Effect Size Confidence Intervals

Measuring performance & quantifying a performance change are core evaluation techniques in programming language and systems research. Of 122 recent scientific papers, as many as 65 included experimental evaluation that quantified a…

Methodology · Statistics 2020-07-22 Tomas Kalibera , Richard Jones

Reproducibility and Replication of Experimental Particle Physics Results

Recently, much attention has been focused on the replicability of scientific results, causing scientists, statisticians, and journal editors to examine closely their methodologies and publishing criteria. Experimental particle physicists…

Data Analysis, Statistics and Probability · Physics 2021-05-07 Thomas R. Junk , Louis Lyons

Beyond Application End-Point Results: Quantifying Statistical Robustness of MCMC Accelerators

Statistical machine learning often uses probabilistic algorithms, such as Markov Chain Monte Carlo (MCMC), to solve a wide range of problems. Probabilistic computations, often considered too slow on conventional processors, can be…

Signal Processing · Electrical Eng. & Systems 2020-03-26 Xiangyu Zhang , Ramin Bashizade , Yicheng Wang , Cheng Lyu , Sayan Mukherjee , Alvin R. Lebeck

Dear CAV, We Need to Talk About Reproducibility

How many times have you tried to re-implement a past CAV tool paper, and failed? Reliably reproducing published scientific discoveries has been acknowledged as a barrier to scientific progress for some time but there remains only a small…

Logic in Computer Science · Computer Science 2015-02-10 Tom Crick , Benjamin A. Hall , Samin Ishtiaq

The state of play of reproducibility in Statistics: an empirical analysis

Reproducibility, the ability to reproduce the results of published papers or studies using their computer code and data, is a cornerstone of reliable scientific methodology. Studies where results cannot be reproduced by the scientific…

Applications · Statistics 2022-10-03 Xin Xiong , Ivor Cribben

The Practice of Ensuring Repeatable and Reproducible Computational Models

Recent studies have shown that the majority of published computational models in systems biology and physiology are not repeatable or reproducible. There are a variety of reasons for this. One of the most likely reasons is that given how…

Other Quantitative Biology · Quantitative Biology 2021-07-13 Herbert M. Sauro

Reproducibility as a Technical Specification

Reproducibility of computationally-derived scientific discoveries should be a certainty. As the product of several person-years' worth of effort, results -- whether disseminated through academic journals, conferences or exploited through…

Computational Engineering, Finance, and Science · Computer Science 2015-06-17 Tom Crick , Benjamin A. Hall , Samin Ishtiaq

From Guidelines to Practice: Evaluating the Reproducibility of Methods in Computational Social Science

Reproducibility remains a central challenge in computational social science, where complex workflows, evolving software ecosystems, and inconsistent documentation hinder researchers ability to re-execute published methods. This study…

Human-Computer Interaction · Computer Science 2026-03-04 Fakhri Momeni , Sarah Sajid , Johannes Kiesel

Laying foundations to quantify the "Effort of Reproducibility"

Why are some research studies easy to reproduce while others are difficult? Casting doubt on the accuracy of scientific work is not fruitful, especially when an individual researcher cannot reproduce the claims made in the paper. There…

Digital Libraries · Computer Science 2023-08-25 Akhil Pandey Akella , David Koop , Hamed Alhoori

Reproducibility in Research: Systems, Infrastructure, Culture

The reproduction and replication of research results has become a major issue for a number of scientific disciplines. In computer science and related computational disciplines such as systems biology, the challenges closely revolve around…

Software Engineering · Computer Science 2017-07-31 Tom Crick , Benjamin A. Hall , Samin Ishtiaq

A Survey on Reproducibility by Evaluating Deep Reinforcement Learning Algorithms on Real-World Robots

As reinforcement learning (RL) achieves more success in solving complex tasks, more care is needed to ensure that RL research is reproducible and that algorithms herein can be compared easily and fairly with minimal bias. RL results are,…

Machine Learning · Computer Science 2019-09-12 Nicolai A. Lynnerup , Laura Nolling , Rasmus Hasle , John Hallam

A Guide to Computational Reproducibility in Signal Processing and Machine Learning

Computational reproducibility is a growing problem that has been extensively studied among computational researchers and within the signal processing and machine learning research community. However, with the changing landscape of signal…

Signal Processing · Electrical Eng. & Systems 2022-02-16 Joseph Shenouda , Waheed U. Bajwa

A Tutorial on the Design, Experimentation and Application of Metaheuristic Algorithms to Real-World Optimization Problems

In the last few years, the formulation of real-world optimization problems and their efficient solution via metaheuristic algorithms has been a catalyst for a myriad of research studies. In spite of decades of historical advancements on the…

Neural and Evolutionary Computing · Computer Science 2024-10-07 Eneko Osaba , Esther Villar-Rodriguez , Javier Del Ser , Antonio J. Nebro , Daniel Molina , Antonio LaTorre , Ponnuthurai N. Suganthan , Carlos A. Coello Coello , Francisco Herrera

Investigating Reproducibility in Deep Learning-Based Software Fault Prediction

Over the past few years, deep learning methods have been applied for a wide range of Software Engineering (SE) tasks, including in particular for the important task of automatically predicting and localizing faults in software. With the…

Software Engineering · Computer Science 2024-02-09 Adil Mukhtar , Dietmar Jannach , Franz Wotawa