Related papers: MLXP: A Framework for Conducting Replicable Experi…

Reproducible Experiment Platform

Data analysis in fundamental sciences nowadays is an essential process that pushes frontiers of our knowledge and leads to new discoveries. At the same time we can see that complexity of those analyses increases fast due to a)~enormous…

Data Analysis, Statistics and Probability · Physics 2016-01-20 Tatiana Likhomanenko , Alex Rogozhnikov , Alexander Baranov , Egor Khairullin , Andrey Ustyuzhanin

Helix 1.0: An Open-Source Framework for Reproducible and Interpretable Machine Learning on Tabular Scientific Data

Helix is an open-source, extensible, Python-based software framework to facilitate reproducible and interpretable machine learning workflows for tabular data. It addresses the growing need for transparent experimental data analytics…

Machine Learning · Computer Science 2025-07-25 Eduardo Aguilar-Bejarano , Daniel Lea , Karthikeyan Sivakumar , Jimiama M. Mase , Reza Omidvar , Ruizhe Li , Troy Kettle , James Mitchell-White , Morgan R Alexander , David A Winkler , Grazziela Figueredo

Let's Talk About It: Making Scientific Computational Reproducibility Easy

Computational reproducibility of scientific results, that is, the execution of a computational experiment (e.g., a script) using its original settings (data, code, etc.), should always be possible. However, reproducibility has become a…

Human-Computer Interaction · Computer Science 2025-04-15 Lázaro Costa , Susana Barbosa , Jácome Cunha

Reproducibility in Machine Learning-Driven Research

Research is facing a reproducibility crisis, in which the results and findings of many studies are difficult or even impossible to reproduce. This is also the case in machine learning (ML) and artificial intelligence (AI) research. Often,…

Machine Learning · Computer Science 2023-07-21 Harald Semmelrock , Simone Kopeinik , Dieter Theiler , Tony Ross-Hellauer , Dominik Kowald

Reproducibility in Machine Learning-based Research: Overview, Barriers and Drivers

Many research fields are currently reckoning with issues of poor levels of reproducibility. Some label it a "crisis", and research employing or building Machine Learning (ML) models is no exception. Issues including lack of transparency,…

Software Engineering · Computer Science 2025-02-27 Harald Semmelrock , Tony Ross-Hellauer , Simone Kopeinik , Dieter Theiler , Armin Haberl , Stefan Thalmann , Dominik Kowald

The Reproducible Research Platform establishes a unified open science environment bridging data and software lifecycles across disciplines, from proposal to publication

Many research groups aspire to make data and code FAIR and reproducible, yet struggle because the data and code life cycles are disconnected, executable environments are often missing from published work, and technical skill requirements…

Digital Libraries · Computer Science 2025-12-09 Andreas P. Cuny , Henry Lütcke , Andrei-Valentin Plamadă , Antti Luomi , John Hennig , Matthew Baker , Fabian Rudolf , Bernd Rinn

A Backend Platform for Supporting the Reproducibility of Computational Experiments

In recent years, the research community has raised serious questions about the reproducibility of scientific work. In particular, since many studies include some kind of computing work, reproducibility is also a technological challenge, not…

Software Engineering · Computer Science 2023-08-03 Lázaro Costa , Susana Barbosa , Jácome Cunha

Towards Continuous Experiment-driven MLOps

Despite advancements in MLOps and AutoML, ML development still remains challenging for data scientists. First, there is poor support for and limited control over optimizing and evolving ML models. Second, there is lack of efficient…

Software Engineering · Computer Science 2025-03-06 Keerthiga Rajenthiram , Milad Abdullah , Ilias Gerostathopoulos , Petr Hnetynka , Tomáš Bureš , Gerard Pons , Besim Bilalli , Anna Queralt

MLDev: Data Science Experiment Automation and Reproducibility Software

In this paper we explore the challenges of automating experiments in data science. We propose an extensible experiment model as a foundation for integration of different open source tools for running research experiments. We implement our…

Machine Learning · Computer Science 2022-09-21 Anton Khritankov , Nikita Pershin , Nikita Ukhov , Artem Ukhov

More Rigorous Software Engineering Would Improve Reproducibility in Machine Learning Research

While experimental reproduction remains a pillar of the scientific method, we observe that the software best practices supporting the reproduction of machine learning ( ML ) research are often undervalued or overlooked, leading both to poor…

Software Engineering · Computer Science 2025-09-03 Moritz Wolter , Lokesh Veeramacheneni , Charles Tapley Hoyt

Memento: Facilitating Effortless, Efficient, and Reliable ML Experiments

Running complex sets of machine learning experiments is challenging and time-consuming due to the lack of a unified framework. This leaves researchers forced to spend time implementing necessary features such as parallelization, caching,…

Machine Learning · Computer Science 2023-11-22 Zac Pullar-Strecker , Xinglong Chang , Liam Brydon , Ioannis Ziogas , Katharina Dost , Jörg Wicker

Machine Learning Pipelines: Provenance, Reproducibility and FAIR Data Principles

Machine learning (ML) is an increasingly important scientific tool supporting decision making and knowledge generation in numerous fields. With this, it also becomes more and more important that the results of ML experiments are…

Machine Learning · Computer Science 2020-06-23 Sheeba Samuel , Frank Löffler , Birgitta König-Ries

Improving Reproducibility in Machine Learning Research (A Report from the NeurIPS 2019 Reproducibility Program)

One of the challenges in machine learning research is to ensure that presented and published results are sound and reliable. Reproducibility, that is obtaining similar results as presented in a paper or talk, using the same code and data…

Machine Learning · Computer Science 2021-01-01 Joelle Pineau , Philippe Vincent-Lamarre , Koustuv Sinha , Vincent Larivière , Alina Beygelzimer , Florence d'Alché-Buc , Emily Fox , Hugo Larochelle

dalex: Responsible Machine Learning with Interactive Explainability and Fairness in Python

The increasing amount of available data, computing power, and the constant pursuit for higher performance results in the growing complexity of predictive models. Their black-box nature leads to opaqueness debt phenomenon inflicting…

Machine Learning · Computer Science 2021-10-12 Hubert Baniecki , Wojciech Kretowicz , Piotr Piatyszek , Jakub Wisniewski , Przemyslaw Biecek

Reproducibility of Machine Learning-Based Fault Detection and Diagnosis for HVAC Systems in Buildings: An Empirical Study

Reproducibility is a cornerstone of scientific research, enabling independent verification and validation of empirical findings. The topic gained prominence in fields such as psychology and medicine, where concerns about non - replicable…

Machine Learning · Computer Science 2025-08-05 Adil Mukhtar , Michael Hadwiger , Franz Wotawa , Gerald Schweiger

Laying foundations to quantify the "Effort of Reproducibility"

Why are some research studies easy to reproduce while others are difficult? Casting doubt on the accuracy of scientific work is not fruitful, especially when an individual researcher cannot reproduce the claims made in the paper. There…

Digital Libraries · Computer Science 2023-08-25 Akhil Pandey Akella , David Koop , Hamed Alhoori

Reproducibility in Machine Learning for Health

Machine learning algorithms designed to characterize, monitor, and intervene on human health (ML4H) are expected to perform safely and reliably when operating at scale, potentially outside strict human supervision. This requirement warrants…

Machine Learning · Computer Science 2019-07-03 Matthew B. A. McDermott , Shirly Wang , Nikki Marinsek , Rajesh Ranganath , Marzyeh Ghassemi , Luca Foschini

A Survey on Reproducibility by Evaluating Deep Reinforcement Learning Algorithms on Real-World Robots

As reinforcement learning (RL) achieves more success in solving complex tasks, more care is needed to ensure that RL research is reproducible and that algorithms herein can be compared easily and fairly with minimal bias. RL results are,…

Machine Learning · Computer Science 2019-09-12 Nicolai A. Lynnerup , Laura Nolling , Rasmus Hasle , John Hallam

GPLMT: A Lightweight Experimentation and Testbed Management Framework

Conducting experiments in federated, distributed, and heterogeneous testbeds is a challenging task for researchers. Researchers have to take care of the whole experiment life cycle, ensure the reproducibility of each run, and the…

Networking and Internet Architecture · Computer Science 2016-01-18 Matthias Wachs , Nadine Herold , Stephan-A. Posselt , Florian Dold , Georg Carle

TARexp: A Python Framework for Technology-Assisted Review Experiments

Technology-assisted review (TAR) is an important industrial application of information retrieval (IR) and machine learning (ML). While a small TAR research community exists, the complexity of TAR software and workflows is a major barrier to…

Information Retrieval · Computer Science 2022-04-26 Eugene Yang , David D. Lewis