English
Related papers

Related papers: MLXP: A Framework for Conducting Replicable Experi…

200 papers

Data analysis in fundamental sciences nowadays is an essential process that pushes frontiers of our knowledge and leads to new discoveries. At the same time we can see that complexity of those analyses increases fast due to a)~enormous…

Data Analysis, Statistics and Probability · Physics 2016-01-20 Tatiana Likhomanenko , Alex Rogozhnikov , Alexander Baranov , Egor Khairullin , Andrey Ustyuzhanin

Helix is an open-source, extensible, Python-based software framework to facilitate reproducible and interpretable machine learning workflows for tabular data. It addresses the growing need for transparent experimental data analytics…

Computational reproducibility of scientific results, that is, the execution of a computational experiment (e.g., a script) using its original settings (data, code, etc.), should always be possible. However, reproducibility has become a…

Human-Computer Interaction · Computer Science 2025-04-15 Lázaro Costa , Susana Barbosa , Jácome Cunha

Research is facing a reproducibility crisis, in which the results and findings of many studies are difficult or even impossible to reproduce. This is also the case in machine learning (ML) and artificial intelligence (AI) research. Often,…

Machine Learning · Computer Science 2023-07-21 Harald Semmelrock , Simone Kopeinik , Dieter Theiler , Tony Ross-Hellauer , Dominik Kowald

Many research fields are currently reckoning with issues of poor levels of reproducibility. Some label it a "crisis", and research employing or building Machine Learning (ML) models is no exception. Issues including lack of transparency,…

Software Engineering · Computer Science 2025-02-27 Harald Semmelrock , Tony Ross-Hellauer , Simone Kopeinik , Dieter Theiler , Armin Haberl , Stefan Thalmann , Dominik Kowald

Many research groups aspire to make data and code FAIR and reproducible, yet struggle because the data and code life cycles are disconnected, executable environments are often missing from published work, and technical skill requirements…

In recent years, the research community has raised serious questions about the reproducibility of scientific work. In particular, since many studies include some kind of computing work, reproducibility is also a technological challenge, not…

Software Engineering · Computer Science 2023-08-03 Lázaro Costa , Susana Barbosa , Jácome Cunha

Despite advancements in MLOps and AutoML, ML development still remains challenging for data scientists. First, there is poor support for and limited control over optimizing and evolving ML models. Second, there is lack of efficient…

In this paper we explore the challenges of automating experiments in data science. We propose an extensible experiment model as a foundation for integration of different open source tools for running research experiments. We implement our…

Machine Learning · Computer Science 2022-09-21 Anton Khritankov , Nikita Pershin , Nikita Ukhov , Artem Ukhov

While experimental reproduction remains a pillar of the scientific method, we observe that the software best practices supporting the reproduction of machine learning ( ML ) research are often undervalued or overlooked, leading both to poor…

Software Engineering · Computer Science 2025-09-03 Moritz Wolter , Lokesh Veeramacheneni , Charles Tapley Hoyt

Running complex sets of machine learning experiments is challenging and time-consuming due to the lack of a unified framework. This leaves researchers forced to spend time implementing necessary features such as parallelization, caching,…

Machine Learning · Computer Science 2023-11-22 Zac Pullar-Strecker , Xinglong Chang , Liam Brydon , Ioannis Ziogas , Katharina Dost , Jörg Wicker

Machine learning (ML) is an increasingly important scientific tool supporting decision making and knowledge generation in numerous fields. With this, it also becomes more and more important that the results of ML experiments are…

Machine Learning · Computer Science 2020-06-23 Sheeba Samuel , Frank Löffler , Birgitta König-Ries

One of the challenges in machine learning research is to ensure that presented and published results are sound and reliable. Reproducibility, that is obtaining similar results as presented in a paper or talk, using the same code and data…

The increasing amount of available data, computing power, and the constant pursuit for higher performance results in the growing complexity of predictive models. Their black-box nature leads to opaqueness debt phenomenon inflicting…

Machine Learning · Computer Science 2021-10-12 Hubert Baniecki , Wojciech Kretowicz , Piotr Piatyszek , Jakub Wisniewski , Przemyslaw Biecek

Reproducibility is a cornerstone of scientific research, enabling independent verification and validation of empirical findings. The topic gained prominence in fields such as psychology and medicine, where concerns about non - replicable…

Machine Learning · Computer Science 2025-08-05 Adil Mukhtar , Michael Hadwiger , Franz Wotawa , Gerald Schweiger

Why are some research studies easy to reproduce while others are difficult? Casting doubt on the accuracy of scientific work is not fruitful, especially when an individual researcher cannot reproduce the claims made in the paper. There…

Digital Libraries · Computer Science 2023-08-25 Akhil Pandey Akella , David Koop , Hamed Alhoori

Machine learning algorithms designed to characterize, monitor, and intervene on human health (ML4H) are expected to perform safely and reliably when operating at scale, potentially outside strict human supervision. This requirement warrants…

Machine Learning · Computer Science 2019-07-03 Matthew B. A. McDermott , Shirly Wang , Nikki Marinsek , Rajesh Ranganath , Marzyeh Ghassemi , Luca Foschini

As reinforcement learning (RL) achieves more success in solving complex tasks, more care is needed to ensure that RL research is reproducible and that algorithms herein can be compared easily and fairly with minimal bias. RL results are,…

Machine Learning · Computer Science 2019-09-12 Nicolai A. Lynnerup , Laura Nolling , Rasmus Hasle , John Hallam

Conducting experiments in federated, distributed, and heterogeneous testbeds is a challenging task for researchers. Researchers have to take care of the whole experiment life cycle, ensure the reproducibility of each run, and the…

Networking and Internet Architecture · Computer Science 2016-01-18 Matthias Wachs , Nadine Herold , Stephan-A. Posselt , Florian Dold , Georg Carle

Technology-assisted review (TAR) is an important industrial application of information retrieval (IR) and machine learning (ML). While a small TAR research community exists, the complexity of TAR software and workflows is a major barrier to…

Information Retrieval · Computer Science 2022-04-26 Eugene Yang , David D. Lewis
‹ Prev 1 2 3 10 Next ›