Related papers: Predictive Test Selection

Practical Flaky Test Prediction using Common Code Evolution and Test History Data

Non-deterministically behaving test cases cause developers to lose trust in their regression test suites and to eventually ignore failures. Detecting flaky tests is therefore a crucial task in maintaining code quality, as it builds the…

Software Engineering · Computer Science 2023-03-21 Martin Gruber , Michael Heine , Norbert Oster , Michael Philippsen , Gordon Fraser

Targeted Test Selection Approach in Continuous Integration

In modern software development change-based testing plays a crucial role. However, as codebases expand and test suites grow, efficiently managing the testing process becomes increasingly challenging, especially given the high frequency of…

Software Engineering · Computer Science 2025-09-15 Pavel Plyusnin , Aleksey Antonov , Vasilii Ermakov , Aleksandr Khaybriev , Margarita Kikot , Ilseyar Alimova , Stanislav Moiseev

Bayesian Predictive Probabilities for Online Experimentation

The widespread adoption of online randomized controlled experiments (A/B Tests) for decision-making has created ongoing capacity constraints which necessitate interim analyses. As a consequence, platform users are increasingly motivated to…

Applications · Statistics 2025-11-11 Abbas Zaidi , Rina Friedberg , Samir Khan , Yao-Yang Leow , Maulik Soneji , Houssam Nassif , Richard Mudd

On the Improvement of Predictive Modeling Using Bayesian Stacking and Posterior Predictive Checking

Model uncertainty is pervasive in real world analysis situations and is an often-neglected issue in applied statistics. However, standard approaches to the research process do not address the inherent uncertainty in model building and,…

Methodology · Statistics 2024-03-01 Mariana Nold , Florian Meinfelder , David Kaplan

FlakiMe: Laboratory-Controlled Test Flakiness Impact Assessment. A Case Study on Mutation Testing and Program Repair

Much research on software testing makes an implicit assumption that test failures are deterministic such that they always witness the presence of the same defects. However, this assumption is not always true because some test failures are…

Software Engineering · Computer Science 2019-12-09 Maxime Cordy , Renaud Rwemalika , Mike Papadakis , Mark Harman

Quality-preserving Model for Electronics Production Quality Tests Reduction

Manufacturing test flows in high-volume electronics production are typically fixed during product development and executed unchanged on every unit, even as failure patterns and process conditions evolve. This protects quality, but it also…

Machine Learning · Computer Science 2026-04-09 Noufa Haneefa , Teddy Lazebnik , Einav Peretz-Andersson

An Online Sequential Test for Qualitative Treatment Effects

Tech companies (e.g., Google or Facebook) often use randomized online experiments and/or A/B testing primarily based on the average treatment effects to compare their new product with an old one. However, it is also critically important to…

Methodology · Statistics 2021-11-09 Chengchun Shi , Shikai Luo , Hongtu Zhu , Rui Song

On the Use of Mutation in Injecting Test Order-Dependency

Background: Test flakiness is identified as a major issue that compromises the regression testing process of complex software systems. Flaky tests manifest non-deterministic behaviour, send confusing signals to developers, and break their…

Software Engineering · Computer Science 2021-04-16 Sarra Habchi , Maxime Cordy , Mike Papadakis , Yves Le Traon

What is the Vocabulary of Flaky Tests? An Extended Replication

Software systems have been continuously evolved and delivered with high quality due to the widespread adoption of automated tests. A recurring issue hurting this scenario is the presence of flaky tests, a test case that may pass or fail…

Software Engineering · Computer Science 2021-03-24 B. H. P. Camara , M. A. G. Silva , A. T. Endo , S. R. Vergilio

Feature-oriented Test Case Selection and Prioritization During the Evolution of Highly-Configurable Systems

Testing Highly Configurable Systems (HCSs) is a challenging task, especially in an evolution scenario where features are added, changed, or removed, which hampers test case selection and prioritization. Existing work is usually based on the…

Software Engineering · Computer Science 2024-06-24 Willian D. F. Mendonça , Wesley K. G. Assunção , Silvia R. Vergilio

Selective Prediction via Training Dynamics

Selective Prediction is the task of rejecting inputs a model would predict incorrectly on. This involves a trade-off between input space coverage (how many data points are accepted) and model utility (how good is the performance on accepted…

Machine Learning · Computer Science 2025-07-08 Stephan Rabanser , Anvith Thudi , Kimia Hamidieh , Adam Dziedzic , Israfil Bahceci , Akram Bin Sediq , Hamza Sokun , Nicolas Papernot

The Importance of Discerning Flaky from Fault-triggering Test Failures: A Case Study on the Chromium CI

Flaky tests are tests that pass and fail on different executions of the same version of a program under test. They waste valuable developer time by making developers investigate false alerts (flaky test failures). To deal with this problem,…

Software Engineering · Computer Science 2023-02-22 Guillaume Haben , Sarra Habchi , Mike Papadakis , Maxime Cordy , Yves Le Traon

Online experiment design for continuous-time systems using generalized filtering

The goal of experiment design is to select the inputs of a dynamical system in such a way that the resulting data contain sufficient information for system identification and data-driven control. This paper investigates the problem of…

Optimization and Control · Mathematics 2026-04-10 Jiwei Wang , Simone Baldi , Henk J. van Waarde

Practical Mutation Testing at Scale

Mutation analysis assesses a test suite's adequacy by measuring its ability to detect small artificial faults, systematically seeded into the tested program. Mutation analysis is considered one of the strongest test-adequacy criteria.…

Software Engineering · Computer Science 2021-03-01 Goran Petrović , Marko Ivanković , Gordon Fraser , René Just

Statistical Test for Feature Selection Pipelines by Selective Inference

A data analysis pipeline is a structured sequence of steps that transforms raw data into meaningful insights by integrating various analysis algorithms. In this paper, we propose a novel statistical test to assess the significance of data…

Machine Learning · Statistics 2024-10-15 Tomohiro Shiraishi , Tatsuya Matsukawa , Shuichi Nishino , Ichiro Takeuchi

Dynamic feature selection in medical predictive monitoring by reinforcement learning

In this paper, we investigate dynamic feature selection within multivariate time-series scenario, a common occurrence in clinical prediction monitoring where each feature corresponds to a bio-test result. Many existing feature selection…

Machine Learning · Computer Science 2024-05-31 Yutong Chen , Jiandong Gao , Ji Wu

A Survey on How Test Flakiness Affects Developers and What Support They Need To Address It

Non-deterministically passing and failing test cases, so-called flaky tests, have recently become a focus area of software engineering research. While this research focus has been met with some enthusiastic endorsement from industry, prior…

Software Engineering · Computer Science 2022-04-11 Martin Gruber , Gordon Fraser

A Stable, Fast, and Fully Automatic Learning Algorithm for Predictive Coding Networks

Predictive coding networks are neuroscience-inspired models with roots in both Bayesian statistics and neuroscience. Training such models, however, is quite inefficient and unstable. In this work, we show how by simply changing the temporal…

Neural and Evolutionary Computing · Computer Science 2024-02-08 Tommaso Salvatori , Yuhang Song , Yordan Yordanov , Beren Millidge , Zhenghua Xu , Lei Sha , Cornelius Emde , Rafal Bogacz , Thomas Lukasiewicz

A New Flexible Train-Test Split Algorithm, an approach for choosing among the Hold-out, K-fold cross-validation, and Hold-out iteration

Choosing an appropriate strategy for partitioning data into training and evaluation sets is a critical step in machine learning, yet validation methods are often selected using default or conventional settings without considering their…

Machine Learning · Computer Science 2026-01-05 Zahra Bami , Ali Behnampour , Aniruddha Bora , Hassan Doosti

Fine-Grained Assertion-Based Test Selection

For large software applications, running the whole test suite after each code change is time- and resource-intensive. Regression test selection techniques aim at reducing test execution time by selecting only the tests that are affected by…

Software Engineering · Computer Science 2025-04-29 Sijia Gu , Ali Mesbah