Related papers: Simulink Mutation Testing using CodeBERT

$\mu$BERT: Mutation Testing using Pre-Trained Language Models

We introduce $\mu$BERT, a mutation testing tool that uses a pre-trained language model (CodeBERT) to generate mutants. This is done by masking a token from the expression given as input and using CodeBERT to predict it. Thus, the mutants…

Software Engineering · Computer Science 2022-03-08 Renzo Degiovanni, Mike Papadakis

Timed Model-Based Mutation Operators for Simulink Models

Model-based mutation analysis is a recent research area, and real-time system testing can benefit from using model mutants. Model-based mutation testing (MBMT) is a particular branch of model-based testing. It generates faulty versions of a…

Software Engineering · Computer Science 2023-01-04 Jian Chen, Manar H. Alalfi, Thomas R. Dean

Exploring the Potential of Large Language Models in Simulink-Stateflow Mutant Generation

Mutation analysis is a powerful technique for assessing test-suite adequacy, yet conventional approaches suffer from generating redundant, equivalent, or non-executable mutants. These challenges are particularly amplified in…

Software Engineering · Computer Science 2026-02-16 Pablo Valle, Shaukat Ali, Aitor Arrieta

Efficient Mutation Testing via Pre-Trained Language Models

Mutation testing is an established fault-based testing technique. It operates by seeding faults into the programs under test and asking developers to write tests that reveal these faults. These tests have the potential to reveal a large…

Software Engineering · Computer Science 2023-01-10 Ahmed Khanfir, Renzo Degiovanni, Mike Papadakis, Yves Le Traon

PITMuS: A Tool for Automated Bug Dataset Generation via Source-Level Mutant Reconstruction

LLM-based software engineering increasingly depends on executable, context-rich bug artifacts: paired correct and buggy code, methods under test (MUTs), documentation, and metadata. These artifacts support the training and evaluation of…

Software Engineering · Computer Science 2026-05-22 Tasfia Tasnim, Soneya Binta Hossain

SimuGen: Multi-modal Agentic Framework for Constructing Block Diagram-Based Simulation Models

Recent advances in large language models (LLMs) have shown impressive performance in mathematical reasoning and code generation. However, LLMs still struggle in the simulation domain, particularly in generating Simulink models, which are…

Machine Learning · Computer Science 2025-09-01 Xinxing Ren, Qianbo Zang, Zekun Guo

Vulnerability Mimicking Mutants

With the increasing release of powerful language models trained on large code corpus (e.g. CodeBERT was trained on 6.4 million programs), a new family of mutation testing tools has arisen with the promise to generate more "natural" mutants…

Software Engineering · Computer Science 2023-03-09 Aayush Garg, Renzo Degiovanni, Mike Papadakis, Yves Le Traon

Behavioral Compatibility of Simulink Models for Product Line Maintenance and Evolution

Embedded software systems, e.g. automotive, robotic or automation systems are highly configurable and consist of many software components being available in different variants and versions. To identify the degree of reusability between…

Software Engineering · Computer Science 2015-11-18 Bernhard Rumpe, Christoph Schulze, Michael von Wenckstern, Jan Oliver Ringert, Peter Manhart

SLGPT: Using Transfer Learning to Directly Generate Simulink Model Files and Find Bugs in the Simulink Toolchain

Finding bugs in a commercial cyber-physical system (CPS) development tool such as Simulink is hard as its codebase contains millions of lines of code and complete formal language specifications are not available. While deep learning…

Software Engineering · Computer Science 2022-03-11 Sohil Lal Shrestha, Christoph Csallner

Contextual Predictive Mutation Testing

Mutation testing is a powerful technique for assessing and improving test suite quality that artificially introduces bugs and checks whether the test suites catch them. However, it is also computationally expensive and thus does not scale…

Software Engineering · Computer Science 2023-09-06 Kush Jain, Uri Alon, Alex Groce, Claire Le Goues

SMT-Based Model Checking of Industrial Simulink Models

The development of embedded systems requires formal analysis of models such as those described with MATLAB/Simulink. However, the increasing complexity of industrial models makes analysis difficult. This paper proposes a model checking…

Logic in Computer Science · Computer Science 2022-06-08 Daisuke Ishii, Takashi Tomita, Toshiaki Aoki, The Quyen Ngo, Thi Bich Ngoc Do, Hideaki Takai

Mutation Testing via Iterative Large Language Model-Driven Scientific Debugging

Large Language Models (LLMs) can generate plausible test code. Intuitively they generate this by imitating tests seen in their training data, rather than reasoning about execution semantics. However, such reasoning is important when…

Software Engineering · Computer Science 2025-03-12 Philipp Straubinger, Marvin Kreis, Stephan Lukasczyk, Gordon Fraser

QMutBench: A Dataset of Quantum Circuit Mutants

Quantum software testing has attracted interest in recent years, prompting the development of various techniques to automate the testing of quantum software. These techniques generate test cases that must be assessed for their effectiveness…

Software Engineering · Computer Science 2026-04-20 Eñaut Mendiluze Usandizaga, Thomas Laurent, Paolo Arcaini, Shaukat Ali

Simulation-based Testing of Simulink Models with Test Sequence and Test Assessment Blocks

Simulation-based software testing supports engineers in finding faults in Simulink models. It typically relies on search algorithms that iteratively generate test inputs used to exercise models in simulation to detect design errors. While…

Software Engineering · Computer Science 2022-12-23 Federico Formica, Tony Fan, Akshay Rajhans, Vera Pantelic, Mark Lawford, Claudio Menghi

Evaluating Model Testing and Model Checking for Finding Requirements Violations in Simulink Models

Matlab/Simulink is a development and simulation language that is widely used by the Cyber-Physical System (CPS) industry to model dynamical systems. There are two mainstream approaches to verify CPS Simulink models: model testing that…

Software Engineering · Computer Science 2019-05-10 Shiva Nejati, Khouloud Gaaloul, Claudio Menghi, Lionel C. Briand, Stephen Foster, David Wolfe

LLMorpheus: Mutation Testing using Large Language Models

In mutation testing, the quality of a test suite is evaluated by introducing faults into a program and determining whether the program's tests detect them. Most existing approaches for mutation testing involve the application of a fixed set…

Software Engineering · Computer Science 2025-03-10 Frank Tip, Jonathan Bell, Max Schaefer

CodeBERT: A Pre-Trained Model for Programming and Natural Languages

We present CodeBERT, a bimodal pre-trained model for programming language (PL) and natural language (NL). CodeBERT learns general-purpose representations that support downstream NL-PL applications such as natural language code search, code…

Computation and Language · Computer Science 2020-09-21 Zhangyin Feng, Daya Guo, Duyu Tang, Nan Duan, Xiaocheng Feng, Ming Gong, Linjun Shou, Bing Qin, Ting Liu, Daxin Jiang, Ming Zhou

Requirements-driven Slicing of Simulink Models Using LLMs

Model slicing is a useful technique for identifying a subset of a larger model that is relevant to fulfilling a given requirement. Notable applications of slicing include reducing inspection effort when checking design adequacy to meet…

Software Engineering · Computer Science 2024-05-06 Dipeeka Luitel, Shiva Nejati, Mehrdad Sabetzadeh

MetricBERT: Text Representation Learning via Self-Supervised Triplet Training

We present MetricBERT, a BERT-based model that learns to embed text under a well-defined similarity metric while simultaneously adhering to the "traditional" masked-language task. We focus on downstream tasks of learning similarities for…

Computation and Language · Computer Science 2022-08-16 Itzik Malkiel, Dvir Ginzburg, Oren Barkan, Avi Caciularu, Yoni Weill, Noam Koenigstein

Supporting Modularity in Simulink Models

Model-Based Development (MBD) is widely used for embedded controls development, with Matlab Simulink being one of the most used modelling environments in industry. As with all software, Simulink models are subject to evolution over their…

Software Engineering · Computer Science 2020-07-21 Monika Jaskolka, Vera Pantelic, Alan Wassyng, Mark Lawford