Related papers: Towards Generating Executable Metamorphic Relation…

Metamorphic Testing of Large Language Models for Natural Language Processing

Using large language models (LLMs) to perform natural language processing (NLP) tasks has become increasingly pervasive in recent times. The versatile nature of LLMs makes them applicable to a wide range of such tasks. While the performance…

Software Engineering · Computer Science 2026-01-12 Steven Cho , Stefano Ruberto , Valerio Terragni

Bidirectional Empowerment of Metamorphic Testing and Large Language Models: A Systematic Survey

Large language models (LLMs) have introduced substantial challenges to software quality assurance due to their generative, probabilistic, and open-ended nature, which intensifies the oracle problem and limits the applicability of…

Software Engineering · Computer Science 2026-05-15 Zheng Zheng , Zenghui Zhou , Yinwang Xu , Daixu Ren , Tsong Yueh Chen

Towards a Complete Metamorphic Testing Pipeline

Metamorphic Testing (MT) addresses the test oracle problem by examining the relationships between input-output pairs in consecutive executions of the System Under Test (SUT). These relations, known as Metamorphic Relations (MRs), specify…

Software Engineering · Computer Science 2023-10-23 Alejandra Duque-Torres , Dietmar Pfahl

LLMORPH: Automated Metamorphic Testing of Large Language Models

Automated testing is essential for evaluating and improving the reliability of Large Language Models (LLMs), yet the lack of automated oracles for verifying output correctness remains a key challenge. We present LLMORPH, an automated…

Software Engineering · Computer Science 2026-03-26 Steven Cho , Stefano Ruberto , Valerio Terragni

Metamorphic Relation Prioritization for Effective Regression Testing

Metamorphic testing (MT) is widely used for testing programs that face the oracle problem. It uses a set of metamorphic relations (MRs), which are relations among multiple inputs and their corresponding outputs to determine whether the…

Software Engineering · Computer Science 2021-09-22 Madhusudan Srinivasan , Upulee Kanewala

METAL: Metamorphic Testing Framework for Analyzing Large-Language Model Qualities

Large-Language Models (LLMs) have shifted the paradigm of natural language data processing. However, their black-boxed and probabilistic characteristics can lead to potential risks in the quality of outputs in diverse LLM applications.…

Software Engineering · Computer Science 2023-12-12 Sangwon Hyun , Mingyu Guo , M. Ali Babar

LLM Assisted Coding with Metamorphic Specification Mutation Agent

Metamorphic Relations (MRs) serve as a foundational mechanism for generating semantically equivalent mutations. Software engineering has advanced significantly in recent years with the advent of Large Language Models (LLMs). However, the…

Software Engineering · Computer Science 2025-11-25 Mostafijur Rahman Akhond , Gias Uddin

A Replication Study on Predicting Metamorphic Relations at Unit Testing Level

Metamorphic Testing (MT) addresses the test oracle problem by examining the relations between inputs and outputs of test executions. Such relations are known as Metamorphic Relations (MRs). In current practice, identifying and selecting…

Software Engineering · Computer Science 2022-07-28 Alejandra Duque-Torres , Dietmar Pfahl , Rudolf Ramler , Claus Klammer

Towards the Automation of Metamorphic Testing in Model Transformations

Model transformations are the cornerstone of Model-Driven Engineering, and provide the essential mechanisms for manipulating and transforming models. Checking whether the output of a model transformation is correct is a manual and…

Software Engineering · Computer Science 2018-05-01 Javier Troya , Sergio Segura , Antonio Ruiz-Cortés

GenMorph: Automatically Generating Metamorphic Relations via Genetic Programming

Metamorphic testing is a popular approach that aims to alleviate the oracle problem in software testing. At the core of this approach are Metamorphic Relations (MRs), specifying properties that hold among multiple test inputs and…

Software Engineering · Computer Science 2024-06-06 Jon Ayerdi , Valerio Terragni , Gunel Jahangirova , Aitor Arrieta , Paolo Tonella

Validating LLM-Generated Programs with Metamorphic Prompt Testing

The latest paradigm shift in software development brings in the innovation and automation afforded by Large Language Models (LLMs), showcased by Generative Pre-trained Transformer (GPT), which has shown remarkable capacity to generate code…

Software Engineering · Computer Science 2024-06-12 Xiaoyin Wang , Dakai Zhu

An LLM Agentic Approach for Legal-Critical Software: A Case Study for Tax Prep Software

Large language models (LLMs) show promise for translating natural-language statutes into executable logic, but reliability in legally critical settings remains challenging due to ambiguity and hallucinations. We present an agentic approach…

Software Engineering · Computer Science 2026-03-05 Sina Gogani-Khiabani , Ashutosh Trivedi , Diptikalyan Saha , Saeid Tizpaz-Niari

Search-based Selection of Metamorphic Relations for Optimized Robustness Testing of Large Language Models

Assessing the trustworthiness of Large Language Models (LLMs), such as robustness, has garnered significant attention. Recently, metamorphic testing that defines Metamorphic Relations (MRs) has been widely applied to evaluate the robustness…

Software Engineering · Computer Science 2025-07-09 Sangwon Hyun , Shaukat Ali , M. Ali Babar

Metamorphic Testing of Vision-Language Action-Enabled Robots

Vision-Language-Action (VLA) models are multimodal robotic task controllers that, given an instruction and visual inputs, produce a sequence of low-level control actions (or motor commands) enabling a robot to execute the requested task in…

Robotics · Computer Science 2026-03-18 Pablo Valle , Sergio Segura , Shaukat Ali , Aitor Arrieta

Optimizing Metamorphic Testing: Prioritizing Relations Through Execution Profile Dissimilarity

An oracle determines whether the output of a program for executed test cases is correct. For machine learning programs, such an oracle is often unavailable or impractical to apply. Metamorphic testing addresses this by using metamorphic…

Software Engineering · Computer Science 2024-11-15 Madhusudan Srinivasan , Upulee Kanewala

From Untestable to Testable: Metamorphic Testing in the Age of LLMs

This article discusses the challenges of testing software systems with increasingly integrated AI and LLM functionalities. LLMs are powerful but unreliable, and labeled ground truth for testing rarely scales. Metamorphic Testing solves this…

Software Engineering · Computer Science 2026-03-27 Valerio Terragni

Application of property-based testing tools\\ for metamorphic testing

Metamorphic testing (MT) is a general approach for the testing of a specific kind of software systems -- so-called ``non-testable'', where the ``classical'' testing approaches are difficult to apply. MT is an effective approach for…

Software Engineering · Computer Science 2022-11-23 Nasser Alzahrani , Maria Spichkova , James Harland

Multi-Agent Specification-based Metamorphic Testing of FMU-Based Simulations

In many industrial domains, the Functional Mock-up Interface (FMI) is used to exchange simulation models as Functional Mock-up Units (FMUs) across different partners using various modelling tools. This opens up the possibilities for…

Software Engineering · Computer Science 2026-05-26 Ashir Kulshreshtha , Abdullah Mughees , Gaadha Sudheerbabu , Tanwir Ahmad , Kristian Klemets , Dragos Truscan , Mikael Manngård

Metamorphic Evaluation of ChatGPT as a Recommender System

With the rise of Large Language Models (LLMs) such as ChatGPT, researchers have been working on how to utilize the LLMs for better recommendations. However, although LLMs exhibit black-box and probabilistic characteristics (meaning their…

Information Retrieval · Computer Science 2024-11-20 Madhurima Khirbat , Yongli Ren , Pablo Castells , Mark Sanderson

MR-Scout: Automated Synthesis of Metamorphic Relations from Existing Test Cases

Metamorphic Testing (MT) alleviates the oracle problem by defining oracles based on metamorphic relations (MRs), that govern multiple related inputs and their outputs. However, designing MRs is challenging, as it requires domain-specific…

Software Engineering · Computer Science 2026-04-14 Congying Xu , Valerio Terragni , Hengcheng Zhu , Jiarong Wu , Shing-Chi Cheung