Related papers: Agentic Property-Based Testing: Finding Bugs Acros…

PBT-Bench: Benchmarking AI Agents on Property-Based Testing

Existing code benchmarks measure whether an agent can produce any test that reproduces a known bug, or whether it can produce a patch that fixes a described issue. Neither isolates the distinct skill of property-based testing: deriving a…

Software Engineering · Computer Science 2026-05-21 Lucas Jing , Xinqi Wang , Liao Zhang , Simon S. Du

Can Large Language Models Write Good Property-Based Tests?

Property-based testing (PBT), while an established technique in the software testing research community, is still relatively underused in real-world software. Pain points in writing property-based tests include implementing diverse random…

Software Engineering · Computer Science 2024-07-23 Vasudev Vikram , Caroline Lemieux , Joshua Sunshine , Rohan Padhye

Property-Based Testing by Elaborating Proof Outlines

Property-based testing (PBT) is a technique for validating code against an executable specification by automatically generating test-data. We present a proof-theoretical reconstruction of this style of testing for relational specifications…

Logic in Computer Science · Computer Science 2025-01-22 Dale Miller , Alberto Momigliano

Understanding the Characteristics of LLM-Generated Property-Based Tests in Exploring Edge Cases

As Large Language Models (LLMs) increasingly generate code in software development, ensuring the quality of LLM-generated code has become important. Traditional testing approaches using Example-based Testing (EBT) often miss edge cases --…

Software Engineering · Computer Science 2025-10-30 Hidetake Tanaka , Haruto Tanaka , Kazumasa Shimari , Kenichi Matsumoto

From Natural Language to Executable Properties for Property-based Testing of Mobile Apps

Property-based testing (PBT) is a popular software testing methodology and is effective in validating the functionality of mobile applications (apps for short). However, its adoption in practice remains limited, largely due to the manual…

Software Engineering · Computer Science 2026-03-24 Yiheng Xiong , Ting Su , Jingling Sun , Jue Wang , Qin Li , Geguang Pu , Zhendong Su

Programmable Property-Based Testing

Property-based testing (PBT) is a popular technique for establishing confidence in software, where users write properties -- i.e., executable specifications -- that can be checked many times in a loop by a testing framework. In modern PBT…

Software Engineering · Computer Science 2026-02-24 Alperen Keles , Justine Frank , Ceren Mert , Harrison Goldstein , Leonidas Lampropoulos

Fail Faster: Staging and Fast Randomness for High-Performance PBT

Property-based testing (PBT) relies on generators for random test cases, often constructed using embedded domain specific languages, which provide expressive combinators for building and composing generators. The effectiveness of PBT…

Programming Languages · Computer Science 2026-04-08 Cynthia Richey , Joseph W. Cutler , Harrison Goldstein , Benjamin C. Pierce

Automated, Targeted Testing of Property-Based Testing Predicates

Context: This work is based on property-based testing (PBT). PBT is an increasingly important form of software testing. Furthermore, it serves as a concrete gateway into the abstract area of formal methods. Specifically, we focus on…

Programming Languages · Computer Science 2021-11-23 Tim Nelson , Elijah Rivera , Sam Soucie , Thomas Del Vecchio , John Wrenn , Shriram Krishnamurthi

Agent Audit: A Security Analysis System for LLM Agent Applications

What should a developer inspect before deploying an LLM agent: the model, the tool code, the deployment configuration, or all three? In practice, many security failures in agent systems arise not from model weights alone, but from the…

Cryptography and Security · Computer Science 2026-03-25 Haiyue Zhang , Yi Nian , Yue Zhao

Type-level Property Based Testing

We present an automated framework for solidifying the cohesion between software specifications, their dependently typed models, and implementation at compile time. Model Checking and type checking are currently separate techniques for…

Programming Languages · Computer Science 2024-07-18 Thomas Ekström Hansen , Edwin Brady

We've Got You Covered: Type-Guided Repair of Incomplete Input Generators

Property-based testing (PBT) is a popular technique for automatically testing semantic properties of a program, specified as a pair of pre- and post-conditions. The efficacy of this approach depends on being able to quickly generate inputs…

Programming Languages · Computer Science 2025-10-07 Patrick LaFontaine , Zhe Zhou , Ashish Mishra , Suresh Jagannathan , Benjamin Delaware

Application of property-based testing tools\\ for metamorphic testing

Metamorphic testing (MT) is a general approach for the testing of a specific kind of software systems -- so-called ``non-testable'', where the ``classical'' testing approaches are difficult to apply. MT is an effective approach for…

Software Engineering · Computer Science 2022-11-23 Nasser Alzahrani , Maria Spichkova , James Harland

Beyond Pass/Fail: The Story of Learning-Based Testing

Learning-Based Testing (LBT) merges learning and testing processes to achieve both testing and behavioral adequacy. LBT utilizes active learning to infer the model of the System Under Test (SUT), enabling scalability for large and complex…

Software Engineering · Computer Science 2025-10-02 Sheikh Md. Mushfiqur Rahman , Nasir Eisty

Tuning Random Generators: Property-Based Testing as Probabilistic Programming

Property-based testing validates software against an executable specification by evaluating it on randomly generated inputs. The standard way that PBT users generate test inputs is via generators that describe how to sample test inputs…

Programming Languages · Computer Science 2025-11-18 Ryan Tjoa , Poorva Garg , Harrison Goldstein , Todd Millstein , Benjamin Pierce , Guy Van den Broeck

Evaluating Agent-based Program Repair at Google

Agent-based program repair offers to automatically resolve complex bugs end-to-end by combining the planning, tool use, and code generation abilities of modern LLMs. Recent work has explored the use of agent-based repair approaches on the…

Software Engineering · Computer Science 2025-01-14 Pat Rondon , Renyao Wei , José Cambronero , Jürgen Cito , Aaron Sun , Siddhant Sanyam , Michele Tufano , Satish Chandra

An Empirical Study on LLM-based Agents for Automated Bug Fixing

Large language models (LLMs) and LLM-based Agents have been applied to fix bugs automatically, demonstrating the capability in addressing software defects by engaging in development environment interaction, iterative validation and code…

Software Engineering · Computer Science 2025-10-21 Xiangxin Meng , Zexiong Ma , Pengfei Gao , Chao Peng

Agentic Model Checking

Verifying LLM-generated systems code is hard: bugs are prevalent, formal specifications are missing, and safety contracts are encoded implicitly at call sites rather than enforced at function boundaries. We propose agentic model checking, a…

Software Engineering · Computer Science 2026-05-21 Youcheng Sun , Jiawen Liu , Daniel Kroening , Jason Xue

Etna: An Evaluation Platform for Property-Based Testing

Property-based testing is a mainstay of functional programming, boasting a rich literature, an enthusiastic user community, and an abundance of tools~ -- so many, indeed, that new users may have difficulty choosing. Moreover, any given…

Software Engineering · Computer Science 2026-03-31 Alperen Keles , Jessica Shi , Nikhil Kamath , Tin Nam Liu , Ceren Mert , Harrison Goldstein , Benjamin C. Pierce , Leonidas Lampropoulos

PAGENT: Learning to Patch Software Engineering Agents

LLM Agents produce patches automatically to resolve an issue. However, they can generate inaccurate patches. Little is known about the root causes behind those failed patches or how those could be fixed. This paper reports an empirical…

Software Engineering · Computer Science 2025-06-24 Haoran Xue , Gias Uddin , Song Wang

LLM-based Unit Test Generation via Property Retrieval

Automated unit test generation has been widely studied, with Large Language Models (LLMs) recently showing significant potential. Moreover, in the context of unit test generation, these tools prioritize high code coverage, often at the…

Software Engineering · Computer Science 2024-10-18 Zhe Zhang , Xingyu Liu , Yuanzhang Lin , Xiang Gao , Hailong Sun , Yuan Yuan