Related papers: Improving Counterexample Quality from Failed Progr…

Combining Tests and Proofs for Better Software Verification

Test or prove? These two approaches to software verification have long been presented as opposites. One is dynamic, the other static: a test executes the program, a proof only analyzes the program text. A different perspective is emerging,…

Software Engineering · Computer Science 2026-02-10 Li Huang , Bertrand Meyer , Manuel Oriol

A Failed Proof Can Yield a Useful Test

A successful automated program proof is, in software verification, the ultimate triumph. In practice, however, the road to such success is paved with many failed proof attempts. Unlike a failed test, which provides concrete evidence of an…

Software Engineering · Computer Science 2023-04-25 Li Huang , Bertrand Meyer

ACE: Adapting sampling for Counterfactual Explanations

Counterfactual Explanations (CFEs) interpret machine learning models by identifying the smallest change to input features needed to change the model's prediction to a desired output. For classification tasks, CFEs determine how close a…

Machine Learning · Computer Science 2025-10-01 Margarita A. Guerrero , Cristian R. Rojas

Your Proof Fails? Testing Helps to Find the Reason

Applying deductive verification to formally prove that a program respects its formal specification is a very complex and time-consuming task due in particular to the lack of feedback in case of proof failures. Along with a non-compliance…

Software Engineering · Computer Science 2015-08-10 Guillaume Petiot , Nikolai Kosmatov , Bernard Botella , Alain Giorgetti , Jacques Julliand

Understanding Counterexamples for Relational Properties with DIbugger

Software verification is a tedious process that involves the analysis of multiple failed verification attempts, and adjustments of the program or specification. This is especially the case for complex requirements, e.g., regarding security…

Software Engineering · Computer Science 2019-07-10 Mihai Herda , Michael Kirsten , Etienne Brunner , Joana Plewnia , Ulla Scheler , Chiara Staudenmaier , Benedikt Wagner , Pascal Zwick , Bernhard Beckert

ExVerus: Verus Proof Repair via Counterexample Reasoning

Large Language Models (LLMs) have shown promising results in automating formal verification. However, existing approaches treat proof generation as a static, end-to-end prediction over source code, relying on limited verifier feedback and…

Programming Languages · Computer Science 2026-03-31 Jun Yang , Yuechun Sun , Yi Wu , Rodrigo Caridad , Yongwei Yuan , Jianan Yao , Shan Lu , Kexin Pei

High-level Counterexamples for Probabilistic Automata

Providing compact and understandable counterexamples for violated system properties is an essential task in model checking. Existing works on counterexamples for probabilistic systems so far computed either a large set of system runs or a…

Software Engineering · Computer Science 2016-11-26 Ralf Wimmer , Nils Jansen , Erika Ábrahám , Joost-Pieter Katoen

Teaching by Failure: Counter-Example-Driven Curricula for Transformer Self-Improvement

Transformer models often exhibit brittle extrapolation, failing on inputs that are longer or structurally more complex than those seen during training. We introduce Counter-Example-Driven Curricula (CEDC), an automated framework that…

Machine Learning · Computer Science 2025-12-02 Harshil Vejendla

Counterexample Classification

In model checking, when a given model fails to satisfy the desired specification, a typical model checker provides a counterexample that illustrates how the violation occurs. In general, there exist many diverse counterexamples that exhibit…

Software Engineering · Computer Science 2021-10-12 Cole Vick , Eunsuk Kang , Stavros Tripakis

Proof Simplification and Automated Theorem Proving

The proofs first generated by automated theorem provers are far from optimal by any measure of simplicity. In this paper I describe a technique for simplifying automated proofs. Hopefully this discussion will stimulate interest in the…

Logic in Computer Science · Computer Science 2021-01-19 Michael Kinyon

miniCodeProps: a Minimal Benchmark for Proving Code Properties

AI agents have shown initial promise in automating mathematical theorem proving in proof assistants such as Lean. The same proof assistants can be used to verify the correctness of code by pairing code with specifications and proofs that…

Software Engineering · Computer Science 2024-10-11 Evan Lohn , Sean Welleck

The Counterfeit Conundrum: Can Code Language Models Grasp the Nuances of Their Incorrect Generations?

While language models are increasingly more proficient at code generation, they still frequently generate incorrect programs. Many of these programs are obviously wrong, but others are more subtle and pass weaker correctness checks such as…

Software Engineering · Computer Science 2024-03-01 Alex Gu , Wen-Ding Li , Naman Jain , Theo X. Olausson , Celine Lee , Koushik Sen , Armando Solar-Lezama

Counterfactual Explanations for Integer Optimization Problems

Counterfactual explanations (CEs) offer a human-understandable way to explain decisions by identifying specific changes to the input parameters of a base or present model that would lead to a desired change in the outcome. For optimization…

Optimization and Control · Mathematics 2026-01-06 Felix Engelhardt , Jannis Kurtz , Ş. İlker Birbil , Ted Ralphs

Counterexample-Driven Synthesis for Probabilistic Program Sketches

Probabilistic programs are key to deal with uncertainty in e.g. controller synthesis. They are typically small but intricate. Their development is complex and error prone requiring quantitative reasoning over a myriad of alternative…

Software Engineering · Computer Science 2019-04-30 Milan Češka , Christian Hensel , Sebastian Junges , Joost-Pieter Katoen

Explaining Counterexamples with Giant-Step Assertion Checking

Identifying the cause of a proof failure during deductive verification of programs is hard: it may be due to an incorrectness in the program, an incompleteness in the program annotations, or an incompleteness of the prover. The changes…

Logic in Computer Science · Computer Science 2021-08-09 Benedikt Becker , Cláudio Belo Lourenço , Claude Marché

A Program Instrumentation Framework for Automatic Verification

In deductive verification and software model checking, dealing with certain specification language constructs can be problematic when the back-end solver is not sufficiently powerful or lacks the required theories. One way to deal with this…

Logic in Computer Science · Computer Science 2024-12-10 Jesper Amilon , Zafer Esen , Dilian Gurov , Christian Lidström , Philipp Rümmer , Marten Voorberg

Bounded Model Checking and Feature Omission Diversity

In this paper we introduce a novel way to speed up the discovery of counterexamples in bounded model checking, based on parallel runs over versions of a system in which features have been randomly disabled. As shown in previous work, adding…

Software Engineering · Computer Science 2016-10-26 Mohammad Amin Alipour , Alex Groce

Towards Counterfactual Explanation and Assertion Inference for CPS Debugging

Verification and validation of cyber-physical systems (CPS) via large-scale simulation often surface failures that are hard to interpret, especially when triggered by interactions between continuous and discrete behaviors at specific events…

Software Engineering · Computer Science 2026-04-10 Zaid Ghazal , Hadiza Yusuf , Khouloud Gaaloul

A Counterexample Guided Abstraction-Refinement Framework for Markov Decision Processes

The main challenge in using abstractions effectively, is to construct a suitable abstraction for the system being verified. One approach that tries to address this problem is that of {\it counterexample guided abstraction-refinement…

Software Engineering · Computer Science 2008-07-09 Rohit Chadha , Mahesh Viswanthan

Verified Training for Counterfactual Explanation Robustness under Data Shift

Counterfactual explanations (CEs) enhance the interpretability of machine learning models by describing what changes to an input are necessary to change its prediction to a desired class. These explanations are commonly used to guide users'…

Machine Learning · Computer Science 2024-03-07 Anna P. Meyer , Yuhao Zhang , Aws Albarghouthi , Loris D'Antoni