English

Learning Correct Behavior from Examples: Validating Sequential Execution in Autonomous Agents

Artificial Intelligence 2026-05-06 v1 Software Engineering

Abstract

As autonomous agents become increasingly sophisticated, validating their sequential behavior presents a significant challenge. Traditional testing approaches require manual specification, exact sequence matching, or thousands of training examples. We present a novel algorithm that automatically learns correct behavior from just 2-10 passing execution traces and validates new executions against this learned model. Our approach combines dominator analysis from compiler theory with multimodal large language model-powered semantic understanding to identify essential states and handle non-deterministic behavior. The system constructs a generalized ground truth model using Prefix Tree Acceptors, merges traces through multi-tiered equivalence detection, and validates new executions via topological subsequence matching. In controlled experiments, our system achieved high accuracy in detecting product bugs and false successes using only 3 training traces. This approach provides explainable validation results with coverage metrics and works across diverse domains including UI testing, code generation, and robotic processes.

Keywords

Cite

@article{arxiv.2605.03159,
  title  = {Learning Correct Behavior from Examples: Validating Sequential Execution in Autonomous Agents},
  author = {Reshabh K Sharma and Gaurav Mittal and Yu Hu},
  journal= {arXiv preprint arXiv:2605.03159},
  year   = {2026}
}
R2 v1 2026-07-01T12:49:29.483Z