English

STAB: Specification-driven Testing for Algorithmic Bottlenecks

Artificial Intelligence 2026-05-28 v1

Abstract

Evaluating the efficiency of algorithmic code requires test cases that expose runtime bottlenecks. Previous methods generate efficiency test cases either by increasing input size or by generating code-specific inputs that make the given implementation run slowly. Consequently, they do not address the structural input conditions that drive the algorithmic worst case. We introduce STAB, a specification-driven pipeline that generates test cases that expose algorithmic bottlenecks from a natural-language problem specification alone. STAB separates the task into constraint-bound maximization and adversarial structure injection. (i) The constraint saturator extracts constraints and resolves large admissible size assignments using rule-based saturation and CP-SAT optimization over related variables. (ii) The adversarial scenario injector retrieves implementation-level adversarial construction principles from a curated scenario catalog using keyword matching and K-nearest neighbors (KNN). STAB encodes the problem specification, resolved boundary, and retrieved construction principles into a structured generation specification, from which the LLM synthesizes a Python test case generator. On CodeContests, STAB raises the rate of generated test cases that expose algorithmic bottlenecks from 50.43% to 73.45% on average across open-source LLMs and from 57.45% to 71.85% on average across closed-source LLMs, with consistent gains across Python, Java, and C++. Our code is available at https://github.com/suhanmen/STAB.

Keywords

Cite

@article{arxiv.2605.27981,
  title  = {STAB: Specification-driven Testing for Algorithmic Bottlenecks},
  author = {Soohan Lim and Joonghyuk Hahn and Hyundong Jin and Yo-Sub Han},
  journal= {arXiv preprint arXiv:2605.27981},
  year   = {2026}
}

Comments

16 pages, 5 figures, 8 tables