English

Data-Driven Invariant Learning for Probabilistic Programs

Programming Languages 2025-03-10 v4

Abstract

Morgan and McIver's weakest pre-expectation framework is one of the most well-established methods for deductive verification of probabilistic programs. Roughly, the idea is to generalize binary state assertions to real-valued expectations, which can measure expected values of probabilistic program quantities. While loop-free programs can be analyzed by mechanically transforming expectations, verifying loops usually requires finding an invariant expectation, a difficult task. We propose a new view of invariant expectation synthesis as a regression problem: given an input state, predict the average value of the post-expectation in the output distribution. Guided by this perspective, we develop the first data-driven invariant synthesis method for probabilistic programs. Unlike prior work on probabilistic invariant inference, our approach can learn piecewise continuous invariants without relying on template expectations, and also works with black-box access to the program. We also develop a data-driven approach to learn sub-invariants from data, which can be used to upper- or lower-bound expected values. We implement our approaches and demonstrate their effectiveness on a variety of benchmarks from the probabilistic programming literature.

Keywords

Cite

@article{arxiv.2106.05421,
  title  = {Data-Driven Invariant Learning for Probabilistic Programs},
  author = {Jialu Bao and Nitesh Trivedi and Drashti Pathak and Justin Hsu and Subhajit Roy},
  journal= {arXiv preprint arXiv:2106.05421},
  year   = {2025}
}

Comments

37 pages

R2 v1 2026-06-24T03:02:06.970Z