Neural Interactive Proofs

Lewis Hammond; Sam Adam-Day

Neural Interactive Proofs

Artificial Intelligence 2025-03-19 v2 Machine Learning

Authors: Lewis Hammond , Sam Adam-Day

Abstract

We consider the problem of how a trusted, but computationally bounded agent (a 'verifier') can learn to interact with one or more powerful but untrusted agents ('provers') in order to solve a given task. More specifically, we study the case in which agents are represented using neural networks and refer to solutions of this problem as neural interactive proofs. First we introduce a unifying framework based on prover-verifier games, which generalises previously proposed interaction protocols. We then describe several new protocols for generating neural interactive proofs, and provide a theoretical comparison of both new and existing approaches. Finally, we support this theory with experiments in two domains: a toy graph isomorphism problem that illustrates the key ideas, and a code validation task using large language models. In so doing, we aim to create a foundation for future work on neural interactive proofs and their application in building safer AI systems.

Keywords

multi-agent reinforcement learning neural network theorem proving

Cite

@article{arxiv.2412.08897,
  title  = {Neural Interactive Proofs},
  author = {Lewis Hammond and Sam Adam-Day},
  journal= {arXiv preprint arXiv:2412.08897},
  year   = {2025}
}

Comments

ICLR'25 camera-ready version; 51 pages, 17 figures

Neural Interactive Proofs

Abstract

Keywords

Cite

Comments

Related papers