Related papers: ImProver: Agent-Based Automated Proof Optimization

Prover Agent: An Agent-Based Framework for Formal Mathematical Proofs

We present Prover Agent, a novel AI agent for automated theorem proving that integrates large language models (LLMs) with a formal proof assistant, Lean. Prover Agent coordinates an informal reasoning LLM, a formal prover model, and…

Artificial Intelligence · Computer Science 2026-02-18 Kaito Baba , Chaoran Liu , Shuhei Kurita , Akiyoshi Sannai

ProofOptimizer: Training Language Models to Simplify Proofs without Human Demonstrations

Neural theorem proving has advanced rapidly in the past year, reaching IMO gold-medalist capabilities and producing formal proofs that span thousands of lines. Although such proofs are mechanically verified by formal systems like Lean,…

Machine Learning · Computer Science 2025-10-20 Alex Gu , Bartosz Piotrowski , Fabian Gloeckle , Kaiyu Yang , Aram H. Markosyan

Agentic Proof Automation: A Case Study

Proof engineering is notoriously labor-intensive: proofs that are straightforward on paper often require lengthy scripts in theorem provers. Recent advances in large language models (LLMs) create new opportunities for proof automation:…

Programming Languages · Computer Science 2026-01-08 Yichen Xu , Martin Odersky

From Natural Language to Solver-Ready Power System Optimization: An LLM-Assisted, Validation-in-the-Loop Framework

This paper introduces a novel Large Language Models (LLMs)-assisted agent that automatically converts natural-language descriptions of power system optimization scenarios into compact, solver-ready formulations and generates corresponding…

Artificial Intelligence · Computer Science 2025-08-12 Yunkai Hu , Tianqiao Zhao , Meng Yue

Solving Formal Math Problems by Decomposition and Iterative Reflection

General-purpose Large Language Models (LLMs) have achieved remarkable success in intelligence, performing comparably to human experts on complex reasoning tasks such as coding and mathematical reasoning. However, generating formal proofs in…

Artificial Intelligence · Computer Science 2025-07-31 Yichi Zhou , Jianqiu Zhao , Yongxin Zhang , Bohan Wang , Siran Wang , Luoxin Chen , Jiahui Wang , Haowei Chen , Allan Jie , Xinbo Zhang , Haocheng Wang , Luong Trung , Rong Ye , Phan Nhat Hoang , Huishuai Zhang , Peng Sun , Hang Li

ImProver 2: Iteratively Self-Improving LMs for Neurosymbolic Proof Optimization

Formal mathematics libraries are rapidly expanding, creating a growing need to refactor verified proofs for maintainability and to improve training data quality for neural provers. However, scalable proof optimization is hindered by…

Artificial Intelligence · Computer Science 2026-05-25 Riyaz Ahuja , Tate Rowney , Jeremy Avigad , Sean Welleck

Lean Refactor: Multi-Objective Controllable Proof Optimization via Agentic Strategy Search

We present Lean Refactor, a plug-and-play retrieval-augmented agentic framework for multi-objective, controllable, and version-robust refactoring of Lean proofs. LLM-generated proofs are notoriously correct-but-verbose and brittle across…

Logic in Computer Science · Computer Science 2026-05-21 Jialin Lu , Soonho Kong , Rodrigo Stehling , Kaiyu Yang , Zhangyang Wang , Weiran Sun , Wuyang Chen

Hilbert: Recursively Building Formal Proofs with Informal Reasoning

Large Language Models (LLMs) demonstrate impressive mathematical reasoning abilities, but their solutions frequently contain errors that cannot be automatically checked. Formal theorem proving systems such as Lean 4 offer automated…

Artificial Intelligence · Computer Science 2026-03-18 Sumanth Varambally , Thomas Voice , Yanchao Sun , Zhifeng Chen , Rose Yu , Ke Ye

Ax-Prover: A Deep Reasoning Agentic Framework for Theorem Proving in Mathematics and Quantum Physics

We present Ax-Prover, a multi-agent system for automated theorem proving in Lean that can solve problems across diverse scientific domains and operate either autonomously or collaboratively with human experts. To achieve this, Ax-Prover…

Artificial Intelligence · Computer Science 2026-05-25 Benjamin Breen , Marco Del Tredici , Jacob McCarran , Javier Aspuru Mijares , Weichen Winston Yin , Kfir Sulimany , Jacob M. Taylor , Frank H. L. Koppens , Dirk Englund

Reinforced Large Language Model is a formal theorem prover

To take advantage of Large Language Model in theorem formalization and proof, we propose a reinforcement learning framework to iteratively optimize the pretrained LLM by rolling out next tactics and comparing them with the expected ones.…

Artificial Intelligence · Computer Science 2025-02-14 Zhiling Luo

LeanReasoner: Boosting Complex Logical Reasoning with Lean

Large language models (LLMs) often struggle with complex logical reasoning due to logical inconsistencies and the inherent difficulty of such reasoning. We use Lean, a theorem proving framework, to address these challenges. By formalizing…

Computation and Language · Computer Science 2024-03-21 Dongwei Jiang , Marcio Fonseca , Shay B. Cohen

InternLM2.5-StepProver: Advancing Automated Theorem Proving via Critic-Guided Search

Large Language Models (LLMs) have emerged as powerful tools in mathematical theorem proving, particularly when utilizing formal languages such as LEAN. A prevalent proof method involves the LLM prover iteratively constructing the proof…

Artificial Intelligence · Computer Science 2025-10-22 Zijian Wu , Suozhi Huang , Zhejian Zhou , Huaiyuan Ying , Zheng Yuan , Wenwei Zhang , Dahua Lin , Kai Chen

metaTextGrad: Automatically optimizing language model optimizers

Large language models (LLMs) are increasingly used in learning algorithms, evaluations, and optimization tasks. Recent studies have shown that using LLM-based optimizers to automatically optimize model prompts, demonstrations, predictions…

Computation and Language · Computer Science 2025-10-23 Guowei Xu , Mert Yuksekgonul , Carlos Guestrin , James Zou

On Reasoning-Centric LLM-based Automated Theorem Proving

Automated theorem proving is fundamental to formal methods, and the recent trend is to integrate large language models (LLMs) and proof assistants to form effective proof agents. While existing proof agents show promising performance, they…

Software Engineering · Computer Science 2026-04-22 Yican Sun , Chengwei Shi , Hangzhou Lyu , Yingfei Xiong

Adaptive Proof Refinement with LLM-Guided Strategy Selection

Formal verification via theorem proving enables the expressive specification and rigorous proof of software correctness, but it is difficult to scale due to the significant manual effort and expertise required. While Large Language Models…

Software Engineering · Computer Science 2025-10-30 Minghai Lu , Zhe Zhou , Danning Xie , Songlin Jia , Benjamin Delaware , Tianyi Zhang

OProver: A Unified Framework for Agentic Formal Theorem Proving

Recent progress in formal theorem proving has benefited from large-scale proof generation and verifier-aware training, but agentic proving is rarely integrated into prover training, appearing only at inference time. We present OProver, a…

Computation and Language · Computer Science 2026-05-19 David Ma , Kaijing Ma , Shawn Guo , Yunfeng Shi , Enduo Zhao , Jiajun Shi , Zhaoxiang Zhang , Gavin Cheung , Jiaheng Liu , Zili Wang

Advancing Mathematics Research with AI-Driven Formal Proof Search

Large language models (LLMs) increasingly excel at mathematical reasoning, but their unreliability limits their utility in mathematics research. A mitigation is using LLMs to generate formal proofs in languages like Lean. We perform the…

Artificial Intelligence · Computer Science 2026-05-22 George Tsoukalas , Anton Kovsharov , Sergey Shirobokov , Anja Surina , Moritz Firsching , Gergely Bérczi , Francisco J. R. Ruiz , Arun Suggala , Adam Zsolt Wagner , Eric Wieser , Lei Yu , Aja Huang , Miklós Z. Horváth , Andrew Ferrauiolo , Henryk Michalewski , Codrut Grosu , Thomas Hubert , Matej Balog , Pushmeet Kohli , Swarat Chaudhuri

A Survey on Mathematical Reasoning and Optimization with Large Language Models

Mathematical reasoning and optimization are fundamental to artificial intelligence and computational problem-solving. Recent advancements in Large Language Models (LLMs) have significantly improved AI-driven mathematical reasoning, theorem…

Artificial Intelligence · Computer Science 2025-03-25 Ali Forootani

Putting People in LLMs' Shoes: Generating Better Answers via Question Rewriter

Large Language Models (LLMs) have demonstrated significant capabilities, particularly in the domain of question answering (QA). However, their effectiveness in QA is often undermined by the vagueness of user questions. To address this…

Computation and Language · Computer Science 2025-02-26 Junhao Chen , Bowen Wang , Zhouqiang Jiang , Yuta Nakashima

LeanDojo: Theorem Proving with Retrieval-Augmented Language Models

Large language models (LLMs) have shown promise in proving formal theorems using proof assistants such as Lean. However, existing methods are difficult to reproduce or build on, due to private code, data, and large compute requirements.…

Machine Learning · Computer Science 2023-10-30 Kaiyu Yang , Aidan M. Swope , Alex Gu , Rahul Chalamala , Peiyang Song , Shixing Yu , Saad Godil , Ryan Prenger , Anima Anandkumar