English
Related papers

Related papers: ImProver: Agent-Based Automated Proof Optimization

200 papers

We present Prover Agent, a novel AI agent for automated theorem proving that integrates large language models (LLMs) with a formal proof assistant, Lean. Prover Agent coordinates an informal reasoning LLM, a formal prover model, and…

Artificial Intelligence · Computer Science 2026-02-18 Kaito Baba , Chaoran Liu , Shuhei Kurita , Akiyoshi Sannai

Neural theorem proving has advanced rapidly in the past year, reaching IMO gold-medalist capabilities and producing formal proofs that span thousands of lines. Although such proofs are mechanically verified by formal systems like Lean,…

Machine Learning · Computer Science 2025-10-20 Alex Gu , Bartosz Piotrowski , Fabian Gloeckle , Kaiyu Yang , Aram H. Markosyan

Proof engineering is notoriously labor-intensive: proofs that are straightforward on paper often require lengthy scripts in theorem provers. Recent advances in large language models (LLMs) create new opportunities for proof automation:…

Programming Languages · Computer Science 2026-01-08 Yichen Xu , Martin Odersky

This paper introduces a novel Large Language Models (LLMs)-assisted agent that automatically converts natural-language descriptions of power system optimization scenarios into compact, solver-ready formulations and generates corresponding…

Artificial Intelligence · Computer Science 2025-08-12 Yunkai Hu , Tianqiao Zhao , Meng Yue

General-purpose Large Language Models (LLMs) have achieved remarkable success in intelligence, performing comparably to human experts on complex reasoning tasks such as coding and mathematical reasoning. However, generating formal proofs in…

Formal mathematics libraries are rapidly expanding, creating a growing need to refactor verified proofs for maintainability and to improve training data quality for neural provers. However, scalable proof optimization is hindered by…

Artificial Intelligence · Computer Science 2026-05-25 Riyaz Ahuja , Tate Rowney , Jeremy Avigad , Sean Welleck

We present Lean Refactor, a plug-and-play retrieval-augmented agentic framework for multi-objective, controllable, and version-robust refactoring of Lean proofs. LLM-generated proofs are notoriously correct-but-verbose and brittle across…

Logic in Computer Science · Computer Science 2026-05-21 Jialin Lu , Soonho Kong , Rodrigo Stehling , Kaiyu Yang , Zhangyang Wang , Weiran Sun , Wuyang Chen

Large Language Models (LLMs) demonstrate impressive mathematical reasoning abilities, but their solutions frequently contain errors that cannot be automatically checked. Formal theorem proving systems such as Lean 4 offer automated…

Artificial Intelligence · Computer Science 2026-03-18 Sumanth Varambally , Thomas Voice , Yanchao Sun , Zhifeng Chen , Rose Yu , Ke Ye

We present Ax-Prover, a multi-agent system for automated theorem proving in Lean that can solve problems across diverse scientific domains and operate either autonomously or collaboratively with human experts. To achieve this, Ax-Prover…

To take advantage of Large Language Model in theorem formalization and proof, we propose a reinforcement learning framework to iteratively optimize the pretrained LLM by rolling out next tactics and comparing them with the expected ones.…

Artificial Intelligence · Computer Science 2025-02-14 Zhiling Luo

Large language models (LLMs) often struggle with complex logical reasoning due to logical inconsistencies and the inherent difficulty of such reasoning. We use Lean, a theorem proving framework, to address these challenges. By formalizing…

Computation and Language · Computer Science 2024-03-21 Dongwei Jiang , Marcio Fonseca , Shay B. Cohen

Large Language Models (LLMs) have emerged as powerful tools in mathematical theorem proving, particularly when utilizing formal languages such as LEAN. A prevalent proof method involves the LLM prover iteratively constructing the proof…

Artificial Intelligence · Computer Science 2025-10-22 Zijian Wu , Suozhi Huang , Zhejian Zhou , Huaiyuan Ying , Zheng Yuan , Wenwei Zhang , Dahua Lin , Kai Chen

Large language models (LLMs) are increasingly used in learning algorithms, evaluations, and optimization tasks. Recent studies have shown that using LLM-based optimizers to automatically optimize model prompts, demonstrations, predictions…

Computation and Language · Computer Science 2025-10-23 Guowei Xu , Mert Yuksekgonul , Carlos Guestrin , James Zou

Automated theorem proving is fundamental to formal methods, and the recent trend is to integrate large language models (LLMs) and proof assistants to form effective proof agents. While existing proof agents show promising performance, they…

Software Engineering · Computer Science 2026-04-22 Yican Sun , Chengwei Shi , Hangzhou Lyu , Yingfei Xiong

Formal verification via theorem proving enables the expressive specification and rigorous proof of software correctness, but it is difficult to scale due to the significant manual effort and expertise required. While Large Language Models…

Software Engineering · Computer Science 2025-10-30 Minghai Lu , Zhe Zhou , Danning Xie , Songlin Jia , Benjamin Delaware , Tianyi Zhang

Recent progress in formal theorem proving has benefited from large-scale proof generation and verifier-aware training, but agentic proving is rarely integrated into prover training, appearing only at inference time. We present OProver, a…

Computation and Language · Computer Science 2026-05-19 David Ma , Kaijing Ma , Shawn Guo , Yunfeng Shi , Enduo Zhao , Jiajun Shi , Zhaoxiang Zhang , Gavin Cheung , Jiaheng Liu , Zili Wang

Large language models (LLMs) increasingly excel at mathematical reasoning, but their unreliability limits their utility in mathematics research. A mitigation is using LLMs to generate formal proofs in languages like Lean. We perform the…

Mathematical reasoning and optimization are fundamental to artificial intelligence and computational problem-solving. Recent advancements in Large Language Models (LLMs) have significantly improved AI-driven mathematical reasoning, theorem…

Artificial Intelligence · Computer Science 2025-03-25 Ali Forootani

Large Language Models (LLMs) have demonstrated significant capabilities, particularly in the domain of question answering (QA). However, their effectiveness in QA is often undermined by the vagueness of user questions. To address this…

Computation and Language · Computer Science 2025-02-26 Junhao Chen , Bowen Wang , Zhouqiang Jiang , Yuta Nakashima

Large language models (LLMs) have shown promise in proving formal theorems using proof assistants such as Lean. However, existing methods are difficult to reproduce or build on, due to private code, data, and large compute requirements.…

Machine Learning · Computer Science 2023-10-30 Kaiyu Yang , Aidan M. Swope , Alex Gu , Rahul Chalamala , Peiyang Song , Shixing Yu , Saad Godil , Ryan Prenger , Anima Anandkumar
‹ Prev 1 2 3 10 Next ›