Related papers: CSLib: The Lean Computer Science Library

Computer Science as Infrastructure: the Spine of the Lean Computer Science Library (CSLib)

Following in the footsteps of the success of Mathlib - the centralised library of formalised mathematics in Lean - CSLib is a rapidly-growing centralised library of formalised computer science and software. In this paper, we present its…

Logic in Computer Science · Computer Science 2026-02-18 Christopher Henson , Fabrizio Montesi

The Lean mathematical library

This paper describes mathlib, a community-driven effort to build a unified library of mathematics formalized in the Lean proof assistant. Among proof assistant libraries, it is distinguished by its dependently typed foundations, focus on…

Logic in Computer Science · Computer Science 2020-01-28 The mathlib Community

Maintaining a Library of Formal Mathematics

The Lean mathematical library mathlib is developed by a community of users with very different backgrounds and levels of experience. To lower the barrier of entry for contributors and to lessen the burden of reviewing contributions, we have…

Programming Languages · Computer Science 2020-07-28 Floris van Doorn , Gabriel Ebner , Robert Y. Lewis

CSLibPremiseBench: Structure-Guided Premise Retrieval and Label Robustness for Lean 4 Computer-Science Theorems

CSLib is an emerging Lean 4 library for computer-science formalization, but its premise-retrieval behavior is not well represented by broad mathematical theorem-proving benchmarks. We introduce CSLibPremiseBench, a reproducible…

Logic in Computer Science · Computer Science 2026-05-15 Junye Ji

A Semantic Search Engine for Mathlib4

The interactive theorem prover Lean enables the verification of formal mathematical proofs and is backed by an expanding community. Central to this ecosystem is its mathematical library, mathlib4, which lays the groundwork for the…

Information Retrieval · Computer Science 2025-02-05 Guoxiong Gao , Haocheng Ju , Jiedong Jiang , Zihan Qin , Bin Dong

VeriSoftBench: Repository-Scale Formal Verification Benchmarks for Lean

Large language models have achieved striking results in interactive theorem proving, particularly in Lean. However, most benchmarks for LLM-based proof automation are drawn from mathematics in the Mathlib ecosystem, whereas proofs in…

Software Engineering · Computer Science 2026-02-23 Yutong Xin , Qiaochu Chen , Greg Durrett , Işil Dillig

Structuring Definitions in Mathematical Libraries

Codifying mathematical theories in a proof assistant or computer algebra system is a challenging task, of which the most difficult part is, counterintuitively, structuring definitions. This results in a steep learning curve for new users…

Symbolic Computation · Computer Science 2025-11-19 Alena Gusakov , Peter Nelson , Stephen Watt

The Chase in Lean -- Crafting a Formal Library for Existential Rule Research

The chase is a sound, complete, but possibly non-terminating algorithm for reasoning with existential rules (aka. tuple-generating dependencies), a highly expressive knowledge representation language. Although the procedure appears simple,…

Logic in Computer Science · Computer Science 2026-04-27 Lukas Gerlach

Lean Meets Theoretical Computer Science: Scalable Synthesis of Theorem Proving Challenges in Formal-Informal Pairs

Formal theorem proving (FTP) has emerged as a critical foundation for evaluating the reasoning capabilities of large language models, enabling automated verification of mathematical proofs at scale. However, progress has been constrained by…

Logic in Computer Science · Computer Science 2026-05-19 Terry Jingchen Zhang , Wenyuan Jiang , Rongchuan Liu , Yisong Wang , Junran Yang , Ning Wang , Nicole Ni , Yinya Huang , Mrinmaya Sachan

Lean Copilot: Large Language Models as Copilots for Theorem Proving in Lean

Neural theorem proving combines large language models (LLMs) with proof assistants such as Lean, where the correctness of formal proofs can be rigorously verified, leaving no room for hallucination. With existing neural theorem provers…

Artificial Intelligence · Computer Science 2025-05-13 Peiyang Song , Kaiyu Yang , Anima Anandkumar

CLEVER: A Curated Benchmark for Formally Verified Code Generation

We introduce ${\rm C{\small LEVER}}$, a high-quality, curated benchmark of 161 problems for end-to-end verified code generation in Lean. Each problem consists of (1) the task of generating a specification that matches a held-out…

Machine Learning · Computer Science 2025-10-24 Amitayush Thakur , Jasper Lee , George Tsoukalas , Meghana Sistla , Matthew Zhao , Stefan Zetzsche , Greg Durrett , Yisong Yue , Swarat Chaudhuri

sciwrite-lint: Verification Infrastructure for the Age of Science Vibe-Writing

Scientific papers make claims about prior work backed by citations. Verifying those citations at scale (that each cited paper exists, says what the citation claims, and is itself reliable) is structurally beyond what human review can…

Digital Libraries · Computer Science 2026-05-26 Sergey V Samsonau

MathlibLemma: Folklore Lemma Generation and Benchmark for Formal Mathematics

While the ecosystem of Lean and Mathlib has enjoyed celebrated success in formal mathematical reasoning with the help of large language models (LLMs), the absence of many folklore lemmas in Mathlib remains a persistent barrier that limits…

Logic in Computer Science · Computer Science 2026-05-28 Xinyu Liu , Zixuan Xie , Amir Moeini , Claire Chen , Shuze Daniel Liu , Yu Meng , Aidong Zhang , Shangtong Zhang

Mathematical Formalized Problem Solving and Theorem Proving in Different Fields in Lean 4

Formalizing mathematical proofs using computerized verification languages like Lean 4 has the potential to significantly impact the field of mathematics, it offers prominent capabilities for advancing mathematical reasoning. However,…

Computation and Language · Computer Science 2024-11-11 Xichen Tang

Lean4Lean: Verifying a Typechecker for Lean, in Lean

In this paper we present a new "external checker" for the Lean theorem prover, written in Lean itself. This is the first complete typechecker for Lean 4 other than the reference implementation in C++ used by Lean itself, and our new checker…

Programming Languages · Computer Science 2025-09-16 Mario Carneiro

The continuous functional calculus in Lean

The continuous functional calculus is perhaps the most fundamental construction in the theory of operator algebras, especially $C^{*}$-algebras. Here we document our formalization of the continuous functional calculus in Lean, which…

Operator Algebras · Mathematics 2025-01-28 Anatole Dedecker , Jireh Loreaux

ZFLean: a framework for set-level mathematics in Lean

We present ZFLean, a Lean 4 library for doing core mathematics inside a model of ZFC with the ergonomics expected of typed Mathlib developments. Building on Mathlib's ZFC model, we contribute a relational calculus for sets with rewriting…

Logic in Computer Science · Computer Science 2026-04-28 Vincent Trélat

QLib - A Matlab Package for Quantum Information Theory Calculations with Applications

Developing intuition about quantum information theory problems is difficult, as is verifying or ruling-out of hypothesis. We present a Matlab package intended to provide the QIT community with a new and powerful tool-set for quantum…

Quantum Physics · Physics 2007-08-06 Shai Machnes

A bi-directional extensible interface between Lean and Mathematica

We implement a user-extensible ad hoc connection between the Lean proof assistant and the computer algebra system Mathematica. By reflecting the syntax of each system in the other and providing a flexible interface for extending…

Logic in Computer Science · Computer Science 2021-01-20 Robert Y. Lewis , Minchao Wu

CSPLib: Twenty Years On

In 1999, we introduced CSPLib, a benchmark library for the constraints community. Our CP-1999 poster paper about CSPLib discussed the advantages and disadvantages of building such a library. Unlike some other domains such as theorem…

Artificial Intelligence · Computer Science 2019-10-01 Ian Gent , Toby Walsh