English
Related papers

Related papers: Term Orders for Optimistic Lambda-Superposition

200 papers

We introduce a reduction order called the weighted path order (WPO) that subsumes many existing reduction orders. WPO compares weights of terms as in the Knuth-Bendix order (KBO), while WPO allows weights to be computed by a wide class of…

Logic in Computer Science · Computer Science 2014-04-25 Akihisa Yamada , Keiichirou Kusakari , Toshiki Sakabe

The superposition calculus for reasoning in first-order logic with equality relies on simplification orderings on terms. Modern saturation provers use the Knuth-Bendix order (KBO) and the lexicographic path order (LPO) for discovering…

Logic in Computer Science · Computer Science 2025-05-29 Márton Hajdu , Robin Coutelier , Laura Kovács , Andrei Voronkov

The $\lambda$-superposition calculus is a successful approach to proving higher-order formulas. However, some parts of the calculus are extremely explosive, notably due to the higher-order unifier enumeration and the functional…

Logic in Computer Science · Computer Science 2025-10-22 Alexander Bentkamp , Jasmin Blanchette , Matthias Hetzenberger , Uwe Waldmann

We introduce refutationally complete superposition calculi for intentional and extensional clausal $\lambda$-free higher-order logic, two formalisms that allow partial application and applied variables. The calculi are parameterized by a…

Logic in Computer Science · Computer Science 2023-06-22 Alexander Bentkamp , Jasmin Blanchette , Simon Cruanes , Uwe Waldmann

This paper presents two new approaches to prove termination of rewrite systems with the Knuth-Bendix order efficiently. The constraints for the weight function and for the precedence are encoded in (pseudo-)propositional logic and the…

Symbolic Computation · Computer Science 2007-05-23 Harald Zankl , Aart Middeldorp

To adapt large language models (LLMs) to ranking tasks, existing list-wise methods, represented by list-wise Direct Preference Optimization (DPO), focus on optimizing partial-order or full-order list ranking consistency for LLMs to enhance…

Information Retrieval · Computer Science 2025-06-03 Shihao Cai , Chongming Gao , Yang Zhang , Wentao Shi , Jizhi Zhang , Keqin Bao , Qifan Wang , Fuli Feng

We introduce two extensions of the $\lambda$-calculus with a probabilistic choice operator, $\Lambda_\oplus^{cbv}$ and $\Lambda_\oplus^{cbn}$, modeling respectively call-by-value and call-by-name probabilistic computation. We prove that…

Logic in Computer Science · Computer Science 2019-05-13 Claudia Faggian , Simona Ronchi della Rocca

In this paper we present a novel termination order the {\em predicative lexicographic path order} (PLPO for short), a syntactic restriction of the lexicographic path order. As well as lexicographic path orders, several non-trivial primitive…

Logic · Mathematics 2014-06-03 Naohi Eguchi

Recent alignment methods based on Direct Preference Optimization (DPO) reformulate preference learning as supervised optimization over pairwise comparisons, offering improved efficiency and stability over reinforcement learning from human…

Machine Learning · Computer Science 2026-01-22 Yuhui Sun , Xiyao Wang , Zixi Li , YiTian Ding , Tianyang Ling , Jialuo Chen , Tianyi Yu , Zhenlong Yuan , Jinman Zhao

Group Relative Policy Optimization(GRPO) has become a cornerstone of modern reinforcement learning alignment, prized for its efficacy in foregoing an explicit value-critic by leveraging reward normalization across sampled trajectory…

Computation and Language · Computer Science 2026-05-29 Redacted by arXiv

Robust optimization (RO) is one of the key paradigms for solving optimization problems affected by uncertainty. Two principal approaches for RO, the robust counterpart method and the adversarial approach, potentially lead to excessively…

Optimization and Control · Mathematics 2024-09-05 Krzysztof Postek , Shimrit Shtern

We designed a superposition calculus for a clausal fragment of extensional polymorphic higher-order logic that includes anonymous functions but excludes Booleans. The inference rules work on $\beta\eta$-equivalence classes of…

Logic in Computer Science · Computer Science 2021-02-02 Alexander Bentkamp , Jasmin Blanchette , Sophie Tourret , Petar Vukmirović , Uwe Waldmann

This seminar report is concerned with expressing LPO-termination of term rewrite systems as a satisfiability problem in propositional logic. After relevant algorithms are explained, experimental results are reported.

Symbolic Computation · Computer Science 2007-05-23 Harald Zankl

At the forefront of state-of-the-art human alignment methods are preference optimization methods (*PO). Prior research has often concentrated on identifying the best-performing method, typically involving a grid search over hyperparameters,…

Computation and Language · Computer Science 2025-04-30 Kian Ahrabian , Xihui Lin , Barun Patra , Vishrav Chaudhary , Alon Benhaim , Jay Pujara , Xia Song

Lexicographic multi-objective problems, which impose a lexicographic importance order over the objectives, arise in many real-life scenarios. Existing Reinforcement Learning work directly addressing lexicographic tasks has been scarce. The…

Machine Learning · Computer Science 2024-09-05 Alperen Tercan , Vinayak S. Prabhu

First, we extend Leifer-Milner RPO theory, by giving general conditions to obtain IPO labelled transition systems (and bisimilarities) with a reduced set of transitions, and possibly finitely branching. Moreover, we study the weak variant…

Programming Languages · Computer Science 2015-07-01 Pietro Di Gianantonio , Furio Honsell , Marina Lenisa

We define sound and adequate denotational and operational semantics for the stochastic lambda calculus. These two semantic approaches build on previous work that used similar techniques to reason about higher-order probabilistic programs,…

Logic in Computer Science · Computer Science 2021-05-18 Pedro Amorim , Dexter Kozen , Radu Mardare , Prakash Panangaden , Michael Roberts

Large language models (LLMs) require careful alignment to balance competing objectives - factuality, safety, conciseness, proactivity, and diversity. Existing studies focus on individual techniques or specific dimensions, lacking a holistic…

Machine Learning · Computer Science 2025-09-17 Denis Janiak , Julia Moska , Dawid Motyka , Karolina Seweryn , Paweł Walkowiak , Bartosz Żuk , Arkadiusz Janz

This paper aims at carrying out termination proofs for simply typed higher-order calculi automatically by using ordering comparisons. To this end, we introduce the computability path ordering (CPO), a recursive relation on terms obtained by…

Logic in Computer Science · Computer Science 2019-03-14 Frédéric Blanqui , Jean-Pierre Jouannaud , Albert Rubio

The rapid development of large language model (LLM) alignment algorithms has resulted in a complex and fragmented landscape, with limited clarity on the effectiveness of different methods and their inter-connections. This paper introduces…

‹ Prev 1 2 3 10 Next ›