Related papers: Term Orders for Optimistic Lambda-Superposition

A Unified Ordering for Termination Proving

We introduce a reduction order called the weighted path order (WPO) that subsumes many existing reduction orders. WPO compares weights of terms as in the Knuth-Bendix order (KBO), while WPO allows weights to be computed by a wide class of…

Logic in Computer Science · Computer Science 2014-04-25 Akihisa Yamada, Keiichirou Kusakari, Toshiki Sakabe

Term Ordering Diagrams

The superposition calculus for reasoning in first-order logic with equality relies on simplification orderings on terms. Modern saturation provers use the Knuth-Bendix order (KBO) and the lexicographic path order (LPO) for discovering…

Logic in Computer Science · Computer Science 2025-05-29 Márton Hajdu, Robin Coutelier, Laura Kovács, Andrei Voronkov

Optimistic Higher-Order Superposition

The $\lambda$-superposition calculus is a successful approach to proving higher-order formulas. However, some parts of the calculus are extremely explosive, notably due to the higher-order unifier enumeration and the functional…

Logic in Computer Science · Computer Science 2025-10-22 Alexander Bentkamp, Jasmin Blanchette, Matthias Hetzenberger, Uwe Waldmann

Superposition for Lambda-Free Higher-Order Logic

We introduce refutationally complete superposition calculi for intensional and extensional clausal $\lambda$-free higher-order logic, two formalisms that allow partial application and applied variables. The calculi are parameterized by a…

Logic in Computer Science · Computer Science 2023-06-22 Alexander Bentkamp, Jasmin Blanchette, Simon Cruanes, Uwe Waldmann

Satisfying KBO Constraints

This paper presents two new approaches to prove termination of rewrite systems with the Knuth-Bendix order efficiently. The constraints for the weight function and for the precedence are encoded in (pseudo-)propositional logic and the…

Symbolic Computation · Computer Science 2007-05-23 Harald Zankl, Aart Middeldorp

K-order Ranking Preference Optimization for Large Language Models

To adapt large language models (LLMs) to ranking tasks, existing list-wise methods, represented by list-wise Direct Preference Optimization (DPO), focus on optimizing partial-order or full-order list ranking consistency for LLMs to enhance…

Information Retrieval · Computer Science 2025-06-03 Shihao Cai, Chongming Gao, Yang Zhang, Wentao Shi, Jizhi Zhang, Keqin Bao, Qifan Wang, Fuli Feng

Lambda Calculus and Probabilistic Computation

We introduce two extensions of the $\lambda$-calculus with a probabilistic choice operator, $\Lambda_\oplus^{cbv}$ and $\Lambda_\oplus^{cbn}$, modeling respectively call-by-value and call-by-name probabilistic computation. We prove that…

Logic in Computer Science · Computer Science 2019-05-13 Claudia Faggian, Simona Ronchi della Rocca

Predicative Lexicographic Path Orders: An Application of Term Rewriting to the Region of Primitive Recursive Functions

In this paper we present a novel termination order, the predicative lexicographic path order (PLPO for short), a syntactic restriction of the lexicographic path order. As well as lexicographic path orders, several non-trivial primitive…

Logic · Mathematics 2014-06-03 Naohi Eguchi

Listwise Direct Preference Optimization with Multi-Dimensional Preference Mixing

Recent alignment methods based on Direct Preference Optimization (DPO) reformulate preference learning as supervised optimization over pairwise comparisons, offering improved efficiency and stability over reinforcement learning from human…

Machine Learning · Computer Science 2026-01-22 Yuhui Sun, Xiyao Wang, Zixi Li, YiTian Ding, Tianyang Ling, Jialuo Chen, Tianyi Yu, Zhenlong Yuan, Jinman Zhao

LambdaPO: A Lambda Style Policy Optimization for Reasoning Language Models

Group Relative Policy Optimization (GRPO) has become a cornerstone of modern reinforcement learning alignment, prized for its efficacy in foregoing an explicit value-critic by leveraging reward normalization across sampled trajectory…

Computation and Language · Computer Science 2026-05-29 Redacted by arXiv

First-order algorithms for robust optimization problems via convex-concave saddle-point Lagrangian reformulation

Robust optimization (RO) is one of the key paradigms for solving optimization problems affected by uncertainty. Two principal approaches for RO, the robust counterpart method and the adversarial approach, potentially lead to excessively…

Optimization and Control · Mathematics 2024-09-05 Krzysztof Postek, Shimrit Shtern

Superposition with Lambdas

We designed a superposition calculus for a clausal fragment of extensional polymorphic higher-order logic that includes anonymous functions but excludes Booleans. The inference rules work on $\beta\eta$-equivalence classes of…

Logic in Computer Science · Computer Science 2021-02-02 Alexander Bentkamp, Jasmin Blanchette, Sophie Tourret, Petar Vukmirović, Uwe Waldmann

SAT Techniques for Lexicographic Path Orders

This seminar report is concerned with expressing LPO-termination of term rewrite systems as a satisfiability problem in propositional logic. After relevant algorithms are explained, experimental results are reported.

Symbolic Computation · Computer Science 2007-05-23 Harald Zankl

A Practical Analysis of Human Alignment with *PO

At the forefront of state-of-the-art human alignment methods are preference optimization methods (*PO). Prior research has often concentrated on identifying the best-performing method, typically involving a grid search over hyperparameters,…

Computation and Language · Computer Science 2025-04-30 Kian Ahrabian, Xihui Lin, Barun Patra, Vishrav Chaudhary, Alon Benhaim, Jay Pujara, Xia Song

Thresholded Lexicographic Ordered Multiobjective Reinforcement Learning

Lexicographic multi-objective problems, which impose a lexicographic importance order over the objectives, arise in many real-life scenarios. Existing Reinforcement Learning work directly addressing lexicographic tasks has been scarce. The…

Machine Learning · Computer Science 2024-09-05 Alperen Tercan, Vinayak S. Prabhu

RPO, Second-order Contexts, and Lambda-calculus

First, we extend Leifer-Milner RPO theory, by giving general conditions to obtain IPO labelled transition systems (and bisimilarities) with a reduced set of transitions, and possibly finitely branching. Moreover, we study the weak variant…

Programming Languages · Computer Science 2015-07-01 Pietro Di Gianantonio, Furio Honsell, Marina Lenisa

Universal Semantics for the Stochastic Lambda-Calculus

We define sound and adequate denotational and operational semantics for the stochastic lambda calculus. These two semantic approaches build on previous work that used similar techniques to reason about higher-order probabilistic programs,…

Logic in Computer Science · Computer Science 2021-05-18 Pedro Amorim, Dexter Kozen, Radu Mardare, Prakash Panangaden, Michael Roberts

Rethinking the Evaluation of Alignment Methods: Insights into Diversity, Generalisation, and Safety

Large language models (LLMs) require careful alignment to balance competing objectives - factuality, safety, conciseness, proactivity, and diversity. Existing studies focus on individual techniques or specific dimensions, lacking a holistic…

Machine Learning · Computer Science 2025-09-17 Denis Janiak, Julia Moska, Dawid Motyka, Karolina Seweryn, Paweł Walkowiak, Bartosz Żuk, Arkadiusz Janz

The computability path ordering

This paper aims at carrying out termination proofs for simply typed higher-order calculi automatically by using ordering comparisons. To this end, we introduce the computability path ordering (CPO), a recursive relation on terms obtained by…

Logic in Computer Science · Computer Science 2019-03-14 Frédéric Blanqui, Jean-Pierre Jouannaud, Albert Rubio

Reward-aware Preference Optimization: A Unified Mathematical Framework for Model Alignment

The rapid development of large language model (LLM) alignment algorithms has resulted in a complex and fragmented landscape, with limited clarity on the effectiveness of different methods and their inter-connections. This paper introduces…

Machine Learning · Computer Science 2025-02-11 Shengyang Sun, Yian Zhang, Alexander Bukharin, David Mosallanezhad, Jiaqi Zeng, Soumye Singhal, Gerald Shen, Adithya Renduchintala, Tugrul Konuk, Yi Dong, Zhilin Wang, Dmitry Chichkov, Olivier Delalleau, Oleksii Kuchaiev