Related papers: Separating Principles Below WKL0

Iterative forcing and hyperimmunity in reverse mathematics

The separation between two theorems in reverse mathematics is usually done by constructing a Turing ideal satisfying a theorem P and avoiding the solutions to a fixed instance of a theorem Q. Lerman, Solomon and Towsner introduced a forcing…

Logic · Mathematics 2015-03-13 Ludovic Patey

Ramsey-type graph coloring and diagonal non-computability

A function is diagonally non-computable (d.n.c.) if it diagonalizes against the universal partial computable function. D.n.c. functions play a central role in algorithmic randomness and reverse mathematics. Flood and Towsner asked for which…

Logic · Mathematics 2014-12-03 Ludovic Patey

Constructing Sequences One Step at a Time

We propose a new method for constructing Turing ideals satisfying principles of reverse mathematics below the Chain-Antichain Principle (CAC). Using this method, we are able to prove several new separations in the presence of Weak Konig's…

Logic · Mathematics 2018-10-05 Henry Towsner

Formal Theorem Proving by Rewarding LLMs to Decompose Proofs Hierarchically

Mathematical theorem proving is an important testbed for large language models' deep and abstract reasoning capability. This paper focuses on improving LLMs' ability to write proofs in formal languages that permit automated proof…

Machine Learning · Computer Science 2024-11-05 Kefan Dong , Arvind Mahankali , Tengyu Ma

Replacing Multi-Step Assembly of Data Preparation Pipelines with One-Step LLM Pipeline Generation for Table QA

Table Question Answering (TQA) aims to answer natural language questions over structured tables. Large Language Models (LLMs) enable promising solutions to this problem, with operator-centric solutions that generate table manipulation…

Databases · Computer Science 2026-04-02 Fengyu Li , Junhao Zhu , Kaishi Song , Lu Chen , Zhongming Yao , Tianyi Li , Christian S. Jensen

NRCL - A Model Building Approach to the Bernays-Sch\"onfinkel Fragment (Full Paper)

We combine constrained literals for model representation with key concepts from first-order superposition and propositional conflict-driven clause learning (CDCL) to create the new calculus Non-Redundant Clause Learning (NRCL) deciding the…

Logic in Computer Science · Computer Science 2015-07-21 Gábor Alagi , Christoph Weidenbach

RL Grokking Recipe: How Does RL Unlock and Transfer New Algorithms in LLMs?

It remains an open question whether LLMs can acquire or generalize genuinely new reasoning strategies, beyond the sharpened skills encoded in their parameters during pre-training or post-training. To attempt to answer this debate, we…

Machine Learning · Computer Science 2025-10-07 Yiyou Sun , Yuhan Cao , Pohao Huang , Haoyue Bai , Hannaneh Hajishirzi , Nouha Dziri , Dawn Song

Model-theoretic Forcing in Transition Algebra

We study L\"owenheim-Skolem and Omitting Types theorems in Transition Algebra, a logical system obtained by enhancing many sorted first-order logic with features from dynamic logic. The sentences we consider include compositions, unions,…

Logic in Computer Science · Computer Science 2025-09-03 Go Hashimoto , Daniel Găină

Rule Extrapolation in Language Models: A Study of Compositional Generalization on OOD Prompts

LLMs show remarkable emergent abilities, such as inferring concepts from presumably out-of-distribution prompts, known as in-context learning. Though this success is often attributed to the Transformer architecture, our systematic…

Computation and Language · Computer Science 2024-10-25 Anna Mészáros , Szilvia Ujváry , Wieland Brendel , Patrik Reizinger , Ferenc Huszár

Interpretable Two-level Boolean Rule Learning for Classification

This paper proposes algorithms for learning two-level Boolean rules in Conjunctive Normal Form (CNF, i.e. AND-of-ORs) or Disjunctive Normal Form (DNF, i.e. OR-of-ANDs) as a type of human-interpretable classification model, aiming for a…

Machine Learning · Computer Science 2015-11-24 Guolong Su , Dennis Wei , Kush R. Varshney , Dmitry M. Malioutov

TernaryLLM: Ternarized Large Language Model

Large language models (LLMs) have achieved remarkable performance on Natural Language Processing (NLP) tasks, but they are hindered by high computational costs and memory requirements. Ternarization, an extreme form of quantization, offers…

Machine Learning · Computer Science 2024-06-12 Tianqi Chen , Zhe Li , Weixiang Xu , Zeyu Zhu , Dong Li , Lu Tian , Emad Barsoum , Peisong Wang , Jian Cheng

Learning nonparametric ordinary differential equations from noisy data

Learning nonparametric systems of Ordinary Differential Equations (ODEs) dot x = f(t,x) from noisy data is an emerging machine learning topic. We use the well-developed theory of Reproducing Kernel Hilbert Spaces (RKHS) to define candidates…

Machine Learning · Statistics 2023-11-14 Kamel Lahouel , Michael Wells , Victor Rielly , Ethan Lew , David Lovitz , Bruno M. Jedynak

Understanding and Mitigating Classification Errors Through Interpretable Token Patterns

State-of-the-art NLP methods achieve human-like performance on many tasks, but make errors nevertheless. Characterizing these errors in easily interpretable terms gives insight into whether a classifier is prone to making systematic errors,…

Computation and Language · Computer Science 2023-11-21 Michael A. Hedderich , Jonas Fischer , Dietrich Klakow , Jilles Vreeken

Unsupervised learning of disentangled representations in deep restricted kernel machines with orthogonality constraints

We introduce Constr-DRKM, a deep kernel method for the unsupervised learning of disentangled data representations. We propose augmenting the original deep restricted kernel machine formulation for kernel PCA by orthogonality constraints on…

Machine Learning · Computer Science 2020-12-01 Francesco Tonin , Panagiotis Patrinos , Johan A. K. Suykens

Generalized Effective Reducibility

We introduce two notions of effective reducibility for set-theoretical statements, based on computability with Ordinal Turing Machines (OTMs), one of which resembles Turing reducibility while the other is modelled after Weihrauch…

Logic · Mathematics 2026-05-19 Merlin Carl

The strength of the tree theorem for pairs in reverse mathematics

No natural principle is currently known to be strictly between the arithmetic comprehension axiom (ACA) and Ramsey's theorem for pairs (RT^2_2) in reverse mathematics. The tree theorem for pairs (TT^2_2) is however a good candidate. The…

Logic · Mathematics 2015-12-16 Ludovic Patey

LLM-NEO: Parameter Efficient Knowledge Distillation for Large Language Models

Knowledge distillation (KD) has been a predominant method for compressing Large Language Models (LLMs). In this paper, we first revisit KD and Low-Rank Adaption (LoRA) and demonstrate that they follow the same paradigm. Inspired by this…

Computation and Language · Computer Science 2025-02-26 Runming Yang , Taiqiang Wu , Jiahao Wang , Pengfei Hu , Yik-Chung Wu , Ngai Wong , Yujiu Yang

Reasoning Graph Enhanced Exemplars Retrieval for In-Context Learning

Large language models (LLMs) have exhibited remarkable few-shot learning capabilities and unified the paradigm of NLP tasks through the in-context learning (ICL) technique. Despite the success of ICL, the quality of the exemplar…

Computation and Language · Computer Science 2024-12-13 Yukang Lin , Bingchen Zhong , Shuoran Jiang , Joanna Siebert , Qingcai Chen

Reverse Transition Kernel: A Flexible Framework to Accelerate Diffusion Inference

To generate data from trained diffusion models, most inference algorithms, such as DDPM, DDIM, and other variants, rely on discretizing the reverse SDEs or their equivalent ODEs. In this paper, we view such approaches as decomposing the…

Machine Learning · Statistics 2024-05-28 Xunpeng Huang , Difan Zou , Hanze Dong , Yi Zhang , Yi-An Ma , Tong Zhang

One Class Restricted Kernel Machines

Restricted kernel machines (RKMs) have demonstrated a significant impact in enhancing generalization ability in the field of machine learning. Recent studies have introduced various methods within the RKM framework, combining kernel…

Machine Learning · Computer Science 2025-02-18 A. Quadir , M. Sajid , M. Tanveer