Related papers: RECKONING: Reasoning through Dynamic Knowledge Enc…

Enhanced and Efficient Reasoning in Large Learning Models

In current Large Language Models we can trust the production of smoothly flowing prose on the basis of the principles of machine learning. However, there is no comparably principled basis to justify trust in the content of the text…

Artificial Intelligence · Computer Science 2026-05-15 Leslie G. Valiant

RecaLLM: Addressing the Lost-in-Thought Phenomenon with Explicit In-Context Retrieval

We propose RecaLLM, a set of reasoning language models post-trained to make effective use of long-context information. In-context retrieval, which identifies relevant evidence from context, and reasoning are deeply intertwined: retrieval…

Computation and Language · Computer Science 2026-04-13 Kyle Whitecross , Negin Rahimi

MixReasoning: Switching Modes to Think

Reasoning models enhance performance by tackling problems in a step-by-step manner, decomposing them into sub-problems and exploring long chains of thought before producing an answer. However, applying extended reasoning to every step…

Artificial Intelligence · Computer Science 2025-10-08 Haiquan Lu , Gongfan Fang , Xinyin Ma , Qi Li , Xinchao Wang

Decoupling Knowledge and Reasoning in LLMs: An Exploration Using Cognitive Dual-System Theory

While large language models (LLMs) leverage both knowledge and reasoning during inference, the capacity to distinguish between them plays a pivotal role in model analysis, interpretability, and development. Inspired by dual-system cognitive…

Artificial Intelligence · Computer Science 2025-07-25 Mutian Yang , Jiandong Gao , Ji Wu

Reasoning in Dialog: Improving Response Generation by Context Reading Comprehension

In multi-turn dialog, utterances do not always take the full form of sentences \cite{Carbonell1983DiscoursePA}, which naturally makes understanding the dialog context more difficult. However, it is essential to fully grasp the dialog…

Computation and Language · Computer Science 2020-12-15 Xiuying Chen , Zhi Cui , Jiayi Zhang , Chen Wei , Jianwei Cui , Bin Wang , Dongyan Zhao , Rui Yan

Scaling Competence, Shrinking Reasoning: Cognitive Signatures in Language Model Learning

We analyze reasoning in language models during task-specific fine-tuning and draws parallel between reasoning tokens--intermediate steps generated while solving problem and the human working memory. Drawing from cognitive science, we align…

Computation and Language · Computer Science 2025-12-01 Mukul Singh , Ananya Singha , Arjun Radhakrishna , Sumit Gulwani

Learn Beyond The Answer: Training Language Models with Reflection for Mathematical Reasoning

Supervised fine-tuning enhances the problem-solving abilities of language models across various mathematical reasoning tasks. To maximize such benefits, existing research focuses on broadening the training set with various data augmentation…

Computation and Language · Computer Science 2024-10-08 Zhihan Zhang , Tao Ge , Zhenwen Liang , Wenhao Yu , Dian Yu , Mengzhao Jia , Dong Yu , Meng Jiang

Answering Unseen Questions With Smaller Language Models Using Rationale Generation and Dense Retrieval

When provided with sufficient explanatory context, smaller Language Models have been shown to exhibit strong reasoning ability on challenging short-answer question-answering tasks where the questions are unseen in training. We evaluate two…

Computation and Language · Computer Science 2023-10-16 Tim Hartill , Diana Benavides-Prado , Michael Witbrock , Patricia J. Riddle

Disentangling Recall and Reasoning in Transformer Models through Layer-wise Attention and Activation Analysis

Transformer-based language models excel at both recall (retrieving memorized facts) and reasoning (performing multi-step inference), but whether these abilities rely on distinct internal mechanisms remains unclear. Distinguishing recall…

Machine Learning · Computer Science 2026-03-16 Harshwardhan Fartale , Ashish Kattamuri , Rahul Raja , Arpita Vats , Ishita Prasad , Akshata Kishore Moharir

PERK: Long-Context Reasoning as Parameter-Efficient Test-Time Learning

Long-context reasoning requires accurately identifying relevant information in extensive, noisy input contexts. Previous research shows that using test-time learning to encode context directly into model parameters can effectively enable…

Computation and Language · Computer Science 2026-01-01 Zeming Chen , Angelika Romanou , Gail Weiss , Antoine Bosselut

Reasoning-Grounded Natural Language Explanations for Language Models

We propose a large language model explainability technique for obtaining faithful natural language explanations by grounding the explanations in a reasoning process. When converted to a sequence of tokens, the outputs of the reasoning…

Machine Learning · Computer Science 2026-03-17 Vojtech Cahlik , Rodrigo Alves , Pavel Kordik

Think Multilingual, Not Harder: A Data-Efficient Framework for Teaching Reasoning Models to Code-Switch

Recent developments in reasoning capabilities have enabled large language models to solve increasingly complex mathematical, symbolic, and logical tasks. Interestingly, while reasoning models are often trained to generate monolingual text,…

Computation and Language · Computer Science 2026-04-20 Eleanor M. Lin , David Jurgens

Long-Context Reasoning Through Proxy-Based Chain-of-Thought Tuning

Recent large language models support inputs of up to 10 million tokens, yet they perform poorly on long-context tasks that require complex reasoning. Such tasks can be solved using only a subset of the input -- a proxy context -- rather…

Computation and Language · Computer Science 2026-05-25 Miao Li , Irina Saparina , Alexander Gurung , Mirella Lapata

Disentangling Memory and Reasoning Ability in Large Language Models

Large Language Models (LLMs) have demonstrated strong performance in handling complex tasks requiring both extensive knowledge and reasoning abilities. However, the existing LLM inference pipeline operates as an opaque process without…

Computation and Language · Computer Science 2025-05-16 Mingyu Jin , Weidi Luo , Sitao Cheng , Xinyi Wang , Wenyue Hua , Ruixiang Tang , William Yang Wang , Yongfeng Zhang

Metacognitive Reuse: Turning Recurring LLM Reasoning Into Concise Behaviors

Large language models (LLMs) now solve multi-step problems by emitting extended chains of thought. During the process, they often re-derive the same intermediate steps across problems, inflating token usage and latency. This saturation of…

Machine Learning · Computer Science 2025-09-17 Aniket Didolkar , Nicolas Ballas , Sanjeev Arora , Anirudh Goyal

Teaching Smaller Language Models To Generalise To Unseen Compositional Questions (Full Thesis)

Pretrained large Language Models (LLMs) are able to answer questions that are unlikely to have been encountered during training. However a diversity of potential applications exist in the broad domain of reasoning systems and considerations…

Computation and Language · Computer Science 2024-11-27 Tim Hartill

Learning When to Stop: Adaptive Latent Reasoning via Reinforcement Learning

Latent reasoning represents a new development in Transformer language models that has shown potential in compressing reasoning lengths compared to chain-of-thought reasoning. By directly passing the information-rich previous final latent…

Machine Learning · Computer Science 2025-11-27 Alex Ning , Yen-Ling Kuo , Gabe Gomes

DySCO: Dynamic Attention-Scaling Decoding for Long-Context Language Models

Understanding and reasoning over long contexts is a crucial capability for language models (LMs). Although recent models support increasingly long context windows, their accuracy often deteriorates as input length grows. In practice, models…

Computation and Language · Computer Science 2026-04-17 Xi Ye , Wuwei Zhang , Fangcong Yin , Howard Yen , Danqi Chen

Interleaved Reasoning for Large Language Models via Reinforcement Learning

Long chain-of-thought (CoT) significantly enhances the reasoning capabilities of large language models (LLMs). However, extensive reasoning traces lead to inefficiencies and increased time-to-first-token (TTFT). We propose a training…

Computation and Language · Computer Science 2026-01-08 Roy Xie , David Qiu , Deepak Gopinath , Dong Lin , Yanchao Sun , Chong Wang , Saloni Potdar , Bhuwan Dhingra

Do Smaller Language Models Answer Contextualised Questions Through Memorisation Or Generalisation?

A distinction is often drawn between a model's ability to predict a label for an evaluation sample that is directly memorised from highly similar training samples versus an ability to predict the label via some method of generalisation. In…

Computation and Language · Computer Science 2023-11-22 Tim Hartill , Joshua Bensemann , Michael Witbrock , Patricia J. Riddle