English
Related papers

Related papers: RECKONING: Reasoning through Dynamic Knowledge Enc…

200 papers

In current Large Language Models we can trust the production of smoothly flowing prose on the basis of the principles of machine learning. However, there is no comparably principled basis to justify trust in the content of the text…

Artificial Intelligence · Computer Science 2026-05-15 Leslie G. Valiant

We propose RecaLLM, a set of reasoning language models post-trained to make effective use of long-context information. In-context retrieval, which identifies relevant evidence from context, and reasoning are deeply intertwined: retrieval…

Computation and Language · Computer Science 2026-04-13 Kyle Whitecross , Negin Rahimi

Reasoning models enhance performance by tackling problems in a step-by-step manner, decomposing them into sub-problems and exploring long chains of thought before producing an answer. However, applying extended reasoning to every step…

Artificial Intelligence · Computer Science 2025-10-08 Haiquan Lu , Gongfan Fang , Xinyin Ma , Qi Li , Xinchao Wang

While large language models (LLMs) leverage both knowledge and reasoning during inference, the capacity to distinguish between them plays a pivotal role in model analysis, interpretability, and development. Inspired by dual-system cognitive…

Artificial Intelligence · Computer Science 2025-07-25 Mutian Yang , Jiandong Gao , Ji Wu

In multi-turn dialog, utterances do not always take the full form of sentences \cite{Carbonell1983DiscoursePA}, which naturally makes understanding the dialog context more difficult. However, it is essential to fully grasp the dialog…

Computation and Language · Computer Science 2020-12-15 Xiuying Chen , Zhi Cui , Jiayi Zhang , Chen Wei , Jianwei Cui , Bin Wang , Dongyan Zhao , Rui Yan

We analyze reasoning in language models during task-specific fine-tuning and draws parallel between reasoning tokens--intermediate steps generated while solving problem and the human working memory. Drawing from cognitive science, we align…

Computation and Language · Computer Science 2025-12-01 Mukul Singh , Ananya Singha , Arjun Radhakrishna , Sumit Gulwani

Supervised fine-tuning enhances the problem-solving abilities of language models across various mathematical reasoning tasks. To maximize such benefits, existing research focuses on broadening the training set with various data augmentation…

Computation and Language · Computer Science 2024-10-08 Zhihan Zhang , Tao Ge , Zhenwen Liang , Wenhao Yu , Dian Yu , Mengzhao Jia , Dong Yu , Meng Jiang

When provided with sufficient explanatory context, smaller Language Models have been shown to exhibit strong reasoning ability on challenging short-answer question-answering tasks where the questions are unseen in training. We evaluate two…

Computation and Language · Computer Science 2023-10-16 Tim Hartill , Diana Benavides-Prado , Michael Witbrock , Patricia J. Riddle

Transformer-based language models excel at both recall (retrieving memorized facts) and reasoning (performing multi-step inference), but whether these abilities rely on distinct internal mechanisms remains unclear. Distinguishing recall…

Machine Learning · Computer Science 2026-03-16 Harshwardhan Fartale , Ashish Kattamuri , Rahul Raja , Arpita Vats , Ishita Prasad , Akshata Kishore Moharir

Long-context reasoning requires accurately identifying relevant information in extensive, noisy input contexts. Previous research shows that using test-time learning to encode context directly into model parameters can effectively enable…

Computation and Language · Computer Science 2026-01-01 Zeming Chen , Angelika Romanou , Gail Weiss , Antoine Bosselut

We propose a large language model explainability technique for obtaining faithful natural language explanations by grounding the explanations in a reasoning process. When converted to a sequence of tokens, the outputs of the reasoning…

Machine Learning · Computer Science 2026-03-17 Vojtech Cahlik , Rodrigo Alves , Pavel Kordik

Recent developments in reasoning capabilities have enabled large language models to solve increasingly complex mathematical, symbolic, and logical tasks. Interestingly, while reasoning models are often trained to generate monolingual text,…

Computation and Language · Computer Science 2026-04-20 Eleanor M. Lin , David Jurgens

Recent large language models support inputs of up to 10 million tokens, yet they perform poorly on long-context tasks that require complex reasoning. Such tasks can be solved using only a subset of the input -- a proxy context -- rather…

Computation and Language · Computer Science 2026-05-25 Miao Li , Irina Saparina , Alexander Gurung , Mirella Lapata

Large Language Models (LLMs) have demonstrated strong performance in handling complex tasks requiring both extensive knowledge and reasoning abilities. However, the existing LLM inference pipeline operates as an opaque process without…

Computation and Language · Computer Science 2025-05-16 Mingyu Jin , Weidi Luo , Sitao Cheng , Xinyi Wang , Wenyue Hua , Ruixiang Tang , William Yang Wang , Yongfeng Zhang

Large language models (LLMs) now solve multi-step problems by emitting extended chains of thought. During the process, they often re-derive the same intermediate steps across problems, inflating token usage and latency. This saturation of…

Machine Learning · Computer Science 2025-09-17 Aniket Didolkar , Nicolas Ballas , Sanjeev Arora , Anirudh Goyal

Pretrained large Language Models (LLMs) are able to answer questions that are unlikely to have been encountered during training. However a diversity of potential applications exist in the broad domain of reasoning systems and considerations…

Computation and Language · Computer Science 2024-11-27 Tim Hartill

Latent reasoning represents a new development in Transformer language models that has shown potential in compressing reasoning lengths compared to chain-of-thought reasoning. By directly passing the information-rich previous final latent…

Machine Learning · Computer Science 2025-11-27 Alex Ning , Yen-Ling Kuo , Gabe Gomes

Understanding and reasoning over long contexts is a crucial capability for language models (LMs). Although recent models support increasingly long context windows, their accuracy often deteriorates as input length grows. In practice, models…

Computation and Language · Computer Science 2026-04-17 Xi Ye , Wuwei Zhang , Fangcong Yin , Howard Yen , Danqi Chen

Long chain-of-thought (CoT) significantly enhances the reasoning capabilities of large language models (LLMs). However, extensive reasoning traces lead to inefficiencies and increased time-to-first-token (TTFT). We propose a training…

Computation and Language · Computer Science 2026-01-08 Roy Xie , David Qiu , Deepak Gopinath , Dong Lin , Yanchao Sun , Chong Wang , Saloni Potdar , Bhuwan Dhingra

A distinction is often drawn between a model's ability to predict a label for an evaluation sample that is directly memorised from highly similar training samples versus an ability to predict the label via some method of generalisation. In…

Computation and Language · Computer Science 2023-11-22 Tim Hartill , Joshua Bensemann , Michael Witbrock , Patricia J. Riddle
‹ Prev 1 2 3 10 Next ›