Related papers: Does Entity Abstraction Help Generative Transforme…

Towards Knowledge-Grounded Natural Language Understanding and Generation

This thesis investigates how natural language understanding and generation with transformer models can benefit from grounding the models with knowledge representations and addresses the following key research questions: (i) Can knowledge of…

Computation and Language · Computer Science 2024-03-25 Chenxi Whitehouse

RLAD: Training LLMs to Discover Abstractions for Solving Reasoning Problems

Reasoning requires going beyond pattern matching or memorization of solutions to identify and implement "algorithmic procedures" that can be used to deduce answers to hard problems. Doing so requires realizing the most relevant primitives,…

Artificial Intelligence · Computer Science 2025-10-03 Yuxiao Qu , Anikait Singh , Yoonho Lee , Amrith Setlur , Ruslan Salakhutdinov , Chelsea Finn , Aviral Kumar

Learning to Reason With Relational Abstractions

Large language models have recently shown promising progress in mathematical reasoning when fine-tuned with human-generated sequences walking through a sequence of solution steps. However, the solution sequences are not formally structured…

Machine Learning · Computer Science 2022-12-07 Andrew J. Nam , Mengye Ren , Chelsea Finn , James L. McClelland

Abstraction-of-Thought Makes Language Models Better Reasoners

Abstract reasoning, the ability to reason from the abstract essence of a problem, serves as a key to generalization in human reasoning. However, eliciting language models to perform reasoning with abstraction remains unexplored. This paper…

Computation and Language · Computer Science 2024-09-27 Ruixin Hong , Hongming Zhang , Xiaoman Pan , Dong Yu , Changshui Zhang

Group Deliberation Oriented Multi-Agent Conversational Model for Complex Reasoning

This paper proposes a group deliberation oriented multi-agent conversational model to address the limitations of single large language models in complex reasoning tasks. The model adopts a three-level role division architecture consisting…

Artificial Intelligence · Computer Science 2026-01-01 Zheyu Shi , Dong Qiu , Shanlong Yu

Does Deep Learning Learn to Abstract? A Systematic Probing Framework

Abstraction is a desirable capability for deep learning models, which means to induce abstract concepts from concrete instances and flexibly apply them beyond the learning context. At the same time, there is a lack of clear understanding…

Machine Learning · Computer Science 2023-02-24 Shengnan An , Zeqi Lin , Bei Chen , Qiang Fu , Nanning Zheng , Jian-Guang Lou

Attending to Entities for Better Text Understanding

Recent progress in NLP witnessed the development of large-scale pre-trained language models (GPT, BERT, XLNet, etc.) based on Transformer (Vaswani et al. 2017), and in a range of end tasks, such models have achieved state-of-the-art…

Computation and Language · Computer Science 2019-11-12 Pengxiang Cheng , Katrin Erk

Calibrating Trust of Multi-Hop Question Answering Systems with Decompositional Probes

Multi-hop Question Answering (QA) is a challenging task since it requires an accurate aggregation of information from multiple context paragraphs and a thorough understanding of the underlying reasoning chains. Recent work in multi-hop QA…

Computation and Language · Computer Science 2022-11-02 Kaige Xie , Sarah Wiegreffe , Mark Riedl

A Reasoning Paradigm for Named Entity Recognition

Generative LLMs typically improve Named Entity Recognition (NER) performance through instruction tuning. They excel at generating entities by semantic pattern matching but lack an explicit, verifiable reasoning mechanism. This "cognitive…

Computation and Language · Computer Science 2025-11-18 Hui Huang , Yanping Chen , Ruizhang Huang , Chuan Lin , Yongbin Qin

Analyzing the Effectiveness of the Underlying Reasoning Tasks in Multi-hop Question Answering

To explain the predicted answers and evaluate the reasoning abilities of models, several studies have utilized underlying reasoning (UR) tasks in multi-hop question answering (QA) datasets. However, it remains an open question as to how…

Computation and Language · Computer Science 2023-02-14 Xanh Ho , Anh-Khoa Duong Nguyen , Saku Sugawara , Akiko Aizawa

Context-Aware Answer Extraction in Question Answering

Extractive QA models have shown very promising performance in predicting the correct answer to a question for a given passage. However, they sometimes result in predicting the correct answer text but in a context irrelevant to the given…

Computation and Language · Computer Science 2020-11-06 Yeon Seonwoo , Ji-Hoon Kim , Jung-Woo Ha , Alice Oh

Are Machines Better at Complex Reasoning? Unveiling Human-Machine Inference Gaps in Entailment Verification

Making inferences in text comprehension to understand the meaning is essential in language processing. This work studies the entailment verification (EV) problem of multi-sentence premises that requires a system to make multiple inferences…

Computation and Language · Computer Science 2024-05-29 Soumya Sanyal , Tianyi Xiao , Jiacheng Liu , Wenya Wang , Xiang Ren

Can LLMs Reason Abstractly Over Math Word Problems Without CoT? Disentangling Abstract Formulation From Arithmetic Computation

Final-answer-based metrics are commonly used for evaluating large language models (LLMs) on math word problems, often taken as proxies for reasoning ability. However, such metrics conflate two distinct sub-skills: abstract formulation…

Computation and Language · Computer Science 2025-05-30 Ziling Cheng , Meng Cao , Leila Pishdad , Yanshuai Cao , Jackie Chi Kit Cheung

Conceptual and Unbiased Reasoning in Language Models

Conceptual reasoning, the ability to reason in abstract and high-level perspectives, is key to generalization in human cognition. However, limited study has been done on large language models' capability to perform conceptual reasoning. In…

Computation and Language · Computer Science 2024-04-02 Ben Zhou , Hongming Zhang , Sihao Chen , Dian Yu , Hongwei Wang , Baolin Peng , Dan Roth , Dong Yu

Abstractors and relational cross-attention: An inductive bias for explicit relational reasoning in Transformers

An extension of Transformers is proposed that enables explicit relational reasoning through a novel module called the Abstractor. At the core of the Abstractor is a variant of attention called relational cross-attention. The approach is…

Machine Learning · Statistics 2024-04-16 Awni Altabaa , Taylor Webb , Jonathan Cohen , John Lafferty

Abstract Activation Spaces for Content-Invariant Reasoning in Large Language Models

Large Language Models (LLMs) often struggle with deductive judgment in syllogistic reasoning, systematically conflating semantic plausibility with formal validity a phenomenon known as content effect. This bias persists even when models…

Computation and Language · Computer Science 2026-02-03 Gabriele Maraia , Marco Valentino , Fabio Massimo Zanzotto , Leonardo Ranaldi

Procedural Pretraining: Warming Up Language Models with Abstract Data

Pretraining language models directly on web-scale corpora is the de facto paradigm. We study an alternative where the model is initially exposed to abstract structured data to ease the subsequent acquisition of rich semantic knowledge, much…

Computation and Language · Computer Science 2026-05-29 Liangze Jiang , Zachary Shinnick , Anton van den Hengel , Hemanth Saratchandran , Damien Teney

Learning by Abstraction: The Neural State Machine

We introduce the Neural State Machine, seeking to bridge the gap between the neural and symbolic views of AI and integrate their complementary strengths for the task of visual reasoning. Given an image, we first predict a probabilistic…

Artificial Intelligence · Computer Science 2019-11-26 Drew A. Hudson , Christopher D. Manning

Improving Numerical Reasoning Skills in the Modular Approach for Complex Question Answering on Text

Numerical reasoning skills are essential for complex question answering (CQA) over text. It requires opertaions including counting, comparison, addition and subtraction. A successful approach to CQA on text, Neural Module Networks (NMNs),…

Computation and Language · Computer Science 2021-09-07 Xiao-Yu Guo , Yuan-Fang Li , Gholamreza Haffari

AbsInstruct: Eliciting Abstraction Ability from LLMs through Explanation Tuning with Plausibility Estimation

Abstraction ability is crucial in human intelligence, which can also benefit various tasks in NLP study. Existing work shows that LLMs are deficient in abstract ability, and how to improve it remains unexplored. In this work, we design the…

Computation and Language · Computer Science 2024-06-18 Zhaowei Wang , Wei Fan , Qing Zong , Hongming Zhang , Sehyun Choi , Tianqing Fang , Xin Liu , Yangqiu Song , Ginny Y. Wong , Simon See