Related papers: How Language Models Process Negation

Language models are not naysayers: An analysis of language models on negation benchmarks

Negation has been shown to be a major bottleneck for masked language models, such as BERT. However, whether this finding still holds for larger-sized auto-regressive language models (``LLMs'') has not been studied comprehensively. With the…

Computation and Language · Computer Science 2023-06-16 Thinh Hung Truong , Timothy Baldwin , Karin Verspoor , Trevor Cohn

This is not a Dataset: A Large Negation Benchmark to Challenge Large Language Models

Although large language models (LLMs) have apparently acquired a certain level of grammatical knowledge and the ability to make generalizations, they fail to interpret negation, a crucial step in Natural Language Processing. We try to…

Computation and Language · Computer Science 2023-10-25 Iker García-Ferrero , Begoña Altuna , Javier Álvez , Itziar Gonzalez-Dios , German Rigau

Developmental Negation Processing in Transformer Language Models

Reasoning using negation is known to be difficult for transformer-based language models. While previous studies have used the tools of psycholinguistics to probe a transformer's ability to reason over negation, none have focused on the…

Computation and Language · Computer Science 2022-05-02 Antonio Laverghetta , John Licato

Understanding by Understanding Not: Modeling Negation in Language Models

Negation is a core construction in natural language. Despite being very successful on many tasks, state-of-the-art pre-trained language models often handle negation incorrectly. To improve language models in this regard, we propose to…

Computation and Language · Computer Science 2021-05-11 Arian Hosseini , Siva Reddy , Dzmitry Bahdanau , R Devon Hjelm , Alessandro Sordoni , Aaron Courville

I've got the "Answer"! Interpretation of LLMs Hidden States in Question Answering

Interpretability and explainability of AI are becoming increasingly important in light of the rapid development of large language models (LLMs). This paper investigates the interpretation of LLMs in the context of the knowledge-based…

Computation and Language · Computer Science 2024-06-05 Valeriya Goloviznina , Evgeny Kotelnikov

Investigating and Addressing Hallucinations of LLMs in Tasks Involving Negation

Large Language Models (LLMs) have achieved remarkable performance across a wide variety of natural language tasks. However, they have been shown to suffer from a critical limitation pertinent to 'hallucination' in their output. Recent…

Computation and Language · Computer Science 2024-06-11 Neeraj Varshney , Satyam Raj , Venkatesh Mishra , Agneet Chatterjee , Ritika Sarkar , Amir Saeidi , Chitta Baral

Negation: A Pink Elephant in the Large Language Models' Room?

Negations are key to determining sentence meaning, making them essential for logical reasoning. Despite their importance, negations pose a substantial challenge for large language models (LLMs) and remain underexplored. We constructed and…

Computation and Language · Computer Science 2025-06-04 Tereza Vrabcová , Marek Kadlčík , Petr Sojka , Michal Štefánik , Michal Spiegel

Revisiting subword tokenization: A case study on affixal negation in large language models

In this work, we measure the impact of affixal negation on modern English large language models (LLMs). In affixal negation, the negated meaning is expressed through a negative morpheme, which is potentially challenging for LLMs as their…

Computation and Language · Computer Science 2024-04-05 Thinh Hung Truong , Yulia Otmakhova , Karin Verspoor , Trevor Cohn , Timothy Baldwin

Improving negation detection with negation-focused pre-training

Negation is a common linguistic feature that is crucial in many language understanding tasks, yet it remains a hard problem due to diversity in its expression in different types of text. Recent work has shown that state-of-the-art NLP…

Computation and Language · Computer Science 2022-05-10 Thinh Hung Truong , Timothy Baldwin , Trevor Cohn , Karin Verspoor

Assessing Step-by-Step Reasoning against Lexical Negation: A Case Study on Syllogism

Large language models (LLMs) take advantage of step-by-step reasoning instructions, e.g., chain-of-thought (CoT) prompting. Building on this, their ability to perform CoT-style reasoning robustly is of interest from a probing perspective.…

Computation and Language · Computer Science 2023-10-24 Mengyu Ye , Tatsuki Kuribayashi , Jun Suzuki , Goro Kobayashi , Hiroaki Funayama

Negation-Induced Forgetting in LLMs

The study explores whether Large Language Models (LLMs) exhibit negation-induced forgetting (NIF), a cognitive phenomenon observed in humans where negating incorrect attributes of an object or event leads to diminished recall of this object…

Computation and Language · Computer Science 2025-02-27 Francesca Capuano , Ellen Boschert , Barbara Kaup

Commonsense Knowledge with Negation: A Resource to Enhance Negation Understanding

Negation is a common and important semantic feature in natural language, yet Large Language Models (LLMs) struggle when negation is involved in natural language understanding tasks. Commonsense knowledge, on the other hand, despite being a…

Computation and Language · Computer Science 2026-04-23 Zijie Wang , MohammadHossein Rezaei , Farzana Rashid , Eduardo Blanco

NegVQA: Can Vision Language Models Understand Negation?

Negation is a fundamental linguistic phenomenon that can entirely reverse the meaning of a sentence. As vision language models (VLMs) continue to advance and are deployed in high-stakes applications, assessing their ability to comprehend…

Computation and Language · Computer Science 2025-05-30 Yuhui Zhang , Yuchang Su , Yiming Liu , Serena Yeung-Levy

Can large language models generate salient negative statements?

We examine the ability of large language models (LLMs) to generate salient (interesting) negative statements about real-world entities; an emerging research topic of the last few years. We probe the LLMs using zero- and k-shot unconstrained…

Computation and Language · Computer Science 2023-09-22 Hiba Arnaout , Simon Razniewski

Strong hallucinations from negation and how to fix them

Despite great performance on many tasks, language models (LMs) still struggle with reasoning, sometimes providing responses that cannot possibly be true because they stem from logical incoherence. We call such responses \textit{strong…

Computation and Language · Computer Science 2024-08-21 Nicholas Asher , Swarnadeep Bhar

Misinforming LLMs: vulnerabilities, challenges and opportunities

Large Language Models (LLMs) have made significant advances in natural language processing, but their underlying mechanisms are often misunderstood. Despite exhibiting coherent answers and apparent reasoning behaviors, LLMs rely on…

Computation and Language · Computer Science 2024-08-05 Bo Zhou , Daniel Geißler , Paul Lukowicz

Vision-Language Models Do Not Understand Negation

Many practical vision-language applications require models that understand negation, e.g., when using natural language to retrieve images which contain certain objects but not others. Despite advancements in vision-language models (VLMs)…

Computer Vision and Pattern Recognition · Computer Science 2025-05-14 Kumail Alhamoud , Shaden Alshammari , Yonglong Tian , Guohao Li , Philip Torr , Yoon Kim , Marzyeh Ghassemi

Towards Uncovering How Large Language Model Works: An Explainability Perspective

Large language models (LLMs) have led to breakthroughs in language tasks, yet the internal mechanisms that enable their remarkable generalization and reasoning abilities remain opaque. This lack of transparency presents challenges such as…

Computation and Language · Computer Science 2024-04-17 Haiyan Zhao , Fan Yang , Bo Shen , Himabindu Lakkaraju , Mengnan Du

Language-Specific Latent Process Hinders Cross-Lingual Performance

Large language models (LLMs) are demonstrably capable of cross-lingual transfer, but can produce inconsistent output when prompted with the same queries written in different languages. To understand how language models are able to…

Computation and Language · Computer Science 2025-09-29 Zheng Wei Lim , Alham Fikri Aji , Trevor Cohn

Exploring the Hidden Reasoning Process of Large Language Models by Misleading Them

Large language models (LLMs) have been able to perform various forms of reasoning tasks in a wide range of scenarios, but are they truly engaging in task abstraction and rule-based reasoning beyond mere memorization? To answer this…

Machine Learning · Computer Science 2025-12-09 Guanyu Chen , Peiyang Wang , Yizhou Jiang , Yuqian Liu , Chujie Zhao , Ying Fang , Tianren Zhang , Feng Chen