Related papers: Zero-Shot Multi-task Hallucination Detection

Quantity Matters: Towards Assessing and Mitigating Number Hallucination in Large Vision-Language Models

Large-scale vision-language models have demonstrated impressive skill in handling tasks that involve both areas. Nevertheless, these models frequently experience significant issues with generating inaccurate information, which is…

Computation and Language · Computer Science 2024-05-07 Huixuan Zhang, Junzhe Zhang, Xiaojun Wan

DelucionQA: Detecting Hallucinations in Domain-specific Question Answering

Hallucination is a well-known phenomenon in text generated by large language models (LLMs). The existence of hallucinatory responses is found in almost all application scenarios, e.g., summarization, question-answering (QA), etc. For…

Computation and Language · Computer Science 2023-12-11 Mobashir Sadat, Zhengyu Zhou, Lukas Lange, Jun Araki, Arsalan Gundroo, Bingqing Wang, Rakesh R Menon, Md Rizwan Parvez, Zhe Feng

InterrogateLLM: Zero-Resource Hallucination Detection in LLM-Generated Answers

Despite the many advances of Large Language Models (LLMs) and their unprecedented rapid evolution, their impact and integration into every facet of our daily lives is limited due to various reasons. One critical factor hindering their…

Computation and Language · Computer Science 2024-08-20 Yakir Yehuda, Itzik Malkiel, Oren Barkan, Jonathan Weill, Royi Ronen, Noam Koenigstein

Enhancing Hallucination Detection through Perturbation-Based Synthetic Data Generation in System Responses

Detecting hallucinations in large language model (LLM) outputs is pivotal, yet traditional fine-tuning for this classification task is impeded by the expensive and quickly outdated annotation process, especially across numerous vertical…

Artificial Intelligence · Computer Science 2024-07-09 Dongxu Zhang, Varun Gangal, Barrett Martin Lattimer, Yi Yang

Enhancing Hallucination Detection through Noise Injection

Large Language Models (LLMs) are prone to generating plausible yet incorrect responses, known as hallucinations. Effectively detecting hallucinations is therefore crucial for the safe deployment of LLMs. Recent research has linked…

Computation and Language · Computer Science 2026-03-03 Litian Liu, Reza Pourreza, Sunny Panchal, Apratim Bhattacharyya, Yubing Jian, Yao Qin, Roland Memisevic

Enhancing Uncertainty-Based Hallucination Detection with Stronger Focus

Large Language Models (LLMs) have gained significant popularity for their impressive performance across diverse fields. However, LLMs are prone to hallucinate untruthful or nonsensical outputs that fail to meet user expectations in many…

Computation and Language · Computer Science 2023-11-23 Tianhang Zhang, Lin Qiu, Qipeng Guo, Cheng Deng, Yue Zhang, Zheng Zhang, Chenghu Zhou, Xinbing Wang, Luoyi Fu

Understanding and Detecting Hallucinations in Neural Machine Translation via Model Introspection

Neural sequence generation models are known to "hallucinate", by producing outputs that are unrelated to the source text. These hallucinations are potentially harmful, yet it remains unclear in what conditions they arise and how to mitigate…

Computation and Language · Computer Science 2023-02-28 Weijia Xu, Sweta Agrawal, Eleftheria Briakou, Marianna J. Martindale, Marine Carpuat

Hallucination Detection in Foundation Models for Decision-Making: A Flexible Definition and Review of the State of the Art

Autonomous systems are soon to be ubiquitous, spanning manufacturing, agriculture, healthcare, entertainment, and other industries. Most of these systems are developed with modular sub-components for decision-making, planning, and control…

Artificial Intelligence · Computer Science 2025-02-12 Neeloy Chakraborty, Melkior Ornik, Katherine Driggs-Campbell

Principled Detection of Hallucinations in Large Language Models via Multiple Testing

While Large Language Models (LLMs) have emerged as powerful foundational models to solve a variety of tasks, they have also been shown to be prone to hallucinations, i.e., generating responses that sound confident but are actually incorrect…

Computation and Language · Computer Science 2026-04-29 Jiawei Li, Akshayaa Magesh, Venugopal V. Veeravalli

Theoretical Foundations and Mitigation of Hallucination in Large Language Models

Hallucination in Large Language Models (LLMs) refers to the generation of content that is not faithful to the input or the real-world facts. This paper provides a rigorous treatment of hallucination in LLMs, including formal definitions and…

Computation and Language · Computer Science 2025-08-01 Esmail Gumaan

Cost-Effective Hallucination Detection for LLMs

Large language models (LLMs) can be prone to hallucinations - generating unreliable outputs that are unfaithful to their inputs, external facts or internally inconsistent. In this work, we address several challenges for post-hoc…

Computation and Language · Computer Science 2024-08-12 Simon Valentin, Jinmiao Fu, Gianluca Detommaso, Shaoyuan Xu, Giovanni Zappella, Bryan Wang

A Systematic Literature Review of Code Hallucinations in LLMs: Characterization, Mitigation Methods, Challenges, and Future Directions for Reliable AI

Model hallucination is one of the most critical challenges faced by Large Language Models (LLMs), especially in high-stakes code intelligence tasks. As LLMs become increasingly integrated into software engineering tasks, understanding and…

Software Engineering · Computer Science 2025-11-04 Cuiyun Gao, Guodong Fan, Chun Yong Chong, Shizhan Chen, Chao Liu, David Lo, Zibin Zheng, Qing Liao

Detecting Hallucinations in Large Language Model Generation: A Token Probability Approach

Concerns regarding the propensity of Large Language Models (LLMs) to produce inaccurate outputs, also known as hallucinations, have escalated. Detecting them is vital for ensuring the reliability of applications relying on LLM-generated…

Computation and Language · Computer Science 2024-05-31 Ernesto Quevedo, Jorge Yero, Rachel Koerner, Pablo Rivas, Tomas Cerny

Hallucination as Commitment Failure: Larger LLMs Misfire Despite Knowing the Answer

Hallucination is often viewed as a direct consequence of missing knowledge: a model answers incorrectly when the correct answer is absent from its generation-time distribution, and correctly when it is present. We test this assumption by…

Computation and Language · Computer Science 2026-05-22 Jewon Yeom, Jaewon Sok, Heejun Kim, Seonghyeon Park, Jeongjae Park, Taesup Kim

Developing a Reliable, Fast, General-Purpose Hallucination Detection and Mitigation Service

Hallucination, a phenomenon where large language models (LLMs) produce output that is factually incorrect or unrelated to the input, is a major challenge for LLM applications that require accuracy and dependability. In this paper, we…

Computation and Language · Computer Science 2025-04-01 Song Wang, Xun Wang, Jie Mei, Yujia Xie, Sean Muarray, Zhang Li, Lingfeng Wu, Si-Qing Chen, Wayne Xiong

On the Hallucination in Simultaneous Machine Translation

It is widely known that hallucination is a critical issue in Simultaneous Machine Translation (SiMT) due to the absence of source-side information. While many efforts have been made to enhance performance for SiMT, few of them attempt to…

Computation and Language · Computer Science 2024-06-12 Meizhi Zhong, Kehai Chen, Zhengshan Xue, Lemao Liu, Mingming Yang, Min Zhang

Detecting and Mitigating Hallucinations in Machine Translation: Model Internal Workings Alone Do Well, Sentence Similarity Even Better

While the problem of hallucinations in neural machine translation has long been recognized, so far the progress on its alleviation is very little. Indeed, recently it turned out that without artificially encouraging models to hallucinate,…

Computation and Language · Computer Science 2022-12-21 David Dale, Elena Voita, Loïc Barrault, Marta R. Costa-jussà

Detecting LLM Hallucination Through Layer-wise Information Deficiency: Analysis of Ambiguous Prompts and Unanswerable Questions

Large language models (LLMs) frequently generate confident yet inaccurate responses, introducing significant risks for deployment in safety-critical domains. We present a novel, test-time approach to detecting model hallucination through…

Machine Learning · Computer Science 2025-10-07 Hazel Kim, Tom A. Lamb, Adel Bibi, Philip Torr, Yarin Gal

HalLoc: Token-level Localization of Hallucinations for Vision Language Models

Hallucinations pose a significant challenge to the reliability of large vision-language models, making their detection essential for ensuring accuracy in critical applications. Current detection methods often rely on computationally…

Computer Vision and Pattern Recognition · Computer Science 2025-06-13 Eunkyu Park, Minyeong Kim, Gunhee Kim

SLPL SHROOM at SemEval2024 Task 06: A comprehensive study on models ability to detect hallucination

Language models, particularly generative models, are susceptible to hallucinations, generating outputs that contradict factual knowledge or the source text. This study explores methods for detecting hallucinations in three SemEval-2024 Task…

Computation and Language · Computer Science 2024-04-10 Pouya Fallah, Soroush Gooran, Mohammad Jafarinasab, Pouya Sadeghi, Reza Farnia, Amirreza Tarabkhah, Zainab Sadat Taghavi, Hossein Sameti