Related papers: CodeHalu: Investigating Code Hallucinations in LLM…

Hallucination by Code Generation LLMs: Taxonomy, Benchmarks, Mitigation, and Challenges

Recent technical breakthroughs in large language models (LLMs) have enabled them to fluently generate source code. Software developers often leverage both general-purpose and code-specialized LLMs to revise existing code or even generate a…

Software Engineering · Computer Science 2025-05-14 Yunseo Lee , John Youngeun Song , Dongsun Kim , Jindae Kim , Mijung Kim , Jaechang Nam

CodeMirage: Hallucinations in Code Generated by Large Language Models

Large Language Models (LLMs) have shown promising potentials in program generation and no-code automation. However, LLMs are prone to generate hallucinations, i.e., they generate text which sounds plausible but is incorrect. Although there…

Software Engineering · Computer Science 2025-07-10 Vibhor Agarwal , Yulong Pei , Salwa Alamir , Xiaomo Liu

Beyond Functional Correctness: Exploring Hallucinations in LLM-Generated Code

The rise of Large Language Models (LLMs) has significantly advanced various applications on software engineering tasks, particularly in code generation. Despite the promising performance, LLMs are prone to generate hallucinations, which…

Software Engineering · Computer Science 2026-01-22 Fang Liu , Yang Liu , Lin Shi , Zhen Yang , Li Zhang , Xiaoli Lian , Zhongqi Li , Yuchi Ma

A Systematic Literature Review of Code Hallucinations in LLMs: Characterization, Mitigation Methods, Challenges, and Future Directions for Reliable AI

Model hallucination is one of the most critical challenges faced by Large Language Models (LLMs), especially in high-stakes code intelligence tasks. As LLMs become increasingly integrated into software engineering tasks, understanding and…

Software Engineering · Computer Science 2025-11-04 Cuiyun Gao , Guodong Fan , Chun Yong Chong , Shizhan Chen , Chao Liu , David Lo , Zibin Zheng , Qing Liao

Collu-Bench: A Benchmark for Predicting Language Model Hallucinations in Code

Despite their success, large language models (LLMs) face the critical challenge of hallucinations, generating plausible but incorrect content. While much research has focused on hallucinations in multiple modalities including images and…

Software Engineering · Computer Science 2024-10-15 Nan Jiang , Qi Li , Lin Tan , Tianyi Zhang

LLM Hallucinations in Practical Code Generation: Phenomena, Mechanism, and Mitigation

Code generation aims to automatically generate code from input requirements, significantly enhancing development efficiency. Recent large language models (LLMs) based approaches have shown promising results and revolutionized code…

Software Engineering · Computer Science 2025-01-20 Ziyao Zhang , Yanlin Wang , Chong Wang , Jiachi Chen , Zibin Zheng

Delulu: A Verified Multi-Lingual Benchmark for Code Hallucination Detection in Fill-in-the-Middle Tasks

Large Language Models for code generation frequently produce hallucinations in Fill-in-the-Middle (FIM) tasks -- plausible but incorrect completions such as invented API methods, invalid parameters, undefined variables, or non-existent…

Machine Learning · Computer Science 2026-05-12 Mahdi Erfanian , Nelson Daniel Troncoso , Aashna Garg , Amabel Gale , Xiaoyu Liu , Pareesa Ameneh Golnari , Shengyu Fu

Classification-Based Automatic HDL Code Generation Using LLMs

While large language models (LLMs) have demonstrated the ability to generate hardware description language (HDL) code for digital circuits, they still suffer from the hallucination problem, which leads to the generation of incorrect HDL…

Hardware Architecture · Computer Science 2024-07-29 Wenhao Sun , Bing Li , Grace Li Zhang , Xunzhao Yin , Cheng Zhuo , Ulf Schlichtmann

Code Hallucination

Generative models such as large language models are extensively used as code copilots and for whole program generation. However, the programs they generate often have questionable correctness, authenticity and reliability in terms of…

Artificial Intelligence · Computer Science 2024-08-09 Mirza Masfiqur Rahman , Ashish Kundu

HALoGEN: Fantastic LLM Hallucinations and Where to Find Them

Despite their impressive ability to generate high-quality and fluent text, generative large language models (LLMs) also produce hallucinations: statements that are misaligned with established world knowledge or provided input context.…

Computation and Language · Computer Science 2025-01-15 Abhilasha Ravichander , Shrusti Ghela , David Wadden , Yejin Choi

Hallucination in LLM-Based Code Generation: An Automotive Case Study

Large Language Models (LLMs) have shown significant potential in automating code generation tasks offering new opportunities across software engineering domains. However, their practical application remains limited due to hallucinations -…

Software Engineering · Computer Science 2025-08-18 Marc Pavel , Nenad Petrovic , Lukasz Mazur , Vahid Zolfaghari , Fengjunjie Pan , Alois Knoll

CodeMMLU: A Multi-Task Benchmark for Assessing Code Understanding & Reasoning Capabilities of CodeLLMs

Recent advances in Code Large Language Models (CodeLLMs) have primarily focused on open-ended code generation, often overlooking the crucial aspect of code understanding and reasoning. To bridge this gap, we introduce CodeMMLU, a…

Software Engineering · Computer Science 2025-04-10 Dung Nguyen Manh , Thang Phan Chau , Nam Le Hai , Thong T. Doan , Nam V. Nguyen , Quang Pham , Nghi D. Q. Bui

Hal-Eval: A Universal and Fine-grained Hallucination Evaluation Framework for Large Vision Language Models

Large Vision Language Models exhibit remarkable capabilities but struggle with hallucinations inconsistencies between images and their descriptions. Previous hallucination evaluation studies on LVLMs have identified hallucinations in terms…

Artificial Intelligence · Computer Science 2024-11-11 Chaoya Jiang , Hongrui Jia , Wei Ye , Mengfan Dong , Haiyang Xu , Ming Yan , Ji Zhang , Shikun Zhang

MedHallu: A Comprehensive Benchmark for Detecting Medical Hallucinations in Large Language Models

Advancements in Large Language Models (LLMs) and their increasing use in medical question-answering necessitate rigorous evaluation of their reliability. A critical challenge lies in hallucination, where models generate plausible yet…

Computation and Language · Computer Science 2025-02-21 Shrey Pandit , Jiawei Xu , Junyuan Hong , Zhangyang Wang , Tianlong Chen , Kaidi Xu , Ying Ding

Hallucination Detection and Evaluation of Large Language Model

Hallucinations in Large Language Models (LLMs) pose a significant challenge, generating misleading or unverifiable content that undermines trust and reliability. Existing evaluation methods, such as KnowHalu, employ multi-stage verification…

Computation and Language · Computer Science 2026-04-10 Chenggong Zhang , Haopeng Wang , Hexi Meng

Large Language Models Hallucination: A Comprehensive Survey

Large language models (LLMs) have transformed natural language processing, achieving remarkable performance across diverse tasks. However, their impressive fluency often comes at the cost of producing false or fabricated information, a…

Computation and Language · Computer Science 2026-03-20 Aisha Alansari , Hamzah Luqman

Hallucinations in Code Change to Natural Language Generation: Prevalence and Evaluation of Detection Metrics

Language models have shown strong capabilities across a wide range of tasks in software engineering, such as code generation, yet they suffer from hallucinations. While hallucinations have been studied independently in natural language and…

Software Engineering · Computer Science 2025-08-13 Chunhua Liu , Hong Yi Lin , Patanamon Thongtanunam

HalluLens: LLM Hallucination Benchmark

Large language models (LLMs) often generate responses that deviate from user input or training data, a phenomenon known as "hallucination." These hallucinations undermine user trust and hinder the adoption of generative AI systems.…

Computation and Language · Computer Science 2025-04-25 Yejin Bang , Ziwei Ji , Alan Schelten , Anthony Hartshorn , Tara Fowler , Cheng Zhang , Nicola Cancedda , Pascale Fung

A Survey of Hallucination in Large Visual Language Models

The Large Visual Language Models (LVLMs) enhances user interaction and enriches user experience by integrating visual modality on the basis of the Large Language Models (LLMs). It has demonstrated their powerful information processing and…

Artificial Intelligence · Computer Science 2024-10-22 Wei Lan , Wenyi Chen , Qingfeng Chen , Shirui Pan , Huiyu Zhou , Yi Pan

A comprehensive taxonomy of hallucinations in Large Language Models

Large language models (LLMs) have revolutionized natural language processing, yet their propensity for hallucination, generating plausible but factually incorrect or fabricated content, remains a critical challenge. This report provides a…

Computation and Language · Computer Science 2025-08-05 Manuel Cossio