Related papers: Ensemble-Based Uncertainty Estimation for Code Cor…

200 papers

Hallucination in large language models (LLMs) can be detected by assessing the uncertainty of model outputs, typically measured using entropy. Semantic entropy (SE) enhances traditional entropy estimation by quantifying uncertainty at the…

Machine Learning · Computer Science 2025-06-03 Dang Nguyen, Ali Payani, Baharan Mirzasoleiman

In this work, we introduce Entropy Area Score (EAS), a simple yet effective metric to quantify uncertainty in the answer generation process of reasoning large language models (LLMs). EAS requires neither external models nor repeated…

Artificial Intelligence · Computer Science 2025-08-29 Yongfu Zhu, Lin Sun, Guangxiang Zhao, Weihong Lin, Xiangzheng Zhang

Large language models (LLMs) have demonstrated remarkable performance, yet their diverse strengths and weaknesses prevent any single LLM from achieving dominance across all tasks. Ensembling multiple LLMs is a promising approach to generate…

Computation and Language · Computer Science 2025-03-17 Jiaxin Zhang, Zhuohang Li, Wendi Cui, Kamalika Das, Bradley Malin, Sricharan Kumar

Uncertainty estimation is crucial for the reliability of safety-critical human and artificial intelligence (AI) interaction systems, particularly in the domain of healthcare engineering. However, a robust and general uncertainty measure for…

Computation and Language · Computer Science 2024-11-19 Zhiyuan Wang, Jinhao Duan, Chenxi Yuan, Qingyu Chen, Tianlong Chen, Yue Zhang, Ren Wang, Xiaoshuang Shi, Kaidi Xu

Uncertainty quantification in Large Language Models (LLMs) is crucial for applications where safety and reliability are important. In particular, uncertainty can be used to improve the trustworthiness of LLMs by detecting factually…

Machine Learning · Computer Science 2024-05-31 Alexander Nikitin, Jannik Kossen, Yarin Gal, Pekka Marttinen

LLMs' overconfidence, particularly when hallucinating, poses a significant challenge for the deployment of the models in safety-critical settings and makes a reliable estimation of uncertainty necessary. Existing approaches for uncertainty…

Machine Learning · Computer Science 2026-05-26 Hamed Karimi, Vaishali Meyappan, Reza Samavi

Uncertainty estimation in multi-LLM systems remains largely single-model-centric: existing methods quantify uncertainty within each model but do not adequately capture semantic disagreement across models. To address this gap, we propose…

Artificial Intelligence · Computer Science 2026-03-31 Kangkang Sun, Jun Wu, Jianhua Li, Minyi Guo, Xiuzhen Che, Jianwei Huang

In this work, we explore uncertainty estimation as a proxy for correctness in LLM-generated code. To this end, we adapt two state-of-the-art techniques from natural language generation -- one based on entropy and another on mutual…

Software Engineering · Computer Science 2025-07-02 Arindam Sharma, Cristina David

Reliable question answering with large language models (LLMs) is challenged by hallucinations, fluent but factually incorrect outputs arising from epistemic uncertainty. Existing entropy-based semantic-level uncertainty estimation methods…

Computation and Language · Computer Science 2025-09-29 Chaodong Tong, Qi Zhang, Lei Jiang, Yanbing Liu, Nannan Sun, Wei Li

LLMs show strong performance in code generation, but their outputs lack correctness guarantees. Sample-based uncertainty estimators address this by generating multiple candidate programs and measuring their disagreement. However, existing…

Software Engineering · Computer Science 2026-05-12 Weilin He, Arindam Sharma, Cristina David

Many black-box techniques for quantifying the uncertainty of large language models (LLMs) rely on repeated LLM sampling, which can be computationally expensive. Therefore, practical applicability demands reliable estimation from few…

Computation and Language · Computer Science 2026-02-09 Lucas H. McCabe, Rimon Melamed, Thomas Hartvigsen, H. Howie Huang

Large Language Models (LLMs) have shown significant advances in text generation but often lack the reliability needed for autonomous deployment in high-stakes domains like healthcare, law, and finance. Existing approaches rely on external…

Artificial Intelligence · Computer Science 2024-11-12 Ninad Naik

Large language models (LLMs) have shown remarkable capabilities in various natural language understanding tasks. With only a few demonstration examples, these LLMs can quickly adapt to target tasks without expensive gradient updates. Common…

Computation and Language · Computer Science 2023-11-14 Yue Yu, Jiaming Shen, Tianqi Liu, Zhen Qin, Jing Nathan Yan, Jialu Liu, Chao Zhang, Michael Bendersky

We propose semantic entropy probes (SEPs), a cheap and reliable method for uncertainty quantification in Large Language Models (LLMs). Hallucinations, which are plausible-sounding but factually incorrect and arbitrary model generations,…

Computation and Language · Computer Science 2024-06-25 Jannik Kossen, Jiatong Han, Muhammed Razzak, Lisa Schut, Shreshth Malik, Yarin Gal

To facilitate robust and trustworthy deployment of large language models (LLMs), it is essential to quantify the reliability of their generations through uncertainty estimation. While recent efforts have made significant advancements by…

Computation and Language · Computer Science 2025-07-22 Rui Li, Jing Long, Muge Qi, Heming Xia, Lei Sha, Peiyi Wang, Zhifang Sui

Reliable uncertainty quantification (UQ) is essential for deploying large language models (LLMs) in safety-critical scenarios, as it enables them to abstain from responding when uncertain, thereby avoiding hallucinations, i.e., plausible…

Computation and Language · Computer Science 2026-02-09 Xingtao Zhao, Hao Peng, Dingli Su, Xianghua Zeng, Chunyang Liu, Jinzhi Liao, Philip S. Yu

Accurately quantifying uncertainty in large language models (LLMs) is crucial for their reliable deployment, especially in high-stakes applications. Current state-of-the-art methods for measuring semantic uncertainty in LLMs rely on strict…

Machine Learning · Computer Science 2024-10-31 Yashvir S. Grewal, Edwin V. Bonilla, Thang D. Bui

We present a simple comparative framework for testing and developing uncertainty modeling in uncertain marching cubes implementations. The selection of a model to represent the probability distribution of uncertain values directly…

Human-Computer Interaction · Computer Science 2024-09-16 Robert Sisneros, Tushar M. Athawale, David Pugmire, Kenneth Moreland

To address the challenge of quantifying uncertainty in the outputs generated by language models, we propose a novel measure of semantic uncertainty, semantic spectral entropy, that is statistically consistent under mild assumptions. This…

Computation and Language · Computer Science 2025-05-27 Yi Liu

Ensemble learning has been widely used in machine learning to improve model robustness, accuracy, and generalization, but has not yet been applied to code generation tasks with large language models (LLMs). We propose an ensemble approach…

Software Engineering · Computer Science 2025-07-22 Tarek Mahmud, Bin Duan, Corina Pasareanu, Guowei Yang