Related papers: Ensemble-Based Uncertainty Estimation for Code Cor…

Beyond Semantic Entropy: Boosting LLM Uncertainty Quantification with Pairwise Semantic Similarity

Hallucination in large language models (LLMs) can be detected by assessing the uncertainty of model outputs, typically measured using entropy. Semantic entropy (SE) enhances traditional entropy estimation by quantifying uncertainty at the…

Machine Learning · Computer Science 2025-06-03 Dang Nguyen, Ali Payani, Baharan Mirzasoleiman

Uncertainty Under the Curve: A Sequence-Level Entropy Area Metric for Reasoning LLM

In this work, we introduce Entropy Area Score (EAS), a simple yet effective metric to quantify uncertainty in the answer generation process of reasoning large language models (LLMs). EAS requires neither external models nor repeated…

Artificial Intelligence · Computer Science 2025-08-29 Yongfu Zhu, Lin Sun, Guangxiang Zhao, Weihong Lin, Xiangzheng Zhang

SCE: Scalable Consistency Ensembles Make Blackbox Large Language Model Generation More Reliable

Large language models (LLMs) have demonstrated remarkable performance, yet their diverse strengths and weaknesses prevent any single LLM from achieving dominance across all tasks. Ensembling multiple LLMs is a promising approach to generate…

Computation and Language · Computer Science 2025-03-17 Jiaxin Zhang, Zhuohang Li, Wendi Cui, Kamalika Das, Bradley Malin, Sricharan Kumar

Word-Sequence Entropy: Towards Uncertainty Estimation in Free-Form Medical Question Answering Applications and Beyond

Uncertainty estimation is crucial for the reliability of safety-critical human and artificial intelligence (AI) interaction systems, particularly in the domain of healthcare engineering. However, a robust and general uncertainty measure for…

Computation and Language · Computer Science 2024-11-19 Zhiyuan Wang, Jinhao Duan, Chenxi Yuan, Qingyu Chen, Tianlong Chen, Yue Zhang, Ren Wang, Xiaoshuang Shi, Kaidi Xu

Kernel Language Entropy: Fine-grained Uncertainty Quantification for LLMs from Semantic Similarities

Uncertainty quantification in Large Language Models (LLMs) is crucial for applications where safety and reliability are important. In particular, uncertainty can be used to improve the trustworthiness of LLMs by detecting factually…

Machine Learning · Computer Science 2024-05-31 Alexander Nikitin, Jannik Kossen, Yarin Gal, Pekka Marttinen

LLMs Uncertainty Quantification via Adaptive Conformal Semantic Entropy

LLMs' overconfidence, particularly when hallucinating, poses a significant challenge for the deployment of the models in safety-critical settings and makes a reliable estimation of uncertainty necessary. Existing approaches for uncertainty…

Machine Learning · Computer Science 2026-05-26 Hamed Karimi, Vaishali Meyappan, Reza Samavi

CoE: Collaborative Entropy for Uncertainty Quantification in Agentic Multi-LLM Systems

Uncertainty estimation in multi-LLM systems remains largely single-model-centric: existing methods quantify uncertainty within each model but do not adequately capture semantic disagreement across models. To address this gap, we propose…

Artificial Intelligence · Computer Science 2026-03-31 Kangkang Sun, Jun Wu, Jianhua Li, Minyi Guo, Xiuzhen Che, Jianwei Huang

Assessing Correctness in LLM-Based Code Generation via Uncertainty Estimation

In this work, we explore uncertainty estimation as a proxy for correctness in LLM-generated code. To this end, we adapt two state-of-the-art techniques from natural language generation -- one based on entropy and another on mutual…

Software Engineering · Computer Science 2025-07-02 Arindam Sharma, Cristina David

Semantic Reformulation Entropy for Robust Hallucination Detection in QA Tasks

Reliable question answering with large language models (LLMs) is challenged by hallucinations, fluent but factually incorrect outputs arising from epistemic uncertainty. Existing entropy-based semantic-level uncertainty estimation methods…

Computation and Language · Computer Science 2025-09-29 Chaodong Tong, Qi Zhang, Lei Jiang, Yanbing Liu, Nannan Sun, Wei Li

Using Semantic Distance to Estimate Uncertainty in LLM-Based Code Generation

LLMs show strong performance in code generation, but their outputs lack correctness guarantees. Sample-based uncertainty estimators address this by generating multiple candidate programs and measuring their disagreement. However, existing…

Software Engineering · Computer Science 2026-05-12 Weilin He, Arindam Sharma, Cristina David

Estimating Semantic Alphabet Size for LLM Uncertainty Quantification

Many black-box techniques for quantifying the uncertainty of large language models (LLMs) rely on repeated LLM sampling, which can be computationally expensive. Therefore, practical applicability demands reliable estimation from few…

Computation and Language · Computer Science 2026-02-09 Lucas H. McCabe, Rimon Melamed, Thomas Hartvigsen, H. Howie Huang

Probabilistic Consensus through Ensemble Validation: A Framework for LLM Reliability

Large Language Models (LLMs) have shown significant advances in text generation but often lack the reliability needed for autonomous deployment in high-stakes domains like healthcare, law, and finance. Existing approaches rely on external…

Artificial Intelligence · Computer Science 2024-11-12 Ninad Naik

Explanation-aware Soft Ensemble Empowers Large Language Model In-context Learning

Large language models (LLMs) have shown remarkable capabilities in various natural language understanding tasks. With only a few demonstration examples, these LLMs can quickly adapt to target tasks without expensive gradient updates. Common…

Computation and Language · Computer Science 2023-11-14 Yue Yu, Jiaming Shen, Tianqi Liu, Zhen Qin, Jing Nathan Yan, Jialu Liu, Chao Zhang, Michael Bendersky

Semantic Entropy Probes: Robust and Cheap Hallucination Detection in LLMs

We propose semantic entropy probes (SEPs), a cheap and reliable method for uncertainty quantification in Large Language Models (LLMs). Hallucinations, which are plausible-sounding but factually incorrect and arbitrary model generations,…

Computation and Language · Computer Science 2024-06-25 Jannik Kossen, Jiatong Han, Muhammed Razzak, Lisa Schut, Shreshth Malik, Yarin Gal

Towards Harmonized Uncertainty Estimation for Large Language Models

To facilitate robust and trustworthy deployment of large language models (LLMs), it is essential to quantify the reliability of their generations through uncertainty estimation. While recent efforts have made significant advancements by…

Computation and Language · Computer Science 2025-07-22 Rui Li, Jing Long, Muge Qi, Heming Xia, Lei Sha, Peiyi Wang, Zhifang Sui

SeSE: Black-Box Uncertainty Quantification for Large Language Models Based on Structural Information Theory

Reliable uncertainty quantification (UQ) is essential for deploying large language models (LLMs) in safety-critical scenarios, as it enables them to abstain from responding when uncertain, thereby avoiding hallucinations, i.e., plausible…

Computation and Language · Computer Science 2026-02-09 Xingtao Zhao, Hao Peng, Dingli Su, Xianghua Zeng, Chunyang Liu, Jinzhi Liao, Philip S. Yu

Improving Uncertainty Quantification in Large Language Models via Semantic Embeddings

Accurately quantifying uncertainty in large language models (LLMs) is crucial for their reliable deployment, especially in high-stakes applications. Current state-of-the-art methods for measuring semantic uncertainty in LLMs rely on strict…

Machine Learning · Computer Science 2024-10-31 Yashvir S. Grewal, Edwin V. Bonilla, Thang D. Bui

An Entropy-Based Test and Development Framework for Uncertainty Modeling in Level-Set Visualizations

We present a simple comparative framework for testing and developing uncertainty modeling in uncertain marching cubes implementations. The selection of a model to represent the probability distribution of uncertain values directly…

Human-Computer Interaction · Computer Science 2024-09-16 Robert Sisneros, Tushar M. Athawale, David Pugmire, Kenneth Moreland

A statistically consistent measure of semantic uncertainty using Language Models

To address the challenge of quantifying uncertainty in the outputs generated by language models, we propose a novel measure of semantic uncertainty, semantic spectral entropy, that is statistically consistent under mild assumptions. This…

Computation and Language · Computer Science 2025-05-27 Yi Liu

Enhancing LLM Code Generation with Ensembles: A Similarity-Based Selection Approach

Ensemble learning has been widely used in machine learning to improve model robustness, accuracy, and generalization, but has not yet been applied to code generation tasks with large language models (LLMs). We propose an ensemble approach…

Software Engineering · Computer Science 2025-07-22 Tarek Mahmud, Bin Duan, Corina Pasareanu, Guowei Yang