Related papers: Uncertainty Quantification for LLM Function-Callin…

Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey

Large Language Models (LLMs) excel in text generation, reasoning, and decision-making, enabling their adoption in high-stakes domains such as healthcare, law, and transportation. However, their reliability is a major concern, as they often…

Computation and Language · Computer Science 2025-06-05 Xiaoou Liu, Tiejin Chen, Longchao Da, Chacha Chen, Zhen Lin, Hua Wei

Why Don't You Know? Evaluating the Impact of Uncertainty Sources on Uncertainty Quantification in LLMs

As Large Language Models (LLMs) are increasingly deployed in real-world applications, reliable uncertainty quantification (UQ) becomes critical for safe and effective use. Most existing UQ approaches for language models aim to produce a…

Computation and Language · Computer Science 2026-04-14 Maiya Goloburda, Roman Vashurin, Fedor Chernogorsky, Nurkhan Laiyk, Daniil Orel, Preslav Nakov, Maxim Panov

Evaluating Uncertainty Quantification Methods in Argumentative Large Language Models

Research in uncertainty quantification (UQ) for large language models (LLMs) is increasingly important towards guaranteeing the reliability of this groundbreaking technology. We explore the integration of LLM UQ methods in argumentative…

Computation and Language · Computer Science 2026-05-08 Kevin Zhou, Adam Dejl, Gabriel Freedman, Lihu Chen, Antonio Rago, Francesca Toni

Uncertainty Quantification of Large Language Models through Multi-Dimensional Responses

Large Language Models (LLMs) have demonstrated remarkable capabilities across various tasks due to large training datasets and powerful transformer architecture. However, the reliability of responses from LLMs remains a question.…

Computation and Language · Computer Science 2025-02-26 Tiejin Chen, Xiaoou Liu, Longchao Da, Jia Chen, Vagelis Papalexakis, Hua Wei

From Calibration to Collaboration: LLM Uncertainty Quantification Should Be More Human-Centered

Large Language Models (LLMs) are increasingly assisting users in the real world, yet their reliability remains a concern. Uncertainty quantification (UQ) has been heralded as a tool to enhance human-LLM collaboration by enabling users to…

Computation and Language · Computer Science 2025-06-10 Siddartha Devic, Tejas Srinivasan, Jesse Thomason, Willie Neiswanger, Vatsal Sharan

Uncertainty Quantification for LLMs through Minimum Bayes Risk: Bridging Confidence and Consistency

Uncertainty quantification (UQ) methods for Large Language Models (LLMs) encompass a variety of approaches, with two major types being particularly prominent: information-based, which focus on model confidence expressed as token…

Computation and Language · Computer Science 2025-12-10 Roman Vashurin, Maiya Goloburda, Albina Ilina, Aleksandr Rubashevskii, Preslav Nakov, Artem Shelmanov, Maxim Panov

LUQ: Long-text Uncertainty Quantification for LLMs

Large Language Models (LLMs) have demonstrated remarkable capability in a variety of NLP tasks. However, LLMs are also prone to generate nonfactual content. Uncertainty Quantification (UQ) is pivotal in enhancing our understanding of a…

Computation and Language · Computer Science 2024-10-07 Caiqi Zhang, Fangyu Liu, Marco Basaldella, Nigel Collier

Uncertainty Quantification for Hallucination Detection in Large Language Models: Foundations, Methodology, and Future Directions

The rapid advancement of large language models (LLMs) has transformed the landscape of natural language processing, enabling breakthroughs across a wide range of areas including question answering, machine translation, and text…

Computation and Language · Computer Science 2025-10-15 Sungmin Kang, Yavuz Faruk Bakman, Duygu Nur Yaldiz, Baturalp Buyukates, Salman Avestimehr

CoT-UQ: Improving Response-wise Uncertainty Quantification in LLMs with Chain-of-Thought

Large language models (LLMs) excel in many tasks but struggle to accurately quantify uncertainty in their generated responses. This limitation makes it challenging to detect misinformation and ensure reliable decision-making. Existing…

Computation and Language · Computer Science 2025-06-04 Boxuan Zhang, Ruqi Zhang

Benchmarking Uncertainty Quantification Methods for Large Language Models with LM-Polygraph

The rapid proliferation of large language models (LLMs) has stimulated researchers to seek effective and efficient approaches to deal with LLM hallucinations and low-quality outputs. Uncertainty quantification (UQ) is a key element of…

Computation and Language · Computer Science 2025-07-01 Roman Vashurin, Ekaterina Fadeeva, Artem Vazhentsev, Lyudmila Rvanova, Akim Tsvigun, Daniil Vasilev, Rui Xing, Abdelrahman Boda Sadallah, Kirill Grishchenkov, Sergey Petrakov, Alexander Panchenko, Timothy Baldwin, Preslav Nakov, Maxim Panov, Artem Shelmanov

Comparing Uncertainty Measurement and Mitigation Methods for Large Language Models: A Systematic Review

Large Language Models (LLMs) have been transformative across many domains. However, hallucination, i.e., confidently outputting incorrect information, remains one of the leading challenges for LLMs. This raises the question of how to…

Computation and Language · Computer Science 2026-03-19 Toghrul Abbasli, Kentaroh Toyoda, Yuan Wang, Leon Witt, Muhammad Asif Ali, Yukai Miao, Dan Li, Qingsong Wei

Functional Entropy: Predicting Functional Correctness in LLM-Generated Code with Uncertainty Quantification

Large language models have shown impressive capabilities in code generation, yet they often produce functionally incorrect code. Uncertainty quantification (UQ) methods have emerged as a promising approach for detecting hallucinations in…

Computation and Language · Computer Science 2026-05-28 Dylan Bouchard, Mohit Singh Chauhan, Zeya Ahmad, Ho-Kyeong Ra

Benchmarking Uncertainty Calibration in Large Language Model Long-Form Question Answering

Large Language Models (LLMs) are commonly used in Question Answering (QA) settings, increasingly in the natural sciences if not science at large. Reliable Uncertainty Quantification (UQ) is critical for the trustworthy uptake of generated…

Computation and Language · Computer Science 2026-02-03 Philip Müller, Nicholas Popovič, Michael Färber, Peter Steinbach

Mind the Gap: Benchmarking LLM Uncertainty and Calibration with Specialty-Aware Clinical QA and Reasoning-Based Behavioural Features

Reliable uncertainty quantification (UQ) is essential when employing large language models (LLMs) in high-risk domains such as clinical question answering (QA). In this work, we evaluate uncertainty estimation methods for clinical QA…

Computation and Language · Computer Science 2026-01-27 Alberto Testoni, Iacer Calixto

Functional-level Uncertainty Quantification for Calibrated Fine-tuning on LLMs

Accurate uncertainty quantification in large language models (LLMs) is essential for reliable confidence estimation, yet fine-tuned LLMs often become overconfident under limited adaptation data. Existing uncertainty methods for PEFT-based…

Machine Learning · Computer Science 2026-05-15 Ruijia Niu, Dongxia Wu, Rose Yu, Yi-An Ma

Position: Uncertainty Quantification in LLMs is Just Unsupervised Clustering

Uncertainty Quantification (UQ) is widely regarded as the primary safeguard for deploying Large Language Models (LLMs) in high-stakes domains. However, we argue that the field suffers from a category error: mainstream UQ methods for LLMs…

Computation and Language · Computer Science 2026-05-20 Tiejin Chen, Longchao Da, Xiaoou Liu, Hua Wei

When does a large language model (LLM) know what it does not know? Uncertainty quantification (UQ) provides measures of uncertainty, such as an estimate of the confidence in an LLM's generated output, and is therefore increasingly…

Computation and Language · Computer Science 2025-10-17 Debarun Bhattacharjya, Balaji Ganesan, Junkyu Lee, Radu Marinescu, Katsiaryna Mirylenka, Michael Glass, Xiao Shou

Grammars of Formal Uncertainty: When to Trust LLMs in Automated Reasoning Tasks

Large language models (LLMs) show remarkable promise for democratizing automated reasoning by generating formal specifications. However, a fundamental tension exists: LLMs are probabilistic, while formal verification demands deterministic…

Computation and Language · Computer Science 2025-05-27 Debargha Ganguly, Vikash Singh, Sreehari Sankar, Biyao Zhang, Xuecen Zhang, Srinivasan Iyengar, Xiaotian Han, Amit Sharma, Shivkumar Kalyanaraman, Vipin Chaudhary

Can Linear Probes Measure LLM Uncertainty?

Effective Uncertainty Quantification (UQ) represents a key aspect for reliable deployment of Large Language Models (LLMs) in automated decision-making and beyond. Yet, for LLM generation with multiple choice structure, the state-of-the-art…

Machine Learning · Computer Science 2025-11-18 Ramzi Dakhmouche, Adrien Letellier, Hossein Gorji

The Consistency Hypothesis in Uncertainty Quantification for Large Language Models

Estimating the confidence of large language model (LLM) outputs is essential for real-world applications requiring high user trust. Black-box uncertainty quantification (UQ) methods, relying solely on model API access, have gained…

Computation and Language · Computer Science 2025-06-30 Quan Xiao, Debarun Bhattacharjya, Balaji Ganesan, Radu Marinescu, Katsiaryna Mirylenka, Nhan H Pham, Michael Glass, Junkyu Lee