Related papers: Uncertainty Quantification for LLM-based Code Gene…

Random-Set Large Language Models

Large Language Models (LLMs) are known to produce very high-quality tests and responses to our queries. But how much can we trust this generated text? In this paper, we study the problem of uncertainty quantification in LLMs. We propose a…

Computation and Language · Computer Science 2025-04-28 Muhammad Mubashar , Shireen Kudukkil Manchingal , Fabio Cuzzolin

PAC Prediction Sets for Large Language Models of Code

Prediction sets have recently been shown to be a promising strategy for quantifying the uncertainty of deep neural networks in a way that provides theoretical guarantees. However, existing techniques have largely targeted settings where the…

Machine Learning · Computer Science 2023-06-22 Adam Khakhar , Stephen Mell , Osbert Bastani

Assessing Correctness in LLM-Based Code Generation via Uncertainty Estimation

In this work, we explore uncertainty estimation as a proxy for correctness in LLM-generated code. To this end, we adapt two state-of-the-art techniques from natural language generation -- one based on entropy and another on mutual…

Software Engineering · Computer Science 2025-07-02 Arindam Sharma , Cristina David

Uncertainty-Guided Chain-of-Thought for Code Generation with LLMs

Chain-of-Thought (CoT) reasoning has been demonstrated as an effective technique for improving the problem-solving capabilities of large language models (LLMs) in the context of code generation. However, existing CoT methods often exhibit a…

Software Engineering · Computer Science 2025-03-20 Yuqi Zhu , Ge Li , Xue Jiang , Jia Li , Hong Mei , Zhi Jin , Yihong Dong

Systematic Evaluation of Uncertainty Estimation Methods in Large Language Models

Large language models (LLMs) produce outputs with varying levels of uncertainty, and, just as often, varying levels of correctness; making their practical reliability far from guaranteed. To quantify this uncertainty, we systematically…

Computation and Language · Computer Science 2025-10-24 Christian Hobelsberger , Theresa Winner , Andreas Nawroth , Oliver Mitevski , Anna-Carolina Haensch

Generating with Confidence: Uncertainty Quantification for Black-box Large Language Models

Large language models (LLMs) specializing in natural language generation (NLG) have recently started exhibiting promising capabilities across a variety of domains. However, gauging the trustworthiness of responses generated by LLMs remains…

Computation and Language · Computer Science 2024-05-21 Zhen Lin , Shubhendu Trivedi , Jimeng Sun

Uncertainty Quantification and Decomposition for LLM-based Recommendation

Despite the widespread adoption of large language models (LLMs) for recommendation, we demonstrate that LLMs often exhibit uncertainty in their recommendations. To ensure the trustworthy use of LLMs in generating recommendations, we…

Information Retrieval · Computer Science 2025-02-13 Wonbin Kweon , Sanghwan Jang , SeongKu Kang , Hwanjo Yu

Quantifying and Understanding Uncertainty in Large Reasoning Models

Large Reasoning Models (LRMs) have recently demonstrated significant improvements in complex reasoning. While quantifying generation uncertainty in LRMs is crucial, traditional methods are often insufficient because they do not provide…

Artificial Intelligence · Computer Science 2026-04-16 Yangyi Li , Chenxu Zhao , Mengdi Huai

Uncertainty Quantification and Confidence Calibration in Large Language Models: A Survey

Large Language Models (LLMs) excel in text generation, reasoning, and decision-making, enabling their adoption in high-stakes domains such as healthcare, law, and transportation. However, their reliability is a major concern, as they often…

Computation and Language · Computer Science 2025-06-05 Xiaoou Liu , Tiejin Chen , Longchao Da , Chacha Chen , Zhen Lin , Hua Wei

Rethinking Uncertainty Estimation in LLMs: A Principled Single-Sequence Measure

Large Language Models (LLMs) are increasingly employed in real-world applications, driving the need to evaluate the trustworthiness of their generated text. To this end, reliable uncertainty estimation is essential. Leading uncertainty…

Machine Learning · Computer Science 2026-04-21 Lukas Aichberger , Kajetan Schweighofer , Sepp Hochreiter

A Survey of Uncertainty Estimation in LLMs: Theory Meets Practice

As large language models (LLMs) continue to evolve, understanding and quantifying the uncertainty in their predictions is critical for enhancing application credibility. However, the existing literature relevant to LLM uncertainty…

Computation and Language · Computer Science 2024-10-22 Hsiu-Yuan Huang , Yutong Yang , Zhaoxi Zhang , Sanwoo Lee , Yunfang Wu

Fact-Checking the Output of Large Language Models via Token-Level Uncertainty Quantification

Large language models (LLMs) are notorious for hallucinating, i.e., producing erroneous claims in their output. Such hallucinations can be dangerous, as occasional factual inaccuracies in the generated text might be obscured by the rest of…

Computation and Language · Computer Science 2024-06-10 Ekaterina Fadeeva , Aleksandr Rubashevskii , Artem Shelmanov , Sergey Petrakov , Haonan Li , Hamdy Mubarak , Evgenii Tsymbalov , Gleb Kuzmin , Alexander Panchenko , Timothy Baldwin , Preslav Nakov , Maxim Panov

Uncertainty Quantification for LLMs through Minimum Bayes Risk: Bridging Confidence and Consistency

Uncertainty quantification (UQ) methods for Large Language Models (LLMs) encompass a variety of approaches, with two major types being particularly prominent: information-based, which focus on model confidence expressed as token…

Computation and Language · Computer Science 2025-12-10 Roman Vashurin , Maiya Goloburda , Albina Ilina , Aleksandr Rubashevskii , Preslav Nakov , Artem Shelmanov , Maxim Panov

Using Semantic Distance to Estimate Uncertainty in LLM-Based Code Generation

LLMs show strong performance in code generation, but their outputs lack correctness guarantees. Sample-based uncertainty estimators address this by generating multiple candidate programs and measuring their disagreement. However, existing…

Software Engineering · Computer Science 2026-05-12 Weilin He , Arindam Sharma , Cristina David

System-Level Uncertainty Quantification with Multiple Machine Learning Models: A Theoretical Framework

ML models have errors when used for predictions. The errors are unknown but can be quantified by model uncertainty. When multiple ML models are trained using the same training points, their model uncertainties may be statistically…

Machine Learning · Statistics 2025-09-23 Xiaoping Du

Look Before You Leap: An Exploratory Study of Uncertainty Measurement for Large Language Models

The recent performance leap of Large Language Models (LLMs) opens up new opportunities across numerous industrial applications and domains. However, erroneous generations, such as false predictions, misinformation, and hallucination made by…

Software Engineering · Computer Science 2025-01-07 Yuheng Huang , Jiayang Song , Zhijie Wang , Shengming Zhao , Huaming Chen , Felix Juefei-Xu , Lei Ma

Multi-group Uncertainty Quantification for Long-form Text Generation

While past works have shown how uncertainty quantification can be applied to large language model (LLM) outputs, the question of whether resulting uncertainty guarantees still hold within sub-groupings of data remains open. In our work,…

Computation and Language · Computer Science 2025-06-13 Terrance Liu , Zhiwei Steven Wu

From Data to Uncertainty Sets: a Machine Learning Approach

Existing approaches of prescriptive analytics -- where inputs of an optimization model can be predicted by leveraging covariates in a machine learning model -- often attempt to optimize the mean value of an uncertain objective. However,…

Machine Learning · Computer Science 2025-03-05 Dimitris Bertsimas , Benjamin Boucher

Uncertainty Profiles for LLMs: Uncertainty Source Decomposition and Adaptive Model-Metric Selection

Large language models (LLMs) often generate fluent but factually incorrect outputs, known as hallucinations, which undermine their reliability in real-world applications. While uncertainty estimation has emerged as a promising strategy for…

Machine Learning · Computer Science 2025-05-13 Pei-Fu Guo , Yun-Da Tsai , Shou-De Lin

Uncertainty Quantification for In-Context Learning of Large Language Models

In-context learning has emerged as a groundbreaking ability of Large Language Models (LLMs) and revolutionized various fields by providing a few task-relevant demonstrations in the prompt. However, trustworthy issues with LLM's response,…

Computation and Language · Computer Science 2024-04-01 Chen Ling , Xujiang Zhao , Xuchao Zhang , Wei Cheng , Yanchi Liu , Yiyou Sun , Mika Oishi , Takao Osaki , Katsushi Matsuda , Jie Ji , Guangji Bai , Liang Zhao , Haifeng Chen