Related papers: Uncertainty Quantification for LLM-based Code Gene…

200 papers

Large Language Models (LLMs) are known to produce very high-quality tests and responses to our queries. But how much can we trust this generated text? In this paper, we study the problem of uncertainty quantification in LLMs. We propose a…

Computation and Language · Computer Science 2025-04-28 Muhammad Mubashar, Shireen Kudukkil Manchingal, Fabio Cuzzolin

Prediction sets have recently been shown to be a promising strategy for quantifying the uncertainty of deep neural networks in a way that provides theoretical guarantees. However, existing techniques have largely targeted settings where the…

Machine Learning · Computer Science 2023-06-22 Adam Khakhar, Stephen Mell, Osbert Bastani

In this work, we explore uncertainty estimation as a proxy for correctness in LLM-generated code. To this end, we adapt two state-of-the-art techniques from natural language generation -- one based on entropy and another on mutual…

Software Engineering · Computer Science 2025-07-02 Arindam Sharma, Cristina David

Chain-of-Thought (CoT) reasoning has been demonstrated as an effective technique for improving the problem-solving capabilities of large language models (LLMs) in the context of code generation. However, existing CoT methods often exhibit a…

Software Engineering · Computer Science 2025-03-20 Yuqi Zhu, Ge Li, Xue Jiang, Jia Li, Hong Mei, Zhi Jin, Yihong Dong

Large language models (LLMs) produce outputs with varying levels of uncertainty, and, just as often, varying levels of correctness; making their practical reliability far from guaranteed. To quantify this uncertainty, we systematically…

Computation and Language · Computer Science 2025-10-24 Christian Hobelsberger, Theresa Winner, Andreas Nawroth, Oliver Mitevski, Anna-Carolina Haensch

Large language models (LLMs) specializing in natural language generation (NLG) have recently started exhibiting promising capabilities across a variety of domains. However, gauging the trustworthiness of responses generated by LLMs remains…

Computation and Language · Computer Science 2024-05-21 Zhen Lin, Shubhendu Trivedi, Jimeng Sun

Despite the widespread adoption of large language models (LLMs) for recommendation, we demonstrate that LLMs often exhibit uncertainty in their recommendations. To ensure the trustworthy use of LLMs in generating recommendations, we…

Information Retrieval · Computer Science 2025-02-13 Wonbin Kweon, Sanghwan Jang, SeongKu Kang, Hwanjo Yu

Large Reasoning Models (LRMs) have recently demonstrated significant improvements in complex reasoning. While quantifying generation uncertainty in LRMs is crucial, traditional methods are often insufficient because they do not provide…

Artificial Intelligence · Computer Science 2026-04-16 Yangyi Li, Chenxu Zhao, Mengdi Huai

Large Language Models (LLMs) excel in text generation, reasoning, and decision-making, enabling their adoption in high-stakes domains such as healthcare, law, and transportation. However, their reliability is a major concern, as they often…

Computation and Language · Computer Science 2025-06-05 Xiaoou Liu, Tiejin Chen, Longchao Da, Chacha Chen, Zhen Lin, Hua Wei

Large Language Models (LLMs) are increasingly employed in real-world applications, driving the need to evaluate the trustworthiness of their generated text. To this end, reliable uncertainty estimation is essential. Leading uncertainty…

Machine Learning · Computer Science 2026-04-21 Lukas Aichberger, Kajetan Schweighofer, Sepp Hochreiter

As large language models (LLMs) continue to evolve, understanding and quantifying the uncertainty in their predictions is critical for enhancing application credibility. However, the existing literature relevant to LLM uncertainty…

Computation and Language · Computer Science 2024-10-22 Hsiu-Yuan Huang, Yutong Yang, Zhaoxi Zhang, Sanwoo Lee, Yunfang Wu

Large language models (LLMs) are notorious for hallucinating, i.e., producing erroneous claims in their output. Such hallucinations can be dangerous, as occasional factual inaccuracies in the generated text might be obscured by the rest of…

Uncertainty quantification (UQ) methods for Large Language Models (LLMs) encompass a variety of approaches, with two major types being particularly prominent: information-based, which focus on model confidence expressed as token…

Computation and Language · Computer Science 2025-12-10 Roman Vashurin, Maiya Goloburda, Albina Ilina, Aleksandr Rubashevskii, Preslav Nakov, Artem Shelmanov, Maxim Panov

LLMs show strong performance in code generation, but their outputs lack correctness guarantees. Sample-based uncertainty estimators address this by generating multiple candidate programs and measuring their disagreement. However, existing…

Software Engineering · Computer Science 2026-05-12 Weilin He, Arindam Sharma, Cristina David

ML models have errors when used for predictions. The errors are unknown but can be quantified by model uncertainty. When multiple ML models are trained using the same training points, their model uncertainties may be statistically…

Machine Learning · Statistics 2025-09-23 Xiaoping Du

The recent performance leap of Large Language Models (LLMs) opens up new opportunities across numerous industrial applications and domains. However, erroneous generations, such as false predictions, misinformation, and hallucination made by…

Software Engineering · Computer Science 2025-01-07 Yuheng Huang, Jiayang Song, Zhijie Wang, Shengming Zhao, Huaming Chen, Felix Juefei-Xu, Lei Ma

While past works have shown how uncertainty quantification can be applied to large language model (LLM) outputs, the question of whether resulting uncertainty guarantees still hold within sub-groupings of data remains open. In our work,…

Computation and Language · Computer Science 2025-06-13 Terrance Liu, Zhiwei Steven Wu

Existing approaches of prescriptive analytics -- where inputs of an optimization model can be predicted by leveraging covariates in a machine learning model -- often attempt to optimize the mean value of an uncertain objective. However,…

Machine Learning · Computer Science 2025-03-05 Dimitris Bertsimas, Benjamin Boucher

Large language models (LLMs) often generate fluent but factually incorrect outputs, known as hallucinations, which undermine their reliability in real-world applications. While uncertainty estimation has emerged as a promising strategy for…

Machine Learning · Computer Science 2025-05-13 Pei-Fu Guo, Yun-Da Tsai, Shou-De Lin

In-context learning has emerged as a groundbreaking ability of Large Language Models (LLMs) and revolutionized various fields by providing a few task-relevant demonstrations in the prompt. However, trustworthy issues with LLM's response,…

Computation and Language · Computer Science 2024-04-01 Chen Ling, Xujiang Zhao, Xuchao Zhang, Wei Cheng, Yanchi Liu, Yiyou Sun, Mika Oishi, Takao Osaki, Katsushi Matsuda, Jie Ji, Guangji Bai, Liang Zhao, Haifeng Chen