English
Related papers

Related papers: Uncertainty Aware Learning for Language Model Alig…

200 papers

Instead of randomly acquiring training data points, Uncertainty-based Active Learning (UAL) operates by querying the label(s) of pivotal samples from an unlabeled pool selected based on the prediction uncertainty, thereby aiming at…

Machine Learning · Computer Science 2024-08-27 Amir Hossein Rahmati , Mingzhou Fan , Ruida Zhou , Nathan M. Urban , Byung-Jun Yoon , Xiaoning Qian

In recent years, large-scale language models (LLMs) have gained attention for their impressive text generation capabilities. However, these models often face the challenge of "hallucination," which undermines their reliability. In this…

Computation and Language · Computer Science 2023-10-10 Yuchen Yang , Houqiang Li , Yanfeng Wang , Yu Wang

Learning against label noise is a vital topic to guarantee a reliable performance for deep neural networks. Recent research usually refers to dynamic noise modeling with model output probabilities and loss values, and then separates clean…

Machine Learning · Statistics 2022-07-13 Yingsong Huang , Bing Bai , Shengwei Zhao , Kun Bai , Fei Wang

Current person image retrieval methods have achieved great improvements in accuracy metrics. However, they rarely describe the reliability of the prediction. In this paper, we propose an Uncertainty-Aware Learning (UAL) method to remedy…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Zhaopeng Dou , Zhongdao Wang , Weihua Chen , Yali Li , Shengjin Wang

Large language models (LLMs) are increasingly used in decision-making contexts, but when they present answers without signaling low confidence, users may unknowingly act on erroneous outputs. Prior work shows that LLMs maintain internal…

Computation and Language · Computer Science 2025-10-23 Mark Steyvers , Catarina Belem , Padhraic Smyth

Despite demonstrating impressive capabilities, Large Language Models (LLMs) still often struggle to accurately express the factual knowledge they possess, especially in cases where the LLMs' knowledge boundaries are ambiguous. To improve…

Computation and Language · Computer Science 2025-05-26 Boyang Xue , Fei Mi , Qi Zhu , Hongru Wang , Rui Wang , Sheng Wang , Erxin Yu , Xuming Hu , Kam-Fai Wong

Large Language Models (LLMs) have been widely employed in programming language analysis to enhance human productivity. Yet, their reliability can be compromised by various code distribution shifts, leading to inconsistent outputs. While…

Software Engineering · Computer Science 2024-02-12 Yufei Li , Simin Chen , Yanghong Guo , Wei Yang , Yue Dong , Cong Liu

Reward models (RMs) are essential for aligning large language models (LLM) with human expectations. However, existing RMs struggle to capture the stochastic and uncertain nature of human preferences and fail to assess the reliability of…

Machine Learning · Computer Science 2025-02-13 Xingzhou Lou , Dong Yan , Wei Shen , Yuzi Yan , Jian Xie , Junge Zhang

Reliable Large Language Models (LLMs) should abstain when confidence is insufficient. However, prior studies often treat refusal as a generic "I don't know'', failing to distinguish input-level ambiguity (data uncertainty) from capability…

Computation and Language · Computer Science 2026-04-21 Jingyi Ren , Ante Wang , Yunghwei Lai , Xiaolong Wang , Linlu Gong , Weitao Li , Weizhi Ma , Yang Liu

Large language models (LLMs) have revolutionized the field of natural language processing with their impressive reasoning and question-answering capabilities. However, these models are sometimes prone to generating credible-sounding but…

Computation and Language · Computer Science 2026-04-21 Ranganath Krishnan , Piyush Khanna , Omesh Tickoo

While pre-trained language model (PLM) fine-tuning has achieved strong performance in many NLP tasks, the fine-tuning stage can be still demanding in labeled data. Recent works have resorted to active fine-tuning to improve the label…

Computation and Language · Computer Science 2022-05-04 Yue Yu , Lingkai Kong , Jieyu Zhang , Rongzhi Zhang , Chao Zhang

Active Learning (AL) has emerged as a powerful approach for minimizing labeling costs by selectively sampling the most informative data for neural network model development. Effective AL for large-scale vision-language models necessitates…

Computer Vision and Pattern Recognition · Computer Science 2025-07-30 Athmanarayanan Lakshmi Narayanan , Amrutha Machireddy , Ranganath Krishnan

Active Learning (AL) addresses the crucial challenge of enabling machines to efficiently gather labeled examples through strategic queries. Among the many AL strategies, Uncertainty Sampling (US) stands out as one of the most widely…

Machine Learning · Computer Science 2025-06-24 Po-Yi Lu , Yi-Jie Cheng , Chun-Liang Li , Hsuan-Tien Lin

Large language models (LLMs) demonstrate remarkable capabilities but face challenges from hallucinations, which typically arise from insufficient knowledge or context. While instructing LLMs to acknowledge knowledge limitations by…

Computation and Language · Computer Science 2025-08-08 Jiaqi Li , Yixuan Tang , Yi Yang

Large Language Models (LLMs) are known to hallucinate and generate non-factual outputs which can undermine user trust. Traditional methods to directly mitigate hallucinations, such as representation editing and contrastive decoding, often…

Machine Learning · Computer Science 2025-03-11 Prasenjit Dey , Srujana Merugu , Sivaramakrishnan Kaveri

Alignment of large language models (LLMs) typically involves training a reward model on preference data, followed by policy optimization with respect to the reward model. However, optimizing policies with respect to a single reward model…

Machine Learning · Computer Science 2025-07-23 Debangshu Banerjee , Kintan Saha , Aditya Gopalan

Large language models (LLMs) remain unreliable for global enterprise applications due to substantial performance gaps between high-resource and mid/low-resource languages, driven by English-centric pretraining and internal reasoning biases.…

Computation and Language · Computer Science 2025-10-28 Amit Agarwal , Hansa Meghwani , Hitesh Laxmichand Patel , Tao Sheng , Sujith Ravi , Dan Roth

While large language models (LLMs) demonstrate strong capabilities across diverse user queries, they still suffer from hallucinations, often arising from knowledge misalignment between pre-training and fine-tuning. To address this…

Computation and Language · Computer Science 2026-04-08 Joosung Lee , Hwiyeol Jo , Donghyeon Ko , Kyubyung Chae , Cheonbok Park , Jeonghoon Kim

AI deployed in many real-world use cases should be capable of adapting to novelties encountered after deployment. Here, we consider a challenging, under-explored and realistic continual adaptation problem: a deployed AI agent is…

Machine Learning · Computer Science 2024-12-16 Amanda Rios , Ibrahima Ndiour , Parual Datta , Jerry Sydir , Omesh Tickoo , Nilesh Ahuja

Large language models (LLMs) are increasingly employed in information-seeking and decision-making tasks. Despite their broad utility, LLMs tend to generate information that conflicts with real-world facts, and their persuasive style can…

Computation and Language · Computer Science 2024-09-19 Arslan Chaudhry , Sridhar Thiagarajan , Dilan Gorur
‹ Prev 1 2 3 10 Next ›