Related papers: Optimizing Alignment with Less: Leveraging Data Au…

Empowering Large Language Models for Textual Data Augmentation

With the capabilities of understanding and executing natural language instructions, Large language models (LLMs) can potentially act as a powerful tool for textual data augmentation. However, the quality of augmented data depends heavily on…

Computation and Language · Computer Science 2024-04-30 Yichuan Li , Kaize Ding , Jianling Wang , Kyumin Lee

Aligning Black-box Language Models with Human Judgments

Large language models (LLMs) are increasingly used as automated judges to evaluate recommendation systems, search engines, and other subjective tasks, where relying on human evaluators can be costly, time-consuming, and unscalable. LLMs…

Computation and Language · Computer Science 2025-02-10 Gerrit J. J. van den Burg , Gen Suzuki , Wei Liu , Murat Sensoy

Can LLM be a Personalized Judge?

Ensuring that large language models (LLMs) reflect diverse user values and preferences is crucial as their user bases expand globally. It is therefore encouraging to see the growing interest in LLM personalization within the research…

Computation and Language · Computer Science 2024-06-18 Yijiang River Dong , Tiancheng Hu , Nigel Collier

Reward-Augmented Data Enhances Direct Preference Alignment of LLMs

Preference alignment in Large Language Models (LLMs) has significantly improved their ability to adhere to human instructions and intentions. However, existing direct alignment algorithms primarily focus on relative preferences and often…

Machine Learning · Computer Science 2025-05-13 Shenao Zhang , Zhihan Liu , Boyi Liu , Yufeng Zhang , Yingxiang Yang , Yongfei Liu , Liyu Chen , Tao Sun , Zhaoran Wang

Diversity-oriented Data Augmentation with Large Language Models

Data augmentation is an essential technique in natural language processing (NLP) for enriching training datasets by generating diverse samples. This process is crucial for improving the robustness and generalization capabilities of NLP…

Computation and Language · Computer Science 2025-10-16 Zaitian Wang , Jinghan Zhang , Xinhao Zhang , Kunpeng Liu , Pengfei Wang , Yuanchun Zhou

Leveraging LLMs as Meta-Judges: A Multi-Agent Framework for Evaluating LLM Judgments

Large language models (LLMs) are being widely applied across various fields, but as tasks become more complex, evaluating their responses is increasingly challenging. Compared to human evaluators, the use of LLMs to support performance…

Artificial Intelligence · Computer Science 2025-04-25 Yuran Li , Jama Hussein Mohamud , Chongren Sun , Di Wu , Benoit Boulet

Adaptive Augmentation Policy Optimization with LLM Feedback

Data augmentation is a critical component of deep learning pipelines, enhancing model generalization by increasing dataset diversity. Traditional augmentation strategies rely on manually designed transformations, stochastic sampling, or…

Computer Vision and Pattern Recognition · Computer Science 2025-08-06 Ant Duru , Alptekin Temizel

A Survey on Personalized Alignment -- The Missing Piece for Large Language Models in Real-World Applications

Large Language Models (LLMs) have demonstrated remarkable capabilities, yet their transition to real-world applications reveals a critical limitation: the inability to adapt to individual preferences while maintaining alignment with…

Computation and Language · Computer Science 2025-05-06 Jian Guan , Junfei Wu , Jia-Nan Li , Chuanqi Cheng , Wei Wu

From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level Alignment

Large language models (LLMs) have traditionally been aligned through one-size-fits-all approaches that assume uniform human preferences, fundamentally overlooking the diversity in user values and needs. This paper introduces a comprehensive…

Computation and Language · Computer Science 2025-05-23 Jia-Nan Li , Jian Guan , Songhao Wu , Wei Wu , Rui Yan

Refine-n-Judge: Curating High-Quality Preference Chains for LLM-Fine-Tuning

Large Language Models (LLMs) have demonstrated remarkable progress through preference-based fine-tuning, which critically depends on the quality of the underlying training data. While human feedback is essential for improving data quality,…

Artificial Intelligence · Computer Science 2025-10-31 Derin Cayir , Renjie Tao , Rashi Rungta , Kai Sun , Sean Chen , Haidar Khan , Minseok Kim , Julia Reinspach , Yue Liu

Meta-Rewarding Language Models: Self-Improving Alignment with LLM-as-a-Meta-Judge

Large Language Models (LLMs) are rapidly surpassing human knowledge in many domains. While improving these models traditionally relies on costly human data, recent self-rewarding mechanisms (Yuan et al., 2024) have shown that LLMs can…

Computation and Language · Computer Science 2024-07-31 Tianhao Wu , Weizhe Yuan , Olga Golovneva , Jing Xu , Yuandong Tian , Jiantao Jiao , Jason Weston , Sainbayar Sukhbaatar

Learning Personalized Alignment for Evaluating Open-ended Text Generation

Recent research has increasingly focused on evaluating large language models' (LLMs) alignment with diverse human values and preferences, particularly for open-ended tasks like story generation. Traditional evaluation metrics rely heavily…

Computation and Language · Computer Science 2024-10-07 Danqing Wang , Kevin Yang , Hanlin Zhu , Xiaomeng Yang , Andrew Cohen , Lei Li , Yuandong Tian

Direct Judgement Preference Optimization

Auto-evaluation is crucial for assessing response quality and offering feedback for model development. Recent studies have explored training large language models (LLMs) as generative judges to evaluate and critique other models' outputs.…

Computation and Language · Computer Science 2025-09-15 Peifeng Wang , Austin Xu , Yilun Zhou , Caiming Xiong , Shafiq Joty

Adaptation with Self-Evaluation to Improve Selective Prediction in LLMs

Large language models (LLMs) have recently shown great advances in a variety of tasks, including natural language understanding and generation. However, their use in high-stakes decision-making scenarios is still limited due to the…

Computation and Language · Computer Science 2023-11-14 Jiefeng Chen , Jinsung Yoon , Sayna Ebrahimi , Sercan O Arik , Tomas Pfister , Somesh Jha

Optimization Methods for Personalizing Large Language Models through Retrieval Augmentation

This paper studies retrieval-augmented approaches for personalizing large language models (LLMs), which potentially have a substantial impact on various applications and domains. We propose the first attempt to optimize the retrieval models…

Computation and Language · Computer Science 2024-04-19 Alireza Salemi , Surya Kallumadi , Hamed Zamani

A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models

Personalized preference alignment for large language models (LLMs), the process of tailoring LLMs to individual users' preferences, is an emerging research direction spanning the area of NLP and personalization. In this survey, we present…

Computation and Language · Computer Science 2025-04-10 Zhouhang Xie , Junda Wu , Yiran Shen , Yu Xia , Xintong Li , Aaron Chang , Ryan Rossi , Sachin Kumar , Bodhisattwa Prasad Majumder , Jingbo Shang , Prithviraj Ammanabrolu , Julian McAuley

Data Selection for LLM Alignment Using Fine-Grained Preferences

Large language models (LLMs) alignment aims to ensure that the behavior of LLMs meets human preferences. While collecting data from multiple fine-grained, aspect-specific preferences becomes more and more feasible, existing alignment…

Machine Learning · Computer Science 2026-03-03 Jia Zhang , Yao Liu , Chen-Xi Zhang , Yi Liu , Yi-Xuan Jin , Lan-Zhe Guo , Yu-Feng Li

The Effects of Data Augmentation on Confidence Estimation for LLMs

Confidence estimation is crucial for reflecting the reliability of large language models (LLMs), particularly in the widely used closed-source models. Utilizing data augmentation for confidence estimation is viable, but discussions focus on…

Machine Learning · Computer Science 2025-06-16 Rui Wang , Renyu Zhu , Minmin Lin , Runze Wu , Tangjie Lv , Changjie Fan , Haobo Wang

Large Language Models for Healthcare Data Augmentation: An Example on Patient-Trial Matching

The process of matching patients with suitable clinical trials is essential for advancing medical research and providing optimal care. However, current approaches face challenges such as data standardization, ethical considerations, and a…

Computation and Language · Computer Science 2023-08-08 Jiayi Yuan , Ruixiang Tang , Xiaoqian Jiang , Xia Hu

Selecting User Histories to Generate LLM Users for Cold-Start Item Recommendation

Large Language Models (LLMs) have demonstrated remarkable capabilities in reasoning, generalization, and simulating human-like behavior across a wide range of tasks. These strengths present new opportunities to enhance traditional…

Information Retrieval · Computer Science 2025-12-01 Nachiket Subbaraman , Jaskinder Sarai , Aniruddh Nath , Lichan Hong , Lukasz Heldt , Li Wei , Zhe Zhao