Related papers: Comparison-based Active Preference Learning for Mu…

MAPLE: A Framework for Active Preference Learning Guided by Large Language Models

The advent of large language models (LLMs) has sparked significant interest in using natural language for preference learning. However, existing methods often suffer from high computational burdens, taxing human supervision, and lack of…

Machine Learning · Computer Science 2024-12-23 Saaduddin Mahmud , Mason Nakamura , Shlomo Zilberstein

Nearly Optimal Active Preference Learning and Its Application to LLM Alignment

Aligning large language models (LLMs) depends on high-quality datasets of human preference labels, which are costly to collect. Although active learning has been studied to improve sample efficiency relative to passive collection, many…

Machine Learning · Computer Science 2026-02-03 Yao Zhao , Kwang-Sung Jun

Sample Efficient Preference Alignment in LLMs via Active Exploration

Preference-based feedback is important for many applications in machine learning where evaluation of a reward function is not feasible. Notable recent examples arise in preference alignment for large language models, including in…

Machine Learning · Computer Science 2025-03-21 Viraj Mehta , Syrine Belakaria , Vikramjeet Das , Ojash Neopane , Yijia Dai , Ilija Bogunovic , Barbara Engelhardt , Stefano Ermon , Jeff Schneider , Willie Neiswanger

On Diversified Preferences of Large Language Model Alignment

Aligning large language models (LLMs) with human preferences has been recognized as the key to improving LLMs' interaction quality. However, in this pluralistic world, human preferences can be diversified due to annotators' different…

Artificial Intelligence · Computer Science 2024-10-08 Dun Zeng , Yong Dai , Pengyu Cheng , Longyue Wang , Tianhao Hu , Wanshun Chen , Nan Du , Zenglin Xu

Active Preference Inference using Language Models and Probabilistic Reasoning

Actively inferring user preferences, for example by asking good questions, is important for any human-facing decision-making system. Active inference allows such systems to adapt and personalize themselves to nuanced individual preferences.…

Computation and Language · Computer Science 2024-06-27 Wasu Top Piriyakulkij , Volodymyr Kuleshov , Kevin Ellis

A Survey on Human Preference Learning for Large Language Models

The recent surge of versatile large language models (LLMs) largely depends on aligning increasingly capable foundation models with human intentions by preference learning, enhancing LLMs with excellent applicability and effectiveness in a…

Computation and Language · Computer Science 2024-06-19 Ruili Jiang , Kehai Chen , Xuefeng Bai , Zhixuan He , Juntao Li , Muyun Yang , Tiejun Zhao , Liqiang Nie , Min Zhang

Is Active Persona Inference Necessary for Aligning Small Models to Personal Preferences?

A prominent issue in aligning language models (LMs) to personalized preferences is underspecification -- the lack of information from users about their preferences. A popular trend of injecting such specification is adding a prefix (e.g.…

Computation and Language · Computer Science 2025-09-30 Zilu Tang , Afra Feyza Akyürek , Ekin Akyürek , Derry Wijaya

AMPO: Active Multi-Preference Optimization for Self-play Preference Selection

Multi-preference optimization enriches language-model alignment beyond pairwise preferences by contrasting entire sets of helpful and undesired responses, thereby enabling richer training signals for large language models. During self-play…

Machine Learning · Computer Science 2025-06-10 Taneesh Gupta , Rahul Madhavan , Xuchao Zhang , Chetan Bansal , Saravan Rajmohan

ActiveDPO: Active Direct Preference Optimization for Sample-Efficient Alignment

The recent success in using human preferences to align large language models (LLMs) has significantly improved their performance in various downstream tasks, such as question answering, mathematical reasoning, and code generation. However,…

Machine Learning · Computer Science 2026-05-18 Xiaoqiang Lin , Arun Verma , Zhongxiang Dai , Daniela Rus , See-Kiong Ng , Bryan Kian Hsiang Low

From 1,000,000 Users to Every User: Scaling Up Personalized Preference for User-level Alignment

Large language models (LLMs) have traditionally been aligned through one-size-fits-all approaches that assume uniform human preferences, fundamentally overlooking the diversity in user values and needs. This paper introduces a comprehensive…

Computation and Language · Computer Science 2025-05-23 Jia-Nan Li , Jian Guan , Songhao Wu , Wei Wu , Rui Yan

A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models

Personalized preference alignment for large language models (LLMs), the process of tailoring LLMs to individual users' preferences, is an emerging research direction spanning the area of NLP and personalization. In this survey, we present…

Computation and Language · Computer Science 2025-04-10 Zhouhang Xie , Junda Wu , Yiran Shen , Yu Xia , Xintong Li , Aaron Chang , Ryan Rossi , Sachin Kumar , Bodhisattwa Prasad Majumder , Jingbo Shang , Prithviraj Ammanabrolu , Julian McAuley

Personalizing LLMs with Binary Feedback: A Preference-Corrected Optimization Framework

Large Language Model (LLM) personalization aims to align model behaviors with individual user preferences. Existing methods often focus on isolated user histories, neglecting the essential role of inter-user differences. We propose C-BPO, a…

Computation and Language · Computer Science 2026-05-12 Xilai Ma , Liye Zhao , Weijun Yao , Haibing Di , Wenya Wang , Jing Li

Memory Assisted LLM for Personalized Recommendation System

Large language models (LLMs) have demonstrated significant potential in solving recommendation tasks. With proven capabilities in understanding user preferences, LLM personalization has emerged as a critical area for providing tailored…

Information Retrieval · Computer Science 2025-11-04 Jiarui Chen

Users as Annotators: LLM Preference Learning from Comparison Mode

Pairwise preference data have played an important role in the alignment of large language models (LLMs). Each sample of such data consists of a prompt, two different responses to the prompt, and a binary label indicating which of the two…

Computation and Language · Computer Science 2026-05-12 Zhongze Cai , Xiaocheng Li

PRIME: Large Language Model Personalization with Cognitive Dual-Memory and Personalized Thought Process

Large language model (LLM) personalization aims to align model outputs with individuals' unique preferences and opinions. While recent efforts have implemented various personalization methods, a unified theoretical framework that can…

Computation and Language · Computer Science 2025-09-30 Xinliang Frederick Zhang , Nick Beauchamp , Lu Wang

PAL: Pluralistic Alignment Framework for Learning from Heterogeneous Preferences

Large foundation models pretrained on raw web-scale data are not readily deployable without additional step of extensive alignment to human preferences. Such alignment is typically done by collecting large amounts of pairwise comparisons…

Machine Learning · Computer Science 2024-06-13 Daiwei Chen , Yi Chen , Aniket Rege , Ramya Korlakai Vinayak

Personalize Your LLM: Fake it then Align it

Personalizing large language models (LLMs) is essential for delivering tailored interactions that improve user experience. Many existing personalization methods require fine-tuning LLMs for each user, rendering them prohibitively expensive…

Machine Learning · Computer Science 2025-03-06 Yijing Zhang , Dyah Adila , Changho Shin , Frederic Sala

Aligning LLMs with Individual Preferences via Interaction

As large language models (LLMs) demonstrate increasingly advanced capabilities, aligning their behaviors with human values and preferences becomes crucial for their wide adoption. While previous research focuses on general alignment to…

Computation and Language · Computer Science 2024-12-17 Shujin Wu , May Fung , Cheng Qian , Jeonghwan Kim , Dilek Hakkani-Tur , Heng Ji

MetaAlign: Align Large Language Models with Diverse Preferences during Inference Time

Large Language Models (LLMs) acquire extensive knowledge and remarkable abilities from extensive text corpora, making them powerful tools for various applications. To make LLMs more usable, aligning them with human preferences is essential.…

Computation and Language · Computer Science 2024-10-21 Mozhi Zhang , Pengyu Wang , Chenkun Tan , Mianqiu Huang , Dong Zhang , Yaqian Zhou , Xipeng Qiu

Multi-Response Preference Optimization with Augmented Ranking Dataset

Recent advancements in Large Language Models (LLMs) have been remarkable, with new models consistently surpassing their predecessors. These advancements are underpinned by extensive research on various training mechanisms. Among these,…

Computation and Language · Computer Science 2024-12-12 Hansle Gwon , Imjin Ahn , Young-Hak Kim , Sanghyun Park , Tae Joon Jun