English
Related papers

Related papers: Comparison-based Active Preference Learning for Mu…

200 papers

The advent of large language models (LLMs) has sparked significant interest in using natural language for preference learning. However, existing methods often suffer from high computational burdens, taxing human supervision, and lack of…

Machine Learning · Computer Science 2024-12-23 Saaduddin Mahmud , Mason Nakamura , Shlomo Zilberstein

Aligning large language models (LLMs) depends on high-quality datasets of human preference labels, which are costly to collect. Although active learning has been studied to improve sample efficiency relative to passive collection, many…

Machine Learning · Computer Science 2026-02-03 Yao Zhao , Kwang-Sung Jun

Preference-based feedback is important for many applications in machine learning where evaluation of a reward function is not feasible. Notable recent examples arise in preference alignment for large language models, including in…

Aligning large language models (LLMs) with human preferences has been recognized as the key to improving LLMs' interaction quality. However, in this pluralistic world, human preferences can be diversified due to annotators' different…

Artificial Intelligence · Computer Science 2024-10-08 Dun Zeng , Yong Dai , Pengyu Cheng , Longyue Wang , Tianhao Hu , Wanshun Chen , Nan Du , Zenglin Xu

Actively inferring user preferences, for example by asking good questions, is important for any human-facing decision-making system. Active inference allows such systems to adapt and personalize themselves to nuanced individual preferences.…

Computation and Language · Computer Science 2024-06-27 Wasu Top Piriyakulkij , Volodymyr Kuleshov , Kevin Ellis

The recent surge of versatile large language models (LLMs) largely depends on aligning increasingly capable foundation models with human intentions by preference learning, enhancing LLMs with excellent applicability and effectiveness in a…

Computation and Language · Computer Science 2024-06-19 Ruili Jiang , Kehai Chen , Xuefeng Bai , Zhixuan He , Juntao Li , Muyun Yang , Tiejun Zhao , Liqiang Nie , Min Zhang

A prominent issue in aligning language models (LMs) to personalized preferences is underspecification -- the lack of information from users about their preferences. A popular trend of injecting such specification is adding a prefix (e.g.…

Computation and Language · Computer Science 2025-09-30 Zilu Tang , Afra Feyza Akyürek , Ekin Akyürek , Derry Wijaya

Multi-preference optimization enriches language-model alignment beyond pairwise preferences by contrasting entire sets of helpful and undesired responses, thereby enabling richer training signals for large language models. During self-play…

Machine Learning · Computer Science 2025-06-10 Taneesh Gupta , Rahul Madhavan , Xuchao Zhang , Chetan Bansal , Saravan Rajmohan

The recent success in using human preferences to align large language models (LLMs) has significantly improved their performance in various downstream tasks, such as question answering, mathematical reasoning, and code generation. However,…

Machine Learning · Computer Science 2026-05-18 Xiaoqiang Lin , Arun Verma , Zhongxiang Dai , Daniela Rus , See-Kiong Ng , Bryan Kian Hsiang Low

Large language models (LLMs) have traditionally been aligned through one-size-fits-all approaches that assume uniform human preferences, fundamentally overlooking the diversity in user values and needs. This paper introduces a comprehensive…

Computation and Language · Computer Science 2025-05-23 Jia-Nan Li , Jian Guan , Songhao Wu , Wei Wu , Rui Yan

Personalized preference alignment for large language models (LLMs), the process of tailoring LLMs to individual users' preferences, is an emerging research direction spanning the area of NLP and personalization. In this survey, we present…

Large Language Model (LLM) personalization aims to align model behaviors with individual user preferences. Existing methods often focus on isolated user histories, neglecting the essential role of inter-user differences. We propose C-BPO, a…

Computation and Language · Computer Science 2026-05-12 Xilai Ma , Liye Zhao , Weijun Yao , Haibing Di , Wenya Wang , Jing Li

Large language models (LLMs) have demonstrated significant potential in solving recommendation tasks. With proven capabilities in understanding user preferences, LLM personalization has emerged as a critical area for providing tailored…

Information Retrieval · Computer Science 2025-11-04 Jiarui Chen

Pairwise preference data have played an important role in the alignment of large language models (LLMs). Each sample of such data consists of a prompt, two different responses to the prompt, and a binary label indicating which of the two…

Computation and Language · Computer Science 2026-05-12 Zhongze Cai , Xiaocheng Li

Large language model (LLM) personalization aims to align model outputs with individuals' unique preferences and opinions. While recent efforts have implemented various personalization methods, a unified theoretical framework that can…

Computation and Language · Computer Science 2025-09-30 Xinliang Frederick Zhang , Nick Beauchamp , Lu Wang

Large foundation models pretrained on raw web-scale data are not readily deployable without additional step of extensive alignment to human preferences. Such alignment is typically done by collecting large amounts of pairwise comparisons…

Machine Learning · Computer Science 2024-06-13 Daiwei Chen , Yi Chen , Aniket Rege , Ramya Korlakai Vinayak

Personalizing large language models (LLMs) is essential for delivering tailored interactions that improve user experience. Many existing personalization methods require fine-tuning LLMs for each user, rendering them prohibitively expensive…

Machine Learning · Computer Science 2025-03-06 Yijing Zhang , Dyah Adila , Changho Shin , Frederic Sala

As large language models (LLMs) demonstrate increasingly advanced capabilities, aligning their behaviors with human values and preferences becomes crucial for their wide adoption. While previous research focuses on general alignment to…

Computation and Language · Computer Science 2024-12-17 Shujin Wu , May Fung , Cheng Qian , Jeonghwan Kim , Dilek Hakkani-Tur , Heng Ji

Large Language Models (LLMs) acquire extensive knowledge and remarkable abilities from extensive text corpora, making them powerful tools for various applications. To make LLMs more usable, aligning them with human preferences is essential.…

Computation and Language · Computer Science 2024-10-21 Mozhi Zhang , Pengyu Wang , Chenkun Tan , Mianqiu Huang , Dong Zhang , Yaqian Zhou , Xipeng Qiu

Recent advancements in Large Language Models (LLMs) have been remarkable, with new models consistently surpassing their predecessors. These advancements are underpinned by extensive research on various training mechanisms. Among these,…

Computation and Language · Computer Science 2024-12-12 Hansle Gwon , Imjin Ahn , Young-Hak Kim , Sanghyun Park , Tae Joon Jun
‹ Prev 1 2 3 10 Next ›