Related papers: Personalizing LLMs with Binary Feedback: A Prefere…

Binary Classifier Optimization for Large Language Model Alignment

In real-world services such as ChatGPT, aligning models based on user feedback is crucial for improving model performance. However, due to the simplicity and convenience of providing feedback, users typically offer only basic binary…

Machine Learning · Computer Science 2025-06-10 Seungjae Jung, Gunsoo Han, Daniel Wontae Nam, Kyoung-Woon On

BAPO: Base-Anchored Preference Optimization for Overcoming Forgetting in Large Language Models Personalization

While learning to align Large Language Models (LLMs) with human preferences has shown remarkable success, aligning these models to meet the diverse user preferences presents further challenges in preserving previous knowledge. This paper…

Artificial Intelligence · Computer Science 2024-10-01 Gihun Lee, Minchan Jeong, Yujin Kim, Hojung Jung, Jaehoon Oh, Sangmook Kim, Se-Young Yun

Unified Preference Optimization: Language Model Alignment Beyond the Preference Frontier

For aligning large language models (LLMs), prior work has leveraged reinforcement learning via human feedback (RLHF) or variations of direct preference optimization (DPO). While DPO offers a simpler framework based on maximum likelihood…

Artificial Intelligence · Computer Science 2025-05-27 Anirudhan Badrinath, Prabhat Agarwal, Jiajing Xu

Beyond the Binary: Capturing Diverse Preferences With Reward Regularization

Large language models (LLMs) are increasingly deployed via public-facing interfaces to interact with millions of users, each with diverse preferences. Despite this, preference tuning of LLMs predominantly relies on reward models trained…

Computation and Language · Computer Science 2024-12-06 Vishakh Padmakumar, Chuanyang Jin, Hannah Rose Kirk, He He

Reflective Personalization Optimization: A Post-hoc Rewriting Framework for Black-Box Large Language Models

The personalization of black-box large language models (LLMs) is a critical yet challenging task. Existing approaches predominantly rely on context injection, where user history is embedded into the prompt to directly guide the generation…

Computation and Language · Computer Science 2025-11-10 Teqi Hao, Xioayu Tan, Shaojie Shi, Yinghui Xu, Xihe Qiu

Comparing Bad Apples to Good Oranges: Aligning Large Language Models via Joint Preference Optimization

A common technique for aligning large language models (LLMs) relies on acquiring human preferences by comparing multiple generations conditioned on a fixed context. This method, however, relies solely on pairwise comparisons, where the…

Computation and Language · Computer Science 2025-01-09 Hritik Bansal, Ashima Suvarna, Gantavya Bhatt, Nanyun Peng, Kai-Wei Chang, Aditya Grover

Personalisation within bounds: A risk taxonomy and policy framework for the alignment of large language models with personalised feedback

Large language models (LLMs) are used to generate content for a wide range of tasks, and are set to reach a growing audience in coming years due to integration in product interfaces like ChatGPT or search engines like Bing. This intensifies…

Computation and Language · Computer Science 2023-03-10 Hannah Rose Kirk, Bertie Vidgen, Paul Röttger, Scott A. Hale

What Makes LLMs Effective Sequential Recommenders? A Study on Preference Intensity and Temporal Context

What enables large language models (LLMs) to effectively model user preferences in sequential recommendation? Our investigation reveals that existing preference-alignment approaches largely rely on binary pairwise comparisons, overlooking…

Information Retrieval · Computer Science 2026-04-20 Zhongyu Ouyang, Qianlong Wen, Chunhui Zhang, Yanfang Ye, Soroush Vosoughi

Optimizing Language Models for Human Preferences is a Causal Inference Problem

As large language models (LLMs) see greater use in academic and commercial settings, there is increasing interest in methods that allow language models to generate texts aligned with human preferences. In this paper, we present an initial…

Machine Learning · Computer Science 2024-06-07 Victoria Lin, Eli Ben-Michael, Louis-Philippe Morency

Beyond Pairwise: Empowering LLM Alignment With Ranked Choice Modeling

Alignment of large language models (LLMs) has predominantly relied on pairwise preference optimization, where annotators select the better of two responses to a prompt. While simple, this approach overlooks the opportunity to learn from…

Machine Learning · Computer Science 2026-02-11 Yuxuan Tang, Yifan Feng

ComPO: Community Preferences for Language Model Personalization

Conventional algorithms for training language models (LMs) with human feedback rely on preferences that are assumed to account for an "average" user, disregarding subjectivity and finer-grained variations. Recent studies have raised…

Computation and Language · Computer Science 2024-10-22 Sachin Kumar, Chan Young Park, Yulia Tsvetkov, Noah A. Smith, Hannaneh Hajishirzi

A Survey on Personalized and Pluralistic Preference Alignment in Large Language Models

Personalized preference alignment for large language models (LLMs), the process of tailoring LLMs to individual users' preferences, is an emerging research direction spanning the area of NLP and personalization. In this survey, we present…

Computation and Language · Computer Science 2025-04-10 Zhouhang Xie, Junda Wu, Yiran Shen, Yu Xia, Xintong Li, Aaron Chang, Ryan Rossi, Sachin Kumar, Bodhisattwa Prasad Majumder, Jingbo Shang, Prithviraj Ammanabrolu, Julian McAuley

Bootstrapping LLMs via Preference-Based Policy Optimization

Bootstrapping large language models (LLMs) through preference-based policy optimization offers a promising direction for aligning model behavior with human preferences without relying on extensive manual annotations. In this work, we…

Artificial Intelligence · Computer Science 2025-12-25 Chen Jia

One Adapts to Any: Meta Reward Modeling for Personalized LLM Alignment

Alignment of Large Language Models (LLMs) aims to align outputs with human preferences, and personalized alignment further adapts models to individual users. This relies on personalized reward models that capture user-specific preferences…

Computation and Language · Computer Science 2026-04-21 Hongru Cai, Yongqi Li, Tiezheng Yu, Fengbin Zhu, Wenjie Wang, Fuli Feng, Wenjie Li

Multi-Reference Preference Optimization for Large Language Models

How can Large Language Models (LLMs) be aligned with human intentions and values? A typical solution is to gather human preference on model outputs and finetune the LLMs accordingly while ensuring that updates do not deviate too far from a…

Computation and Language · Computer Science 2024-05-28 Hung Le, Quan Tran, Dung Nguyen, Kien Do, Saloni Mittal, Kelechi Ogueji, Svetha Venkatesh

LoRe: Personalizing LLMs via Low-Rank Reward Modeling

Personalizing large language models (LLMs) to accommodate diverse user preferences is essential for enhancing alignment and user satisfaction. Traditional reinforcement learning from human feedback (RLHF) approaches often rely on monolithic…

Machine Learning · Computer Science 2025-04-22 Avinandan Bose, Zhihan Xiong, Yuejie Chi, Simon Shaolei Du, Lin Xiao, Maryam Fazel

Strengthening Multimodal Large Language Model with Bootstrapped Preference Optimization

Multimodal Large Language Models (MLLMs) excel in generating responses based on visual inputs. However, they often suffer from a bias towards generating responses similar to their pretraining corpus, overshadowing the importance of visual…

Computation and Language · Computer Science 2024-04-04 Renjie Pi, Tianyang Han, Wei Xiong, Jipeng Zhang, Runtao Liu, Rui Pan, Tong Zhang

InfoPO: On Mutual Information Maximization for Large Language Model Alignment

We study the post-training of large language models (LLMs) with human preference data. Recently, direct preference optimization and its variants have shown considerable promise in aligning language models, eliminating the need for reward…

Machine Learning · Computer Science 2025-05-14 Teng Xiao, Zhen Ge, Sujay Sanghavi, Tian Wang, Julian Katz-Samuels, Marc Versage, Qingjun Cui, Trishul Chilimbi

Personalized Group Relative Policy Optimization for Heterogenous Preference Alignment

Despite their sophisticated general-purpose capabilities, Large Language Models (LLMs) often fail to align with diverse individual preferences because standard post-training methods, like Reinforcement Learning with Human Feedback (RLHF),…

Machine Learning · Computer Science 2026-03-12 Jialu Wang, Heinrich Peters, Asad A. Butt, Navid Hashemi, Alireza Hashemi, Pouya M. Ghari, Joseph Hoover, James Rae, Morteza Dehghani

Optimizing LLMs with Direct Preferences: A Data Efficiency Perspective

Aligning the output of Large Language Models (LLMs) with human preferences (e.g., by means of reinforcement learning with human feedback, or RLHF) is essential for ensuring their effectiveness in real-world scenarios. Despite significant…

Artificial Intelligence · Computer Science 2024-10-23 Pietro Bernardelle, Gianluca Demartini