Related papers: Sequential Preference-Based Optimization

An Expandable Machine Learning-Optimization Framework to Sequential Decision-Making

We present an integrated prediction-optimization (PredOpt) framework to efficiently solve sequential decision-making problems by predicting the values of binary decision variables in an optimal solution. We address the key issues of…

Machine Learning · Computer Science 2023-11-14 Dogacan Yilmaz , İ. Esra Büyüktahtakın

From Preference-Based to Multiobjective Sequential Decision-Making

In this paper, we present a link between preference-based and multiobjective sequential decision-making. While transforming a multiobjective problem to a preference-based one is quite natural, the other direction is a bit less obvious. We…

Artificial Intelligence · Computer Science 2017-01-04 Paul Weng

iPrOp: Interactive Prompt Optimization for Large Language Models with a Human in the Loop

Prompt engineering has made significant contributions to the era of large language models, yet its effectiveness depends on the skills of a prompt author. This paper introduces $\textit{iPrOp}$, a novel interactive prompt optimization…

Computation and Language · Computer Science 2025-06-30 Jiahui Li , Roman Klinger

Consecutive Preferential Bayesian Optimization

Preferential Bayesian optimization allows optimization of objectives that are either expensive or difficult to measure directly, by relying on a minimal number of comparative evaluations done by a human expert. Generating candidate…

Machine Learning · Computer Science 2025-11-10 Aras Erarslan , Carlos Sevilla Salcedo , Ville Tanskanen , Anni Nisov , Eero Päiväkumpu , Heikki Aisala , Kaisu Honkapää , Arto Klami , Petrus Mikkola

Human-in-the-loop: Real-time Preference Optimization

Optimization with preference feedback is an active research area with many applications in engineering systems where humans play a central role, such as building control and autonomous vehicles. While most existing studies focus on…

Optimization and Control · Mathematics 2026-03-31 Wenbin Wang , Wenjie Xu , Colin N. Jones

PrefPO: Pairwise Preference Prompt Optimization

Prompt engineering is effective but labor-intensive, motivating automated optimization methods. Existing methods typically require labeled datasets, which are often unavailable, and produce verbose, repetitive prompts. We introduce PrefPO,…

Computation and Language · Computer Science 2026-03-26 Rahul Singhal , Pradyumna Tambwekar , Karime Maamari

Online Learning with Preference Feedback

We propose a new online learning model for learning with preference feedback. The model is especially suited for applications like web search and recommender systems, where preference data is readily available from implicit user feedback…

Machine Learning · Computer Science 2011-11-04 Pannagadatta K. Shivaswamy , Thorsten Joachims

BayesOpt: A Library for Bayesian optimization with Robotics Applications

The purpose of this paper is twofold. On one side, we present a general framework for Bayesian optimization and we compare it with some related fields in active learning and Bayesian numerical analysis. On the other hand, Bayesian…

Robotics · Computer Science 2018-02-13 Ruben Martinez-Cantin

Contextual Preference Distribution Learning

Decision-making problems often feature uncertainty stemming from heterogeneous and context-dependent human preferences. To address this, we propose a sequential learning-and-optimization pipeline to learn preference distributions and…

Machine Learning · Computer Science 2026-03-19 Benjamin Hudson , Laurent Charlin , Emma Frejinger

Sequential Recommendation with User Evolving Preference Decomposition

Modeling user sequential behaviors has recently attracted increasing attention in the recommendation domain. Existing methods mostly assume coherent preference in the same sequence. However, user personalities are volatile and easily…

Information Retrieval · Computer Science 2022-04-01 Weiqi Shao , Xu Chen , Long Xia , Jiashu Zhao , Dawei Yin

SPO: Multi-Dimensional Preference Sequential Alignment With Implicit Reward Modeling

Human preference alignment is critical in building powerful and reliable large language models (LLMs). However, current methods either ignore the multi-dimensionality of human preferences (e.g. helpfulness and harmlessness) or struggle with…

Machine Learning · Computer Science 2024-10-14 Xingzhou Lou , Junge Zhang , Jian Xie , Lifeng Liu , Dong Yan , Kaiqi Huang

Data-Centric Human Preference with Rationales for Direct Preference Alignment

Aligning language models with human preferences through reinforcement learning from human feedback is crucial for their safe and effective deployment. The human preference is typically represented through comparison where one response is…

Machine Learning · Computer Science 2025-07-15 Hoang Anh Just , Ming Jin , Anit Sahu , Huy Phan , Ruoxi Jia

What Makes LLMs Effective Sequential Recommenders? A Study on Preference Intensity and Temporal Context

What enables large language models (LLMs) to effectively model user preferences in sequential recommendation? Our investigation reveals that existing preference-alignment approaches largely rely on binary pairwise comparisons, overlooking…

Information Retrieval · Computer Science 2026-04-20 Zhongyu Ouyang , Qianlong Wen , Chunhui Zhang , Yanfang Ye , Soroush Vosoughi

Sample Enrichment via Temporary Operations on Subsequences for Sequential Recommendation

Sequential recommendation leverages interaction sequences to predict forthcoming user behaviors, crucial for crafting personalized recommendations. However, the true preferences of a user are inherently complex and high-dimensional, while…

Information Retrieval · Computer Science 2024-07-26 Shu Chen , Jinwei Luo , Weike Pan , Jiangxing Yu , Xin Huang , Zhong Ming

Opportunistic Qualitative Planning in Stochastic Systems with Preferences over Temporal Logic Objectives

Preferences play a key role in determining what goals/constraints to satisfy when not all constraints can be satisfied simultaneously. In this work, we study preference-based planning in a stochastic system modeled as a Markov decision…

Formal Languages and Automata Theory · Computer Science 2022-03-28 Abhishek Ninad Kulkarni , Jie Fu

PREDICT: Preference Reasoning by Evaluating Decomposed preferences Inferred from Candidate Trajectories

Accommodating human preferences is essential for creating AI agents that deliver personalized and effective interactions. Recent work has shown the potential for LLMs to infer preferences from user interactions, but they often produce broad…

Artificial Intelligence · Computer Science 2024-10-10 Stephane Aroca-Ouellette , Natalie Mackraz , Barry-John Theobald , Katherine Metcalf

Accelerating Direct Preference Optimization with Prefix Sharing

Offline paired preference optimization algorithms have become a popular approach for fine-tuning on preference data, outperforming traditional supervised fine-tuning in various tasks. However, traditional implementations often involve…

Machine Learning · Computer Science 2024-11-01 Franklin Wang , Sumanth Hegde

Optimizing Offer Sets in Sub-Linear Time

Personalization and recommendations are now accepted as core competencies in just about every online setting, ranging from media platforms to e-commerce to social networks. While the challenge of estimating user preferences has garnered…

Artificial Intelligence · Computer Science 2020-11-18 Vivek F. Farias , Andrew A. Li , Deeksha Sinha

BayesOpt: A Bayesian Optimization Library for Nonlinear Optimization, Experimental Design and Bandits

BayesOpt is a library with state-of-the-art Bayesian optimization methods to solve nonlinear optimization, stochastic bandits or sequential experimental design problems. Bayesian optimization is sample efficient by building a posterior…

Machine Learning · Computer Science 2014-05-30 Ruben Martinez-Cantin

Specifying and Reasoning about Contextual Preferences in the Goal-oriented Requirements Modelling

Goal-oriented requirements variability modelling has established the understanding for adaptability in the early stage of software development-the Requirements Engineering phase. Goal-oriented requirements variability modelling considers…

Software Engineering · Computer Science 2019-05-17 Khavee Agustus Botangen , Jian Yu , Sira Yongchareon , LiangHuai Yang , Quan Bai