English
Related papers

Related papers: Sequential Preference-Based Optimization

200 papers

We present an integrated prediction-optimization (PredOpt) framework to efficiently solve sequential decision-making problems by predicting the values of binary decision variables in an optimal solution. We address the key issues of…

Machine Learning · Computer Science 2023-11-14 Dogacan Yilmaz , İ. Esra Büyüktahtakın

In this paper, we present a link between preference-based and multiobjective sequential decision-making. While transforming a multiobjective problem to a preference-based one is quite natural, the other direction is a bit less obvious. We…

Artificial Intelligence · Computer Science 2017-01-04 Paul Weng

Prompt engineering has made significant contributions to the era of large language models, yet its effectiveness depends on the skills of a prompt author. This paper introduces $\textit{iPrOp}$, a novel interactive prompt optimization…

Computation and Language · Computer Science 2025-06-30 Jiahui Li , Roman Klinger

Preferential Bayesian optimization allows optimization of objectives that are either expensive or difficult to measure directly, by relying on a minimal number of comparative evaluations done by a human expert. Generating candidate…

Optimization with preference feedback is an active research area with many applications in engineering systems where humans play a central role, such as building control and autonomous vehicles. While most existing studies focus on…

Optimization and Control · Mathematics 2026-03-31 Wenbin Wang , Wenjie Xu , Colin N. Jones

Prompt engineering is effective but labor-intensive, motivating automated optimization methods. Existing methods typically require labeled datasets, which are often unavailable, and produce verbose, repetitive prompts. We introduce PrefPO,…

Computation and Language · Computer Science 2026-03-26 Rahul Singhal , Pradyumna Tambwekar , Karime Maamari

We propose a new online learning model for learning with preference feedback. The model is especially suited for applications like web search and recommender systems, where preference data is readily available from implicit user feedback…

Machine Learning · Computer Science 2011-11-04 Pannagadatta K. Shivaswamy , Thorsten Joachims

The purpose of this paper is twofold. On one side, we present a general framework for Bayesian optimization and we compare it with some related fields in active learning and Bayesian numerical analysis. On the other hand, Bayesian…

Robotics · Computer Science 2018-02-13 Ruben Martinez-Cantin

Decision-making problems often feature uncertainty stemming from heterogeneous and context-dependent human preferences. To address this, we propose a sequential learning-and-optimization pipeline to learn preference distributions and…

Machine Learning · Computer Science 2026-03-19 Benjamin Hudson , Laurent Charlin , Emma Frejinger

Modeling user sequential behaviors has recently attracted increasing attention in the recommendation domain. Existing methods mostly assume coherent preference in the same sequence. However, user personalities are volatile and easily…

Information Retrieval · Computer Science 2022-04-01 Weiqi Shao , Xu Chen , Long Xia , Jiashu Zhao , Dawei Yin

Human preference alignment is critical in building powerful and reliable large language models (LLMs). However, current methods either ignore the multi-dimensionality of human preferences (e.g. helpfulness and harmlessness) or struggle with…

Machine Learning · Computer Science 2024-10-14 Xingzhou Lou , Junge Zhang , Jian Xie , Lifeng Liu , Dong Yan , Kaiqi Huang

Aligning language models with human preferences through reinforcement learning from human feedback is crucial for their safe and effective deployment. The human preference is typically represented through comparison where one response is…

Machine Learning · Computer Science 2025-07-15 Hoang Anh Just , Ming Jin , Anit Sahu , Huy Phan , Ruoxi Jia

What enables large language models (LLMs) to effectively model user preferences in sequential recommendation? Our investigation reveals that existing preference-alignment approaches largely rely on binary pairwise comparisons, overlooking…

Information Retrieval · Computer Science 2026-04-20 Zhongyu Ouyang , Qianlong Wen , Chunhui Zhang , Yanfang Ye , Soroush Vosoughi

Sequential recommendation leverages interaction sequences to predict forthcoming user behaviors, crucial for crafting personalized recommendations. However, the true preferences of a user are inherently complex and high-dimensional, while…

Information Retrieval · Computer Science 2024-07-26 Shu Chen , Jinwei Luo , Weike Pan , Jiangxing Yu , Xin Huang , Zhong Ming

Preferences play a key role in determining what goals/constraints to satisfy when not all constraints can be satisfied simultaneously. In this work, we study preference-based planning in a stochastic system modeled as a Markov decision…

Formal Languages and Automata Theory · Computer Science 2022-03-28 Abhishek Ninad Kulkarni , Jie Fu

Accommodating human preferences is essential for creating AI agents that deliver personalized and effective interactions. Recent work has shown the potential for LLMs to infer preferences from user interactions, but they often produce broad…

Artificial Intelligence · Computer Science 2024-10-10 Stephane Aroca-Ouellette , Natalie Mackraz , Barry-John Theobald , Katherine Metcalf

Offline paired preference optimization algorithms have become a popular approach for fine-tuning on preference data, outperforming traditional supervised fine-tuning in various tasks. However, traditional implementations often involve…

Machine Learning · Computer Science 2024-11-01 Franklin Wang , Sumanth Hegde

Personalization and recommendations are now accepted as core competencies in just about every online setting, ranging from media platforms to e-commerce to social networks. While the challenge of estimating user preferences has garnered…

Artificial Intelligence · Computer Science 2020-11-18 Vivek F. Farias , Andrew A. Li , Deeksha Sinha

BayesOpt is a library with state-of-the-art Bayesian optimization methods to solve nonlinear optimization, stochastic bandits or sequential experimental design problems. Bayesian optimization is sample efficient by building a posterior…

Machine Learning · Computer Science 2014-05-30 Ruben Martinez-Cantin

Goal-oriented requirements variability modelling has established the understanding for adaptability in the early stage of software development-the Requirements Engineering phase. Goal-oriented requirements variability modelling considers…

Software Engineering · Computer Science 2019-05-17 Khavee Agustus Botangen , Jian Yu , Sira Yongchareon , LiangHuai Yang , Quan Bai
‹ Prev 1 2 3 10 Next ›