Related papers: On Preference Learning Based on Sequential Bayesia…

Bayes-Optimal Entropy Pursuit for Active Choice-Based Preference Learning

We analyze the problem of learning a single user's preferences in an active learning setting, sequentially and adaptively querying the user over a finite time horizon. Learning is conducted via choice-based queries, where the user selects…

Machine Learning · Statistics 2017-02-27 Stephen N. Pallone , Peter I. Frazier , Shane G. Henderson

Learning the Preferences of Ignorant, Inconsistent Agents

An important use of machine learning is to learn what people value. What posts or photos should a user be shown? Which jobs or activities would a person find rewarding? In each case, observations of people's past choices can inform our…

Artificial Intelligence · Computer Science 2015-12-21 Owain Evans , Andreas Stuhlmueller , Noah D. Goodman

Preference Inference from Demonstration in Multi-objective Multi-agent Decision Making

It is challenging to quantify numerical preferences for different objectives in a multi-objective decision-making problem. However, the demonstrations of a user are often accessible. We propose an algorithm to infer linear preference…

Artificial Intelligence · Computer Science 2023-04-28 Junlin Lu

Stochastic Pairwise Preference Convergence in Bayesian Agents

Beliefs inform the behavior of forward-thinking agents in complex environments. Recently, sequential Bayesian inference has emerged as a mechanism to study belief formation among agents adapting to dynamical conditions. However, we lack…

Adaptation and Self-Organizing Systems · Physics 2023-11-08 Jordan T Kemp , Max-Olivier Hongler , Olivier Gallay

Preference elicitation and inverse reinforcement learning

We state the problem of inverse reinforcement learning in terms of preference elicitation, resulting in a principled (Bayesian) statistical formulation. This generalises previous work on Bayesian inverse reinforcement learning and allows us…

Machine Learning · Statistics 2011-06-30 Constantin Rothkopf , Christos Dimitrakakis

Bayesian Persuasion in Sequential Decision-Making

We study a dynamic model of Bayesian persuasion in sequential decision-making settings. An informed principal observes an external parameter of the world and advises an uninformed agent about actions to take over time. The agent takes…

Computer Science and Game Theory · Computer Science 2022-05-25 Jiarui Gan , Rupak Majumdar , Goran Radanovic , Adish Singla

Multi-Attribute Bayesian Optimization With Interactive Preference Learning

We consider black-box global optimization of time-consuming-to-evaluate functions on behalf of a decision-maker (DM) whose preferences must be learned. Each feasible design is associated with a time-consuming-to-evaluate vector of…

Machine Learning · Statistics 2020-03-05 Raul Astudillo , Peter I. Frazier

Exploiting Prior Knowledge in Preferential Learning of Individualized Autonomous Vehicle Driving Styles

Trajectory planning for automated vehicles commonly employs optimization over a moving horizon - Model Predictive Control - where the cost function critically influences the resulting driving style. However, finding a suitable cost function…

Systems and Control · Electrical Eng. & Systems 2025-10-20 Lukas Theiner , Sebastian Hirt , Alexander Steinke , Rolf Findeisen

Bayesian Active Learning for Classification and Preference Learning

Information theoretic active learning has been widely studied for probabilistic models. For simple regression an optimal myopic policy is easily tractable. However, for other tasks and with more complex models, such as classification with…

Machine Learning · Statistics 2011-12-30 Neil Houlsby , Ferenc Huszár , Zoubin Ghahramani , Máté Lengyel

Efficiently Learning from Revealed Preference

In this paper, we consider the revealed preferences problem from a learning perspective. Every day, a price vector and a budget is drawn from an unknown distribution, and a rational agent buys his most preferred bundle according to some…

Computer Science and Game Theory · Computer Science 2012-11-20 Morteza Zadimoghaddam , Aaron Roth

Projective Preferential Bayesian Optimization

Bayesian optimization is an effective method for finding extrema of a black-box function. We propose a new type of Bayesian optimization for learning user preferences in high-dimensional spaces. The central assumption is that the underlying…

Machine Learning · Statistics 2020-08-17 Petrus Mikkola , Milica Todorović , Jari Järvi , Patrick Rinke , Samuel Kaski

Unified Representation Learning for Multi-Intent Diversity and Behavioral Uncertainty in Recommender Systems

This paper addresses the challenge of jointly modeling user intent diversity and behavioral uncertainty in recommender systems. A unified representation learning framework is proposed. The framework builds a multi-intent representation…

Information Retrieval · Computer Science 2025-09-08 Wei Xu , Jiasen Zheng , Junjiang Lin , Mingxuan Han , Junliang Du

Preference Estimation via Opponent Modeling in Multi-Agent Negotiation

Automated negotiation in complex, multi-party and multi-issue settings critically depends on accurate opponent modeling. However, conventional numerical-only approaches fail to capture the qualitative information embedded in natural…

Computation and Language · Computer Science 2026-04-20 Yuta Konishi , Kento Yamamoto , Eisuke Sonomoto , Rikuho Takeda , Ryo Furukawa , Yusuke Muraki , Takafumi Shimizu , Kazuma Fukumura , Yuya Kanemoto , Takayuki Ito , Shiyao Ding

Learning to Trust: Bayesian Adaptation to Varying Suggester Reliability in Sequential Decision Making

Autonomous agents operating in sequential decision-making tasks under uncertainty can benefit from external action suggestions, which provide valuable guidance but inherently vary in reliability. Existing methods for incorporating such…

Artificial Intelligence · Computer Science 2026-05-26 Dylan M. Asmar , Mykel J. Kochenderfer

Online Learning with Preference Feedback

We propose a new online learning model for learning with preference feedback. The model is especially suited for applications like web search and recommender systems, where preference data is readily available from implicit user feedback…

Machine Learning · Computer Science 2011-11-04 Pannagadatta K. Shivaswamy , Thorsten Joachims

After Talking with 1,000 Personas: Learning Preference-Aligned Proactive Assistants From Large-Scale Persona Interactions

Smart assistants increasingly act proactively, yet mistimed or intrusive behavior often causes users to lose trust and disable these features. Learning user preferences for proactive assistance is difficult because real-world studies are…

Human-Computer Interaction · Computer Science 2026-02-05 Ziyi Xuan , Yiwen Wu , Zhaoyang Yan , Vinod Namboodiri , Yu Yang

Bayesian Exploration with Heterogeneous Agents

It is common in recommendation systems that users both consume and produce information as they make strategic choices under uncertainty. While a social planner would balance "exploration" and "exploitation" using a multi-armed bandit…

Computer Science and Game Theory · Computer Science 2019-02-20 Nicole Immorlica , Jieming Mao , Aleksandrs Slivkins , Zhiwei Steven Wu

Preference Construction: A Bayesian Interactive Preference Elicitation Framework Based on Monte Carlo Tree Search

We present a novel preference learning framework to capture participant preferences efficiently within limited interaction rounds. It involves three main contributions. First, we develop a variational Bayesian approach to infer the…

Machine Learning · Computer Science 2025-03-20 Yan Wang , Jiapeng Liu , Milosz Kadziński , Xiuwu Liao

Offline Preference-Based Apprenticeship Learning

Learning a reward function from human preferences is challenging as it typically requires having a high-fidelity simulator or using expensive and potentially unsafe actual physical rollouts in the environment. However, in many tasks the…

Machine Learning · Computer Science 2022-02-18 Daniel Shin , Daniel S. Brown , Anca D. Dragan

Preferential Bayesian Optimization

Bayesian optimization (BO) has emerged during the last few years as an effective approach to optimizing black-box functions where direct queries of the objective are expensive. In this paper we consider the case where direct access to the…

Machine Learning · Statistics 2017-04-13 Javier Gonzalez , Zhenwen Dai , Andreas Damianou , Neil D. Lawrence