Related papers: From Preference-Based to Multiobjective Sequential…

Inferring Lexicographically-Ordered Rewards from Preferences

Modeling the preferences of agents over a set of alternatives is a principal concern in many areas. The dominant approach has been to find a single reward/utility function with the property that alternatives yielding higher rewards are…

Machine Learning · Computer Science 2022-06-09 Alihan Hüyük, William R. Zame, Mihaela van der Schaar

A Survey of Multi-Objective Sequential Decision-Making

Sequential decision-making problems with multiple objectives arise naturally in practice and pose unique challenges for research in decision-theoretic planning and learning, which has largely focused on single-objective settings. This…

Artificial Intelligence · Computer Science 2014-02-05 Diederik Marijn Roijers, Peter Vamplew, Shimon Whiteson, Richard Dazeley

On categorical approach to derived preference relations in some decision making problems

A structure called a decision making problem is considered. The set of outcomes (consequences) is partially ordered according to the decision maker's preferences. The problem is how these preferences affect a decision maker to prefer one of…

Category Theory · Mathematics 2007-05-23 Victor V. Rozen, Grigori Zhitomirski

Learning What Matters Now: Dynamic Preference Inference under Contextual Shifts

Humans often juggle multiple, sometimes conflicting objectives and shift their priorities as circumstances change, rather than following a fixed objective function. In contrast, most computational decision-making and multi-objective RL…

Artificial Intelligence · Computer Science 2026-03-25 Xianwei Cao, Dou Quan, Zhenliang Zhang, Shuang Wang

From Common Sense Reasoning to Neural Network Models through Multiple Preferences: an overview

In this paper we discuss the relationships between conditional and preferential logics and neural network models, based on a multi-preferential semantics. We propose a concept-wise multipreference semantics, recently introduced for…

Artificial Intelligence · Computer Science 2021-07-13 Laura Giordano, Valentina Gliozzi, Daniele Theseider Dupré

Regularized Conditional Diffusion Model for Multi-Task Preference Alignment

Sequential decision-making is desired to align with human intents and exhibit versatility across various tasks. Previous methods formulate it as a conditional generation process, utilizing return-conditioned diffusion models to directly…

Machine Learning · Computer Science 2024-10-11 Xudong Yu, Chenjia Bai, Haoran He, Changhong Wang, Xuelong Li

A Distributional View on Multi-Objective Policy Optimization

Many real-world problems require trading off multiple competing objectives. However, these objectives are often in different units and/or scales, which can make it challenging for practitioners to express numerical preferences over…

Machine Learning · Computer Science 2020-05-18 Abbas Abdolmaleki, Sandy H. Huang, Leonard Hasenclever, Michael Neunert, H. Francis Song, Martina Zambelli, Murilo F. Martins, Nicolas Heess, Raia Hadsell, Martin Riedmiller

Preference-based Multi-Objective Reinforcement Learning

Multi-objective reinforcement learning (MORL) is a structured approach for optimizing tasks with multiple objectives. However, it often relies on pre-defined reward functions, which can be hard to design for balancing conflicting goals and…

Machine Learning · Computer Science 2025-07-21 Ni Mu, Yao Luan, Qing-Shan Jia

Sequential Preference-Based Optimization

Many real-world engineering problems rely on human preferences to guide their design and optimization. We present PrefOpt, an open source package to simplify sequential optimization tasks that incorporate human preference feedback. Our…

Machine Learning · Computer Science 2018-01-10 Ian Dewancker, Jakob Bauer, Michael McCourt

Inferring Preferences from Demonstrations in Multi-objective Reinforcement Learning: A Dynamic Weight-based Approach

Many decision-making problems feature multiple objectives. In such problems, it is not always possible to know the preferences of a decision-maker for different objectives. However, it is often possible to observe the behavior of…

Artificial Intelligence · Computer Science 2023-04-28 Junlin Lu, Patrick Mannion, Karl Mason

Preference Queries over Taxonomic Domains

When composing multiple preferences characterizing the most suitable results for a user, several issues may arise. Indeed, preferences can be partially contradictory, suffer from a mismatch with the level of detail of the actual data, and…

Databases · Computer Science 2025-01-10 Paolo Ciaccia, Davide Martinenghi, Riccardo Torlone

Eliciting User Preferences for Personalized Multi-Objective Decision Making through Comparative Feedback

In classic reinforcement learning (RL) and decision making problems, policies are evaluated with respect to a scalar reward function, and all optimal policies are the same with regards to their expected return. However, many real-world…

Machine Learning · Computer Science 2023-11-02 Han Shao, Lee Cohen, Avrim Blum, Yishay Mansour, Aadirupa Saha, Matthew R. Walter

Preference Learning for AI Alignment: a Causal Perspective

Reward modelling from preference data is a crucial step in aligning large language models (LLMs) with human values, requiring robust generalisation to novel prompt-response pairs. In this work, we propose to frame this problem in a causal…

Artificial Intelligence · Computer Science 2026-05-12 Katarzyna Kobalczyk, Mihaela van der Schaar

Preference Transformer: Modeling Human Preferences using Transformers for RL

Preference-based reinforcement learning (RL) provides a framework to train agents using human preferences between two behaviors. However, preference-based RL has been challenging to scale since it requires a large amount of human feedback…

Machine Learning · Computer Science 2023-03-03 Changyeon Kim, Jongjin Park, Jinwoo Shin, Honglak Lee, Pieter Abbeel, Kimin Lee

Assumption-Based Approaches to Reasoning with Priorities

This paper maps out the relation between different approaches for handling preferences in argumentation with strict rules and defeasible assumptions by offering translations between them. The systems we compare are: non-prioritized defeats…

Artificial Intelligence · Computer Science 2017-10-02 Jesse Heyninck, Christian Straßer, Pere Pardo

A preference learning framework for multiple criteria sorting with diverse additive value models and valued assignment examples

We present a preference learning framework for multiple criteria sorting. We consider sorting procedures applying an additive value model with diverse types of marginal value functions (including linear, piecewise-linear, splined, and…

Machine Learning · Computer Science 2019-10-15 Jiapeng Liu, Milosz Kadzinski, Xiuwu Liao, Xiaoxin Mao, Yao Wang

Preference Inference from Demonstration in Multi-objective Multi-agent Decision Making

It is challenging to quantify numerical preferences for different objectives in a multi-objective decision-making problem. However, the demonstrations of a user are often accessible. We propose an algorithm to infer linear preference…

Artificial Intelligence · Computer Science 2023-04-28 Junlin Lu

Preference-based Conditional Treatment Effects and Policy Learning

We introduce a new preference-based framework for conditional treatment effect estimation and policy learning, built on the Conditional Preference-based Treatment Effect (CPTE). CPTE requires only that outcomes be ranked under a preference…

Machine Learning · Statistics 2026-02-04 Dovid Parnas, Mathieu Even, Julie Josse, Uri Shalit

Opportunistic Qualitative Planning in Stochastic Systems with Preferences over Temporal Logic Objectives

Preferences play a key role in determining what goals/constraints to satisfy when not all constraints can be satisfied simultaneously. In this work, we study preference-based planning in a stochastic system modeled as a Markov decision…

Formal Languages and Automata Theory · Computer Science 2022-03-28 Abhishek Ninad Kulkarni, Jie Fu

Decisions in elections --- transitive or intransitive quantum preferences

Our preferences depend on the circumstances in which we reveal them. We will introduce a dependency which allows us to illustrate the relation between the possibility of winning of particular candidates in a quantum election and the type of…

Quantum Physics · Physics 2015-05-27 Marcin Makowski, Edward W. Piotrowski