Related papers: A Distributional View on Multi-Objective Policy Op…

A Generalized Algorithm for Multi-Objective Reinforcement Learning and Policy Adaptation

We introduce a new algorithm for multi-objective reinforcement learning (MORL) with linear preferences, with the goal of enabling few-shot adaptation to new tasks. In MORL, the aim is to learn policies over multiple competing objectives…

Machine Learning · Computer Science 2019-11-07 Runzhe Yang , Xingyuan Sun , Karthik Narasimhan

Learning Locally, Communicating Globally: Reinforcement Learning of Multi-robot Task Allocation for Cooperative Transport

We consider task allocation for multi-object transport using a multi-robot system, in which each robot selects one object among multiple objects with different and unknown weights. The existing centralized methods assume the number of…

Robotics · Computer Science 2022-12-07 Kazuki Shibata , Tomohiko Jimbo , Tadashi Odashima , Keisuke Takeshita , Takamitsu Matsubara

Multi-Task Policy Search

Learning policies that generalize across multiple tasks is an important and challenging research topic in reinforcement learning and robotics. Training individual policies for every single potential task is often impractical, especially for…

Machine Learning · Statistics 2014-02-13 Marc Peter Deisenroth , Peter Englert , Jan Peters , Dieter Fox

A Practical Guide to Multi-Objective Reinforcement Learning and Planning

Real-world decision-making tasks are generally complex, requiring trade-offs between multiple, often conflicting, objectives. Despite this, the majority of research in reinforcement learning and decision-theoretic planning either assumes…

Artificial Intelligence · Computer Science 2022-04-22 Conor F. Hayes , Roxana Rădulescu , Eugenio Bargiacchi , Johan Källström , Matthew Macfarlane , Mathieu Reymond , Timothy Verstraeten , Luisa M. Zintgraf , Richard Dazeley , Fredrik Heintz , Enda Howley , Athirai A. Irissappane , Patrick Mannion , Ann Nowé , Gabriel Ramos , Marcello Restelli , Peter Vamplew , Diederik M. Roijers

Multi-Task Learning as Multi-Objective Optimization

In multi-task learning, multiple tasks are solved jointly, sharing inductive bias between them. Multi-task learning is inherently a multi-objective problem because different tasks may conflict, necessitating a trade-off. A common compromise…

Machine Learning · Computer Science 2019-01-14 Ozan Sener , Vladlen Koltun

Preference Inference from Demonstration in Multi-objective Multi-agent Decision Making

It is challenging to quantify numerical preferences for different objectives in a multi-objective decision-making problem. However, the demonstrations of a user are often accessible. We propose an algorithm to infer linear preference…

Artificial Intelligence · Computer Science 2023-04-28 Junlin Lu

Lexicographic Multi-Objective Reinforcement Learning

In this work we introduce reinforcement learning techniques for solving lexicographic multi-objective problems. These are problems that involve multiple reward signals, and where the goal is to learn a policy that maximises the first reward…

Machine Learning · Computer Science 2022-12-29 Joar Skalse , Lewis Hammond , Charlie Griffin , Alessandro Abate

Deep Reinforcement Learning for Resource Allocation in Business Processes

Assigning resources in business processes execution is a repetitive task that can be effectively automated. However, different automation methods may give varying results that may not be optimal. Proper resource allocation is crucial as it…

Machine Learning · Computer Science 2021-04-02 Kamil Żbikowski , Michał Ostapowicz , Piotr Gawrysiak

Learning from Multiple Outlooks

We propose a novel problem formulation of learning a single task when the data are provided in different feature spaces. Each such space is called an outlook, and is assumed to contain both labeled and unlabeled data. The objective is to…

Machine Learning · Computer Science 2011-06-15 Maayan Harel , Shie Mannor

Eliciting User Preferences for Personalized Multi-Objective Decision Making through Comparative Feedback

In classic reinforcement learning (RL) and decision making problems, policies are evaluated with respect to a scalar reward function, and all optimal policies are the same with regards to their expected return. However, many real-world…

Machine Learning · Computer Science 2023-11-02 Han Shao , Lee Cohen , Avrim Blum , Yishay Mansour , Aadirupa Saha , Matthew R. Walter

Transfer Learning by Modeling a Distribution over Policies

Exploration and adaptation to new tasks in a transfer learning setup is a central challenge in reinforcement learning. In this work, we build on the idea of modeling a distribution over policies in a Bayesian deep reinforcement learning…

Machine Learning · Computer Science 2019-06-11 Disha Shrivastava , Eeshan Gunesh Dhekane , Riashat Islam

Scalable Centralized Deep Multi-Agent Reinforcement Learning via Policy Gradients

In this paper, we explore using deep reinforcement learning for problems with multiple agents. Most existing methods for deep multi-agent reinforcement learning consider only a small number of agents. When the number of agents increases,…

Machine Learning · Computer Science 2018-05-24 Arbaaz Khan , Clark Zhang , Daniel D. Lee , Vijay Kumar , Alejandro Ribeiro

Outcome-Driven Reinforcement Learning via Variational Inference

While reinforcement learning algorithms provide automated acquisition of optimal policies, practical application of such methods requires a number of design decisions, such as manually designing reward functions that not only define the…

Machine Learning · Computer Science 2022-12-29 Tim G. J. Rudner , Vitchyr H. Pong , Rowan McAllister , Yarin Gal , Sergey Levine

Multi-Task Reinforcement Learning with Soft Modularization

Multi-task learning is a very challenging problem in reinforcement learning. While training multiple tasks jointly allow the policies to share parameters across different tasks, the optimization problem becomes non-trivial: It remains…

Machine Learning · Computer Science 2020-12-08 Ruihan Yang , Huazhe Xu , Yi Wu , Xiaolong Wang

A Robust Policy Bootstrapping Algorithm for Multi-objective Reinforcement Learning in Non-stationary Environments

Multi-objective Markov decision processes are a special kind of multi-objective optimization problem that involves sequential decision making while satisfying the Markov property of stochastic processes. Multi-objective reinforcement…

Machine Learning · Computer Science 2023-08-22 Sherif Abdelfattah , Kathryn Kasmarik , Jiankun Hu

Relationship Explainable Multi-objective Optimization Via Vector Value Function Based Reinforcement Learning

Solving multi-objective optimization problems is important in various applications where users are interested in obtaining optimal policies subject to multiple, yet often conflicting objectives. A typical approach to obtain optimal policies…

Systems and Control · Electrical Eng. & Systems 2019-10-07 Huixin Zhan , Yongcan Cao

Unsupervised Reinforcement Learning in Multiple Environments

Several recent works have been dedicated to unsupervised reinforcement learning in a single environment, in which a policy is first pre-trained with unsupervised interactions, and then fine-tuned towards the optimal policy for several…

Machine Learning · Computer Science 2021-12-17 Mirco Mutti , Mattia Mancassola , Marcello Restelli

Towards Principled Unsupervised Multi-Agent Reinforcement Learning

In reinforcement learning, we typically refer to unsupervised pre-training when we aim to pre-train a policy without a priori access to the task specification, i.e. rewards, to be later employed for efficient learning of downstream tasks.…

Machine Learning · Computer Science 2025-10-21 Riccardo Zamboni , Mirco Mutti , Marcello Restelli

Scalable Multi-Objective and Meta Reinforcement Learning via Gradient Estimation

We study the problem of efficiently estimating policies that simultaneously optimize multiple objectives in reinforcement learning (RL). Given $n$ objectives (or tasks), we seek the optimal partition of these objectives into $k \ll n$…

Machine Learning · Computer Science 2026-02-24 Zhenshuo Zhang , Minxuan Duan , Youran Ye , Hongyang R. Zhang

Distributional Multi-Objective Decision Making

For effective decision support in scenarios with conflicting objectives, sets of potentially optimal solutions can be presented to the decision maker. We explore both what policies these sets should contain and how such sets can be computed…

Artificial Intelligence · Computer Science 2023-07-19 Willem Röpke , Conor F. Hayes , Patrick Mannion , Enda Howley , Ann Nowé , Diederik M. Roijers