Related papers: PaCo: Parameter-Compositional Multi-Task Reinforce…

Auxiliary Learning for Deep Multi-task Learning

Multi-task learning (MTL) is an efficient solution to solve multiple tasks simultaneously in order to get better speed and performance than handling each single-task in turn. The most current methods can be categorized as either: (i) hard…

Computer Vision and Pattern Recognition · Computer Science 2019-12-02 Yifan Liu , Bohan Zhuang , Chunhua Shen , Hao Chen , Wei Yin

Prism: Spectral Parameter Sharing for Multi-Agent Reinforcement Learning

Parameter sharing is a key strategy in multi-agent reinforcement learning (MARL) for improving scalability, yet conventional fully shared architectures often collapse into homogeneous behaviors. Recent methods introduce diversity through…

Multiagent Systems · Computer Science 2026-02-09 Kyungbeom Kim , Seungwon Oh , Kyung-Joong Kim

Self-Paced Multi-Task Learning

In this paper, we propose a novel multi-task learning (MTL) framework, called Self-Paced Multi-Task Learning (SPMTL). Different from previous works treating all tasks and instances equally when training, SPMTL attempts to jointly learn the…

Machine Learning · Computer Science 2017-04-04 Changsheng Li , Junchi Yan , Fan Wei , Weishan Dong , Qingshan Liu , Hongyuan Zha

Co-learning Planning and Control Policies Constrained by Differentiable Logic Specifications

Synthesizing planning and control policies in robotics is a fundamental task, further complicated by factors such as complex logic specifications and high-dimensional robot dynamics. This paper presents a novel reinforcement learning…

Robotics · Computer Science 2023-10-03 Zikang Xiong , Daniel Lawson , Joe Eappen , Ahmed H. Qureshi , Suresh Jagannathan

MAESTRO: Multi-Agent Environment Shaping through Task and Reward Optimization

Cooperative Multi-Agent Reinforcement Learning (MARL) faces two major design bottlenecks: crafting dense reward functions and constructing curricula that avoid local optima in high-dimensional, non-stationary environments. Existing…

Machine Learning · Computer Science 2025-12-11 Boyuan Wu

MTLoRA: A Low-Rank Adaptation Approach for Efficient Multi-Task Learning

Adapting models pre-trained on large-scale datasets to a variety of downstream tasks is a common strategy in deep learning. Consequently, parameter-efficient fine-tuning methods have emerged as a promising way to adapt pre-trained models to…

Computer Vision and Pattern Recognition · Computer Science 2024-04-01 Ahmed Agiza , Marina Neseem , Sherief Reda

TemplateRL: Structured Template-Guided Reinforcement Learning for LLM Reasoning

Reinforcement learning (RL) has emerged as an effective paradigm for enhancing model reasoning. However, existing RL methods like GRPO typically rely on unstructured self-sampling to fit scalar rewards, often producing inefficient rollouts…

Computation and Language · Computer Science 2026-05-18 Jinyang Wu , Chonghua Liao , Mingkuan Feng , Shuai Zhang , Zhengqi Wen , Haoran Luo , Ling Yang , Huazhe Xu , Jianhua Tao

Constrained Optimization of Charged Particle Tracking with Multi-Agent Reinforcement Learning

Reinforcement learning demonstrated immense success in modelling complex physics-driven systems, providing end-to-end trainable solutions by interacting with a simulated or real environment, maximizing a scalar reward signal. In this work,…

Computational Physics · Physics 2025-01-10 Tobias Kortus , Ralf Keidel , Nicolas R. Gauger , Jan Kieseler

Concurrent Learning of Policy and Unknown Safety Constraints in Reinforcement Learning

Reinforcement learning (RL) has revolutionized decision-making across a wide range of domains over the past few decades. Yet, deploying RL policies in real-world scenarios presents the crucial challenge of ensuring safety. Traditional safe…

Systems and Control · Electrical Eng. & Systems 2024-03-26 Lunet Yifru , Ali Baheri

A view on learning robust goal-conditioned value functions: Interplay between RL and MPC

Reinforcement learning (RL) and model predictive control (MPC) offer a wealth of distinct approaches for automatic decision-making under uncertainty. Given the impact both fields have had independently across numerous domains, there is…

Systems and Control · Electrical Eng. & Systems 2025-10-13 Nathan P. Lawrence , Philip D. Loewen , Michael G. Forbes , R. Bhushan Gopaluni , Ali Mesbah

AMAGO-2: Breaking the Multi-Task Barrier in Meta-Reinforcement Learning with Transformers

Language models trained on diverse datasets unlock generalization by in-context learning. Reinforcement Learning (RL) policies can achieve a similar effect by meta-learning within the memory of a sequence model. However, meta-RL research…

Machine Learning · Computer Science 2024-11-19 Jake Grigsby , Justin Sasek , Samyak Parajuli , Daniel Adebi , Amy Zhang , Yuke Zhu

Theoretical Insights into Overparameterized Models in Multi-Task and Replay-Based Continual Learning

Multi-task learning (MTL) is a machine learning paradigm that aims to improve the generalization performance of a model on multiple related tasks by training it simultaneously on those tasks. Unlike MTL, where the model has instant access…

Machine Learning · Computer Science 2025-03-21 Amin Banayeeanzade , Mahdi Soltanolkotabi , Mohammad Rostami

MAESTRO: Task-Relevant Optimization via Adaptive Feature Enhancement and Suppression for Multi-task 3D Perception

The goal of multi-task learning is to learn to conduct multiple tasks simultaneously based on a shared data representation. While this approach can improve learning efficiency, it may also cause performance degradation due to task conflicts…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Changwon Kang , Jisong Kim , Hongjae Shin , Junseo Park , Jun Won Choi

Multi-level Advantage Credit Assignment for Cooperative Multi-Agent Reinforcement Learning

Cooperative multi-agent reinforcement learning (MARL) aims to coordinate multiple agents to achieve a common goal. A key challenge in MARL is credit assignment, which involves assessing each agent's contribution to the shared reward. Given…

Artificial Intelligence · Computer Science 2025-08-12 Xutong Zhao , Yaqi Xie

DisCo RL: Distribution-Conditioned Reinforcement Learning for General-Purpose Policies

Can we use reinforcement learning to learn general-purpose policies that can perform a wide range of different tasks, resulting in flexible and reusable skills? Contextual policies provide this capability in principle, but the…

Machine Learning · Computer Science 2021-04-26 Soroush Nasiriany , Vitchyr H. Pong , Ashvin Nair , Alexander Khazatsky , Glen Berseth , Sergey Levine

Learning Policy Committees for Effective Personalization in MDPs with Diverse Tasks

Many dynamic decision problems, such as robotic control, involve a series of tasks, many of which are unknown at training time. Typical approaches for these problems, such as multi-task and meta reinforcement learning, do not generalize…

Machine Learning · Computer Science 2025-05-28 Luise Ge , Michael Lanier , Anindya Sarkar , Bengisu Guresti , Chongjie Zhang , Yevgeniy Vorobeychik

CompoSuite: A Compositional Reinforcement Learning Benchmark

We present CompoSuite, an open-source simulated robotic manipulation benchmark for compositional multi-task reinforcement learning (RL). Each CompoSuite task requires a particular robot arm to manipulate one individual object to achieve a…

Machine Learning · Computer Science 2022-07-12 Jorge A. Mendez , Marcel Hussing , Meghna Gummadi , Eric Eaton

MULTIPOLAR: Multi-Source Policy Aggregation for Transfer Reinforcement Learning between Diverse Environmental Dynamics

Transfer reinforcement learning (RL) aims at improving the learning efficiency of an agent by exploiting knowledge from other source agents trained on relevant tasks. However, it remains challenging to transfer knowledge between different…

Machine Learning · Computer Science 2020-12-11 Mohammadamin Barekatain , Ryo Yonetani , Masashi Hamaya

Hyperparameter Optimisation with Practical Interpretability and Explanation Methods in Probabilistic Curriculum Learning

Hyperparameter optimisation (HPO) is crucial for achieving strong performance in reinforcement learning (RL), as RL algorithms are inherently sensitive to hyperparameter settings. Probabilistic Curriculum Learning (PCL) is a curriculum…

Machine Learning · Computer Science 2025-04-10 Llewyn Salt , Marcus Gallagher

Residual Policy Learning

We present Residual Policy Learning (RPL): a simple method for improving nondifferentiable policies using model-free deep reinforcement learning. RPL thrives in complex robotic manipulation tasks where good but imperfect controllers are…

Robotics · Computer Science 2019-01-04 Tom Silver , Kelsey Allen , Josh Tenenbaum , Leslie Kaelbling