Related papers: Multi-task Learning with Gradient Guided Policy Sp…

Multi-Task Reinforcement Learning with Soft Modularization

Multi-task learning is a very challenging problem in reinforcement learning. While training multiple tasks jointly allow the policies to share parameters across different tasks, the optimization problem becomes non-trivial: It remains…

Machine Learning · Computer Science 2020-12-08 Ruihan Yang , Huazhe Xu , Yi Wu , Xiaolong Wang

Scalable Multi-Objective and Meta Reinforcement Learning via Gradient Estimation

We study the problem of efficiently estimating policies that simultaneously optimize multiple objectives in reinforcement learning (RL). Given $n$ objectives (or tasks), we seek the optimal partition of these objectives into $k \ll n$…

Machine Learning · Computer Science 2026-02-24 Zhenshuo Zhang , Minxuan Duan , Youran Ye , Hongyang R. Zhang

DiGrad: Multi-Task Reinforcement Learning with Shared Actions

Most reinforcement learning algorithms are inefficient for learning multiple tasks in complex robotic systems, where different tasks share a set of actions. In such environments a compound policy may be learnt with shared neural network…

Machine Learning · Computer Science 2018-03-01 Parijat Dewangan , S Phaniteja , K Madhava Krishna , Abhishek Sarkar , Balaraman Ravindran

Learning Modular Robot Visual-motor Locomotion Policies

Control policy learning for modular robot locomotion has previously been limited to proprioceptive feedback and flat terrain. This paper develops policies for modular systems with vision traversing more challenging environments. These…

Robotics · Computer Science 2023-05-02 Julian Whitman , Howie Choset

Modular Adaptive Policy Selection for Multi-Task Imitation Learning through Task Division

Deep imitation learning requires many expert demonstrations, which can be hard to obtain, especially when many tasks are involved. However, different tasks often share similarities, so learning them jointly can greatly benefit them and…

Machine Learning · Computer Science 2022-05-16 Dafni Antotsiou , Carlo Ciliberto , Tae-Kyun Kim

Multi-Task Policy Search

Learning policies that generalize across multiple tasks is an important and challenging research topic in reinforcement learning and robotics. Training individual policies for every single potential task is often impractical, especially for…

Machine Learning · Statistics 2014-02-13 Marc Peter Deisenroth , Peter Englert , Jan Peters , Dieter Fox

Composite Motion Learning with Task Control

We present a deep learning method for composite and task-driven motion control for physically simulated characters. In contrast to existing data-driven approaches using reinforcement learning that imitate full-body motions, we learn…

Graphics · Computer Science 2023-05-08 Pei Xu , Xiumin Shang , Victor Zordan , Ioannis Karamouzas

Robust Finetuning of Vision-Language-Action Robot Policies via Parameter Merging

Generalist robot policies, trained on large and diverse datasets, have demonstrated the ability to generalize across a wide spectrum of behaviors, enabling a single policy to act in varied real-world environments. However, they still fall…

Robotics · Computer Science 2026-03-03 Yajat Yadav , Zhiyuan Zhou , Andrew Wagenmaker , Karl Pertsch , Sergey Levine

Learning to Act Through Contact: A Unified View of Multi-Task Robot Learning

We present a unified framework for multi-task locomotion and manipulation policy learning grounded in a contact-explicit representation. Instead of designing different policies for different tasks, our approach unifies the definition of a…

Robotics · Computer Science 2026-05-05 Shafeef Omar , Majid Khadiv

Learning a Unified Policy for Position and Force Control in Legged Loco-Manipulation

Robotic loco-manipulation tasks often involve contact-rich interactions with the environment, requiring the joint modeling of contact force and robot position. However, recent visuomotor policies often focus solely on learning position or…

Robotics · Computer Science 2025-10-07 Peiyuan Zhi , Peiyang Li , Jianqin Yin , Baoxiong Jia , Siyuan Huang

Efficient Multi-Task Reinforcement Learning with Cross-Task Policy Guidance

Multi-task reinforcement learning endeavors to efficiently leverage shared information across various tasks, facilitating the simultaneous learning of multiple tasks. Existing approaches primarily focus on parameter sharing with carefully…

Machine Learning · Computer Science 2025-07-10 Jinmin He , Kai Li , Yifan Zang , Haobo Fu , Qiang Fu , Junliang Xing , Jian Cheng

High-Dimensional Continuous Control Using Generalized Advantage Estimation

Policy gradient methods are an appealing approach in reinforcement learning because they directly optimize the cumulative reward and can straightforwardly be used with nonlinear function approximators such as neural networks. The two main…

Machine Learning · Computer Science 2018-10-23 John Schulman , Philipp Moritz , Sergey Levine , Michael Jordan , Pieter Abbeel

Learning Locally, Communicating Globally: Reinforcement Learning of Multi-robot Task Allocation for Cooperative Transport

We consider task allocation for multi-object transport using a multi-robot system, in which each robot selects one object among multiple objects with different and unknown weights. The existing centralized methods assume the number of…

Robotics · Computer Science 2022-12-07 Kazuki Shibata , Tomohiko Jimbo , Tadashi Odashima , Keisuke Takeshita , Takamitsu Matsubara

Discrete Policy: Learning Disentangled Action Space for Multi-Task Robotic Manipulation

Learning visuomotor policy for multi-task robotic manipulation has been a long-standing challenge for the robotics community. The difficulty lies in the diversity of action space: typically, a goal can be accomplished in multiple ways,…

Robotics · Computer Science 2025-03-24 Kun Wu , Yichen Zhu , Jinming Li , Junjie Wen , Ning Liu , Zhiyuan Xu , Jian Tang

Walk These Ways: Tuning Robot Control for Generalization with Multiplicity of Behavior

Learned locomotion policies can rapidly adapt to diverse environments similar to those experienced during training but lack a mechanism for fast tuning when they fail in an out-of-distribution test environment. This necessitates a slow and…

Robotics · Computer Science 2022-12-07 Gabriel B Margolis , Pulkit Agrawal

Discovery of skill switching criteria for learning agile quadruped locomotion

This paper develops a hierarchical learning and optimization framework that can learn and achieve well-coordinated multi-skill locomotion. The learned multi-skill policy can switch between skills automatically and naturally in tracking…

Robotics · Computer Science 2025-02-11 Wanming Yu , Fernando Acero , Vassil Atanassov , Chuanyu Yang , Ioannis Havoutis , Dimitrios Kanoulas , Zhibin Li

Kinematics-Aware Multi-Policy Reinforcement Learning for Force-Capable Humanoid Loco-Manipulation

Humanoid robots, with their human-like morphology, hold great potential for industrial applications. However, existing loco-manipulation methods primarily focus on dexterous manipulation, falling short of the combined requirements for…

Robotics · Computer Science 2025-11-27 Kaiyan Xiao , Zihan Xu , Cheng Zhe , Chengju Liu , Qijun Chen

Guided Reinforcement Learning for Robust Multi-Contact Loco-Manipulation

Reinforcement learning (RL) often necessitates a meticulous Markov Decision Process (MDP) design tailored to each task. This work aims to address this challenge by proposing a systematic approach to behavior synthesis and control for…

Robotics · Computer Science 2024-10-18 Jean-Pierre Sleiman , Mayank Mittal , Marco Hutter

Learning Modular Neural Network Policies for Multi-Task and Multi-Robot Transfer

Reinforcement learning (RL) can automate a wide variety of robotic skills, but learning each new skill requires considerable real-world data collection and manual representation engineering to design policy classes or features. Using deep…

Machine Learning · Computer Science 2016-09-23 Coline Devin , Abhishek Gupta , Trevor Darrell , Pieter Abbeel , Sergey Levine

Learning Modular Robot Control Policies

Modular robots can be rearranged into a new design, perhaps each day, to handle a wide variety of tasks by forming a customized robot for each new task. However, reconfiguring just the mechanism is not sufficient: each design also requires…

Robotics · Computer Science 2021-11-11 Julian Whitman , Matthew Travers , Howie Choset