Related papers: Flexible Multi-task Networks by Learning Parameter…

DynaShare: Task and Instance Conditioned Parameter Sharing for Multi-Task Learning

Multi-task networks rely on effective parameter sharing to achieve robust generalization across tasks. In this paper, we present a novel parameter sharing method for multi-task learning that conditions parameter sharing on both the task and…

Computer Vision and Pattern Recognition · Computer Science 2023-05-30 Elahe Rahimian , Golara Javadi , Frederick Tung , Gabriel Oliveira

Learning Sparse Sharing Architectures for Multiple Tasks

Most existing deep multi-task learning models are based on parameter sharing, such as hard sharing, hierarchical sharing, and soft sharing. How choosing a suitable sharing mechanism depends on the relations among the tasks, which is not…

Computation and Language · Computer Science 2019-11-19 Tianxiang Sun , Yunfan Shao , Xiaonan Li , Pengfei Liu , Hang Yan , Xipeng Qiu , Xuanjing Huang

Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks

State-of-the-art parameter-efficient fine-tuning methods rely on introducing adapter modules between the layers of a pretrained language model. However, such modules are trained separately for each task and thus do not enable sharing…

Computation and Language · Computer Science 2021-06-09 Rabeeh Karimi Mahabadi , Sebastian Ruder , Mostafa Dehghani , James Henderson

Adaptive parameter sharing for multi-agent reinforcement learning

Parameter sharing, as an important technique in multi-agent systems, can effectively solve the scalability issue in large-scale agent problems. However, the effectiveness of parameter sharing largely depends on the environment setting. When…

Artificial Intelligence · Computer Science 2025-03-04 Dapeng Li , Na Lou , Bin Zhang , Zhiwei Xu , Guoliang Fan

Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing

Sharing parameters in multi-agent deep reinforcement learning has played an essential role in allowing algorithms to scale to a large number of agents. Parameter sharing between agents significantly decreases the number of trainable…

Multiagent Systems · Computer Science 2021-06-15 Filippos Christianos , Georgios Papoudakis , Arrasy Rahman , Stefano V. Albrecht

Learning Task Grouping and Overlap in Multi-task Learning

In the paradigm of multi-task learning, mul- tiple related prediction tasks are learned jointly, sharing information across the tasks. We propose a framework for multi-task learn- ing that enables one to selectively share the information…

Machine Learning · Computer Science 2012-07-03 Abhishek Kumar , Hal Daume

Learning to Branch for Multi-Task Learning

Training multiple tasks jointly in one deep network yields reduced latency during inference and better performance over the single-task counterpart by sharing certain layers of a network. However, over-sharing a network could erroneously…

Machine Learning · Computer Science 2020-06-11 Pengsheng Guo , Chen-Yu Lee , Daniel Ulbricht

Leveraging convergence behavior to balance conflicting tasks in multi-task learning

Multi-Task Learning is a learning paradigm that uses correlated tasks to improve performance generalization. A common way to learn multiple tasks is through the hard parameter sharing approach, in which a single architecture is used to…

Machine Learning · Computer Science 2022-04-15 Angelica Tiemi Mizuno Nakamura , Denis Fernando Wolf , Valdir Grassi

Distributed Multi-Task Relationship Learning

Multi-task learning aims to learn multiple tasks jointly by exploiting their relatedness to improve the generalization performance for each task. Traditionally, to perform multi-task learning, one needs to centralize data from all the tasks…

Machine Learning · Computer Science 2017-06-21 Sulin Liu , Sinno Jialin Pan , Qirong Ho

Parameter Sharing Methods for Multilingual Self-Attentional Translation Models

In multilingual neural machine translation, it has been shown that sharing a single translation model between multiple languages can achieve competitive performance, sometimes even leading to performance gains over bilingually trained…

Computation and Language · Computer Science 2018-09-14 Devendra Singh Sachan , Graham Neubig

Efficient Computation Sharing for Multi-Task Visual Scene Understanding

Solving multiple visual tasks using individual models can be resource-intensive, while multi-task learning can conserve resources by sharing knowledge across different tasks. Despite the benefits of multi-task learning, such techniques can…

Computer Vision and Pattern Recognition · Computer Science 2023-08-16 Sara Shoouri , Mingyu Yang , Zichen Fan , Hun-Seok Kim

Scalable Weight Reparametrization for Efficient Transfer Learning

This paper proposes a novel, efficient transfer learning method, called Scalable Weight Reparametrization (SWR) that is efficient and effective for multiple downstream tasks. Efficient transfer learning involves utilizing a pre-trained…

Machine Learning · Computer Science 2023-02-28 Byeonggeun Kim , Jun-Tae Lee , Seunghan yang , Simyung Chang

Joint auto-encoders: a flexible multi-task learning framework

The incorporation of prior knowledge into learning is essential in achieving good performance based on small noisy samples. Such knowledge is often incorporated through the availability of related data arising from domains and tasks similar…

Machine Learning · Statistics 2026-02-24 Baruch Epstein , Ron Meir , Tomer Michaeli

K for the Price of 1: Parameter-efficient Multi-task and Transfer Learning

We introduce a novel method that enables parameter-efficient transfer and multi-task learning with deep neural networks. The basic approach is to learn a model patch - a small set of parameters - that will specialize to each task, instead…

Machine Learning · Computer Science 2019-02-26 Pramod Kaushik Mudrakarta , Mark Sandler , Andrey Zhmoginov , Andrew Howard

Compositional Models: Multi-Task Learning and Knowledge Transfer with Modular Networks

Conditional computation and modular networks have been recently proposed for multitask learning and other problems as a way to decompose problem solving into multiple reusable computational blocks. We propose a new approach for learning…

Machine Learning · Computer Science 2021-07-26 Andrey Zhmoginov , Dina Bashkirova , Mark Sandler

Multi-Task Reinforcement Learning with Soft Modularization

Multi-task learning is a very challenging problem in reinforcement learning. While training multiple tasks jointly allow the policies to share parameters across different tasks, the optimization problem becomes non-trivial: It remains…

Machine Learning · Computer Science 2020-12-08 Ruihan Yang , Huazhe Xu , Yi Wu , Xiaolong Wang

Learning Good Features to Transfer Across Tasks and Domains

Availability of labelled data is the major obstacle to the deployment of deep learning algorithms for computer vision tasks in new domains. The fact that many frameworks adopted to solve different tasks share the same architecture suggests…

Computer Vision and Pattern Recognition · Computer Science 2023-01-27 Pierluigi Zama Ramirez , Adriano Cardace , Luca De Luigi , Alessio Tonioni , Samuele Salti , Luigi Di Stefano

Rethinking Hard-Parameter Sharing in Multi-Domain Learning

Hard parameter sharing in multi-domain learning (MDL) allows domains to share some of the model parameters to reduce storage cost while improving prediction accuracy. One common sharing practice is to share the bottom layers of a deep…

Machine Learning · Computer Science 2022-03-22 Lijun Zhang , Qizheng Yang , Xiao Liu , Hui Guan

Understanding Parameter Sharing in Transformers

Parameter sharing has proven to be a parameter-efficient approach. Previous work on Transformers has focused on sharing parameters in different layers, which can improve the performance of models with limited parameters by increasing model…

Machine Learning · Computer Science 2023-06-19 Ye Lin , Mingxuan Wang , Zhexi Zhang , Xiaohui Wang , Tong Xiao , Jingbo Zhu

Parameter Sharing with Network Pruning for Scalable Multi-Agent Deep Reinforcement Learning

Handling the problem of scalability is one of the essential issues for multi-agent reinforcement learning (MARL) algorithms to be applied to real-world problems typically involving massively many agents. For this, parameter sharing across…

Multiagent Systems · Computer Science 2023-03-03 Woojun Kim , Youngchul Sung