Related papers: Task Adaptive Parameter Sharing for Multi-Task Lea…

Parameter-Efficient Multi-Task Learning via Progressive Task-Specific Adaptation

Parameter-efficient fine-tuning methods have emerged as a promising solution for adapting pre-trained models to various downstream tasks. While these methods perform well in single-task learning, extending them to multi-task learning…

Computer Vision and Pattern Recognition · Computer Science 2026-04-28 Neeraj Gangwar , Anshuka Rangi , Rishabh Deshmukh , Holakou Rahmanian , Yesh Dattatreya , Nickvash Kani

Fully-adaptive Feature Sharing in Multi-Task Networks with Applications in Person Attribute Classification

Multi-task learning aims to improve generalization performance of multiple prediction tasks by appropriately sharing relevant information across them. In the context of deep neural networks, this idea is often realized by hand-designed…

Computer Vision and Pattern Recognition · Computer Science 2016-11-17 Yongxi Lu , Abhishek Kumar , Shuangfei Zhai , Yu Cheng , Tara Javidi , Rogerio Feris

Learning Sparse Sharing Architectures for Multiple Tasks

Most existing deep multi-task learning models are based on parameter sharing, such as hard sharing, hierarchical sharing, and soft sharing. How choosing a suitable sharing mechanism depends on the relations among the tasks, which is not…

Computation and Language · Computer Science 2019-11-19 Tianxiang Sun , Yunfan Shao , Xiaonan Li , Pengfei Liu , Hang Yan , Xipeng Qiu , Xuanjing Huang

AdaShare: Learning What To Share For Efficient Deep Multi-Task Learning

Multi-task learning is an open and challenging problem in computer vision. The typical way of conducting multi-task learning with deep neural networks is either through handcrafted schemes that share all initial layers and branch out at an…

Computer Vision and Pattern Recognition · Computer Science 2020-11-20 Ximeng Sun , Rameswar Panda , Rogerio Feris , Kate Saenko

Adaptive parameter sharing for multi-agent reinforcement learning

Parameter sharing, as an important technique in multi-agent systems, can effectively solve the scalability issue in large-scale agent problems. However, the effectiveness of parameter sharing largely depends on the environment setting. When…

Artificial Intelligence · Computer Science 2025-03-04 Dapeng Li , Na Lou , Bin Zhang , Zhiwei Xu , Guoliang Fan

Efficient Computation Sharing for Multi-Task Visual Scene Understanding

Solving multiple visual tasks using individual models can be resource-intensive, while multi-task learning can conserve resources by sharing knowledge across different tasks. Despite the benefits of multi-task learning, such techniques can…

Computer Vision and Pattern Recognition · Computer Science 2023-08-16 Sara Shoouri , Mingyu Yang , Zichen Fan , Hun-Seok Kim

TAPAS: Two-pass Approximate Adaptive Sampling for Softmax

TAPAS is a novel adaptive sampling method for the softmax model. It uses a two pass sampling strategy where the examples used to approximate the gradient of the partition function are first sampled according to a squashed population…

Machine Learning · Computer Science 2017-07-17 Yu Bai , Sally Goldman , Li Zhang

TapWeight: Reweighting Pretraining Objectives for Task-Adaptive Pretraining

Large-scale general domain pretraining followed by downstream-specific finetuning has become a predominant paradigm in machine learning. However, discrepancies between the pretraining and target domains can still lead to performance…

Machine Learning · Computer Science 2024-10-15 Ruiyi Zhang , Sai Ashish Somayajula , Pengtao Xie

Boosting Share Routing for Multi-task Learning

Multi-task learning (MTL) aims to make full use of the knowledge contained in multi-task supervision signals to improve the overall performance. How to make the knowledge of multiple tasks shared appropriately is an open problem for MTL.…

Machine Learning · Computer Science 2021-03-02 Xiaokai Chen , Xiaoguang Gu , Libo Fu

Parameter-efficient Multi-task Fine-tuning for Transformers via Shared Hypernetworks

State-of-the-art parameter-efficient fine-tuning methods rely on introducing adapter modules between the layers of a pretrained language model. However, such modules are trained separately for each task and thus do not enable sharing…

Computation and Language · Computer Science 2021-06-09 Rabeeh Karimi Mahabadi , Sebastian Ruder , Mostafa Dehghani , James Henderson

Leveraging convergence behavior to balance conflicting tasks in multi-task learning

Multi-Task Learning is a learning paradigm that uses correlated tasks to improve performance generalization. A common way to learn multiple tasks is through the hard parameter sharing approach, in which a single architecture is used to…

Machine Learning · Computer Science 2022-04-15 Angelica Tiemi Mizuno Nakamura , Denis Fernando Wolf , Valdir Grassi

Optimizing Specific and Shared Parameters for Efficient Parameter Tuning

Foundation models, with a vast number of parameters and pretraining on massive datasets, achieve state-of-the-art performance across various applications. However, efficiently adapting them to downstream tasks with minimal computational…

Machine Learning · Computer Science 2025-04-07 Van-Anh Nguyen , Thanh-Toan Do , Mehrtash Harandi , Dinh Phung , Trung Le

Learning Task Grouping and Overlap in Multi-task Learning

In the paradigm of multi-task learning, mul- tiple related prediction tasks are learned jointly, sharing information across the tasks. We propose a framework for multi-task learn- ing that enables one to selectively share the information…

Machine Learning · Computer Science 2012-07-03 Abhishek Kumar , Hal Daume

Scaling Multi-Agent Reinforcement Learning with Selective Parameter Sharing

Sharing parameters in multi-agent deep reinforcement learning has played an essential role in allowing algorithms to scale to a large number of agents. Parameter sharing between agents significantly decreases the number of trainable…

Multiagent Systems · Computer Science 2021-06-15 Filippos Christianos , Georgios Papoudakis , Arrasy Rahman , Stefano V. Albrecht

DynaShare: Task and Instance Conditioned Parameter Sharing for Multi-Task Learning

Multi-task networks rely on effective parameter sharing to achieve robust generalization across tasks. In this paper, we present a novel parameter sharing method for multi-task learning that conditions parameter sharing on both the task and…

Computer Vision and Pattern Recognition · Computer Science 2023-05-30 Elahe Rahimian , Golara Javadi , Frederick Tung , Gabriel Oliveira

Model Predictive Task Sampling for Efficient and Robust Adaptation

Foundation models have revolutionized general-purpose problem-solving, offering rapid task adaptation through pretraining, meta-training, and finetuning. Recent crucial advances in these paradigms reveal the importance of challenging task…

Machine Learning · Computer Science 2025-10-21 Qi Wang , Zehao Xiao , Yixiu Mao , Yun Qu , Jiayi Shen , Yiqin Lv , Xiangyang Ji

K for the Price of 1: Parameter-efficient Multi-task and Transfer Learning

We introduce a novel method that enables parameter-efficient transfer and multi-task learning with deep neural networks. The basic approach is to learn a model patch - a small set of parameters - that will specialize to each task, instead…

Machine Learning · Computer Science 2019-02-26 Pramod Kaushik Mudrakarta , Mark Sandler , Andrey Zhmoginov , Andrew Howard

Adaptive Task Sampling for Meta-Learning

Meta-learning methods have been extensively studied and applied in computer vision, especially for few-shot classification tasks. The key idea of meta-learning for few-shot classification is to mimic the few-shot situations faced at test…

Computer Vision and Pattern Recognition · Computer Science 2020-07-20 Chenghao Liu , Zhihao Wang , Doyen Sahoo , Yuan Fang , Kun Zhang , Steven C. H. Hoi

Data Efficient Training for Reinforcement Learning with Adaptive Behavior Policy Sharing

Deep Reinforcement Learning (RL) is proven powerful for decision making in simulated environments. However, training deep RL model is challenging in real world applications such as production-scale health-care or recommender systems because…

Machine Learning · Computer Science 2020-02-14 Ge Liu , Rui Wu , Heng-Tze Cheng , Jing Wang , Jayden Ooi , Lihong Li , Ang Li , Wai Lok Sibon Li , Craig Boutilier , Ed Chi

TAPAS: Fast and Automatic Derivation of Tensor Parallel Strategies for Large Neural Networks

Tensor parallelism is an essential technique for distributed training of large neural networks. However, automatically determining an optimal tensor parallel strategy is challenging due to the gigantic search space, which grows…

Machine Learning · Computer Science 2025-08-06 Ziji Shi , Le Jiang , Ang Wang , Jie Zhang , Chencan Wu , Yong Li , Xiaokui Xiao , Wei Lin , Jialin Li