Related papers: Transforming Vision Transformer: Towards Efficient…

MeteoRA: Multiple-tasks Embedded LoRA for Large Language Models

The pretrain+fine-tune paradigm is foundational for deploying large language models (LLMs) across various downstream applications. Within this framework, Low-Rank Adaptation (LoRA) stands out for its parameter-efficient fine-tuning (PEFT),…

Computation and Language · Computer Science 2024-10-10 Jingwei Xu , Junyu Lai , Yunpeng Huang

M$^3$ViT: Mixture-of-Experts Vision Transformer for Efficient Multi-task Learning with Model-Accelerator Co-design

Multi-task learning (MTL) encapsulates multiple learned tasks in a single model and often lets those tasks learn better jointly. However, when deploying MTL onto those real-world systems that are often resource-constrained or…

Computer Vision and Pattern Recognition · Computer Science 2022-10-27 Hanxue Liang , Zhiwen Fan , Rishov Sarkar , Ziyu Jiang , Tianlong Chen , Kai Zou , Yu Cheng , Cong Hao , Zhangyang Wang

Efficient Knowledge Transfer in Multi-Task Learning through Task-Adaptive Low-Rank Representation

Pre-trained language models (PLMs) demonstrate remarkable intelligence but struggle with emerging tasks unseen during training in real-world applications. Training separate models for each new task is usually impractical. Multi-task…

Computation and Language · Computer Science 2025-05-02 Xiao Zhang , Kangsheng Wang , Tianyu Hu , Huimin Ma

YOLOR-Based Multi-Task Learning

Multi-task learning (MTL) aims to learn multiple tasks using a single model and jointly improve all of them assuming generalization and shared semantics. Reducing conflicts between tasks during joint learning is difficult and generally…

Computer Vision and Pattern Recognition · Computer Science 2023-10-02 Hung-Shuo Chang , Chien-Yao Wang , Richard Robert Wang , Gene Chou , Hong-Yuan Mark Liao

MTL-LoRA: Low-Rank Adaptation for Multi-Task Learning

Parameter-efficient fine-tuning (PEFT) has been widely employed for domain adaptation, with LoRA being one of the most prominent methods due to its simplicity and effectiveness. However, in multi-task learning (MTL) scenarios, LoRA tends to…

Machine Learning · Computer Science 2025-04-02 Yaming Yang , Dilxat Muhtar , Yelong Shen , Yuefeng Zhan , Jianfeng Liu , Yujing Wang , Hao Sun , Denvy Deng , Feng Sun , Qi Zhang , Weizhu Chen , Yunhai Tong

Parameter-Efficient and Memory-Efficient Tuning for Vision Transformer: A Disentangled Approach

Recent works on parameter-efficient transfer learning (PETL) show the potential to adapt a pre-trained Vision Transformer to downstream recognition tasks with only a few learnable parameters. However, since they usually insert new…

Computer Vision and Pattern Recognition · Computer Science 2024-07-16 Taolin Zhang , Jiawang Bai , Zhihe Lu , Dongze Lian , Genping Wang , Xinchao Wang , Shu-Tao Xia

Adaptive Shared Experts with LoRA-Based Mixture of Experts for Multi-Task Learning

Mixture-of-Experts (MoE) has emerged as a powerful framework for multi-task learning (MTL). However, existing MoE-MTL methods often rely on single-task pretrained backbones and suffer from redundant adaptation and inefficient knowledge…

Computer Vision and Pattern Recognition · Computer Science 2025-10-02 Minghao Yang , Ren Togo , Guang Li , Takahiro Ogawa , Miki Haseyama

Bridging Multi-Task Learning and Meta-Learning: Towards Efficient Training and Effective Adaptation

Multi-task learning (MTL) aims to improve the generalization of several related tasks by learning them jointly. As a comparison, in addition to the joint training scheme, modern meta-learning allows unseen tasks with limited labels during…

Machine Learning · Computer Science 2021-06-17 Haoxiang Wang , Han Zhao , Bo Li

Efficient Visual Transformer by Learnable Token Merging

Self-attention and transformers have been widely used in deep learning. Recent efforts have been devoted to incorporating transformer blocks into different neural architectures, including those with convolutions, leading to various visual…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Yancheng Wang , Yingzhen Yang

A multitask deep learning model for real-time deployment in embedded systems

We propose an approach to Multitask Learning (MTL) to make deep learning models faster and lighter for applications in which multiple tasks need to be solved simultaneously, which is particularly useful in embedded, real-time systems. We…

Computer Vision and Pattern Recognition · Computer Science 2017-11-02 Miquel Martí , Atsuto Maki

EVE: Efficient Vision-Language Pre-training with Masked Prediction and Modality-Aware MoE

Building scalable vision-language models to learn from diverse, multimodal data remains an open challenge. In this paper, we introduce an Efficient Vision-languagE foundation model, namely EVE, which is one unified multimodal Transformer…

Computer Vision and Pattern Recognition · Computer Science 2024-03-04 Junyi Chen , Longteng Guo , Jia Sun , Shuai Shao , Zehuan Yuan , Liang Lin , Dongyu Zhang

Mastering Massive Multi-Task Reinforcement Learning via Mixture-of-Expert Decision Transformer

Despite recent advancements in offline multi-task reinforcement learning (MTRL) have harnessed the powerful capabilities of the Transformer architecture, most approaches focus on a limited number of tasks, with scaling to extremely massive…

Machine Learning · Computer Science 2025-06-02 Yilun Kong , Guozheng Ma , Qi Zhao , Haoyu Wang , Li Shen , Xueqian Wang , Dacheng Tao

Revisit the Imbalance Optimization in Multi-task Learning: An Experimental Analysis

Multi-task learning (MTL) aims to build general-purpose vision systems by training a single network to perform multiple tasks jointly. While promising, its potential is often hindered by "unbalanced optimization", where task interference…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Yihang Guo , Tianyuan Yu , Liang Bai , Yanming Guo , Yirun Ruan , William Li , Weishi Zheng

MALoRA: Mixture of Asymmetric Low-Rank Adaptation for Enhanced Multi-Task Learning

Parameter-Efficient Fine-Tuning (PEFT) methods like LoRA have significantly improved the adaptation of LLMs to downstream tasks in a resource-efficient manner. However, in multi-task scenarios, challenges such as training imbalance and the…

Computation and Language · Computer Science 2024-10-31 Xujia Wang , Haiyan Zhao , Shuo Wang , Hanqing Wang , Zhiyuan Liu

TT-LoRA MoE: Unifying Parameter-Efficient Fine-Tuning and Sparse Mixture-of-Experts

We propose Tensor-Trained Low-Rank Adaptation Mixture of Experts (TT-LoRA MoE), a novel computational framework integrating Parameter-Efficient Fine-Tuning (PEFT) with sparse MoE routing to address scalability challenges in large model…

Machine Learning · Computer Science 2026-01-27 Pradip Kunwar , Minh N. Vu , Maanak Gupta , Mahmoud Abdelsalam , Manish Bhattarai

Edge-MoE: Memory-Efficient Multi-Task Vision Transformer Architecture with Task-level Sparsity via Mixture-of-Experts

Computer vision researchers are embracing two promising paradigms: Vision Transformers (ViTs) and Multi-task Learning (MTL), which both show great performance but are computation-intensive, given the quadratic complexity of self-attention…

Hardware Architecture · Computer Science 2023-09-14 Rishov Sarkar , Hanxue Liang , Zhiwen Fan , Zhangyang Wang , Cong Hao

SAMoRA: Semantic-Aware Mixture of LoRA Experts for Task-Adaptive Learning

The combination of Mixture-of-Experts (MoE) and Low-Rank Adaptation (LoRA) has shown significant potential for enhancing the multi-task learning capabilities of Large Language Models. However, existing methods face two primary challenges:…

Computation and Language · Computer Science 2026-04-22 Boyan Shi , Wei Chen , Shuyuan Zhao , Junfeng Shen , Shengnan Guo , Shaojiang Wang , Huaiyu Wan

A Comprehensive Evaluation of Multi-task Learning and Multi-task Pre-training on EHR Time-series Data

Multi-task learning (MTL) is a machine learning technique aiming to improve model performance by leveraging information across many tasks. It has been used extensively on various data modalities, including electronic health record (EHR)…

Machine Learning · Computer Science 2020-07-21 Matthew B. A. McDermott , Bret Nestor , Evan Kim , Wancong Zhang , Anna Goldenberg , Peter Szolovits , Marzyeh Ghassemi

Variational Multi-Task Learning with Gumbel-Softmax Priors

Multi-task learning aims to explore task relatedness to improve individual tasks, which is of particular significance in the challenging scenario that only limited data is available for each task. To tackle this challenge, we propose…

Machine Learning · Computer Science 2021-11-10 Jiayi Shen , Xiantong Zhen , Marcel Worring , Ling Shao

R-LoRA: Randomized Multi-Head LoRA for Efficient Multi-Task Learning

Fine-tuning large language models (LLMs) is computationally expensive, and Low-Rank Adaptation (LoRA) provides a cost-effective solution by approximating weight updates through low-rank matrices. In real-world scenarios, LLMs are fine-tuned…

Machine Learning · Computer Science 2025-06-03 Jinda Liu , Yi Chang , Yuan Wu