English
Related papers

Related papers: Transforming Vision Transformer: Towards Efficient…

200 papers

The pretrain+fine-tune paradigm is foundational for deploying large language models (LLMs) across various downstream applications. Within this framework, Low-Rank Adaptation (LoRA) stands out for its parameter-efficient fine-tuning (PEFT),…

Computation and Language · Computer Science 2024-10-10 Jingwei Xu , Junyu Lai , Yunpeng Huang

Multi-task learning (MTL) encapsulates multiple learned tasks in a single model and often lets those tasks learn better jointly. However, when deploying MTL onto those real-world systems that are often resource-constrained or…

Computer Vision and Pattern Recognition · Computer Science 2022-10-27 Hanxue Liang , Zhiwen Fan , Rishov Sarkar , Ziyu Jiang , Tianlong Chen , Kai Zou , Yu Cheng , Cong Hao , Zhangyang Wang

Pre-trained language models (PLMs) demonstrate remarkable intelligence but struggle with emerging tasks unseen during training in real-world applications. Training separate models for each new task is usually impractical. Multi-task…

Computation and Language · Computer Science 2025-05-02 Xiao Zhang , Kangsheng Wang , Tianyu Hu , Huimin Ma

Multi-task learning (MTL) aims to learn multiple tasks using a single model and jointly improve all of them assuming generalization and shared semantics. Reducing conflicts between tasks during joint learning is difficult and generally…

Computer Vision and Pattern Recognition · Computer Science 2023-10-02 Hung-Shuo Chang , Chien-Yao Wang , Richard Robert Wang , Gene Chou , Hong-Yuan Mark Liao

Parameter-efficient fine-tuning (PEFT) has been widely employed for domain adaptation, with LoRA being one of the most prominent methods due to its simplicity and effectiveness. However, in multi-task learning (MTL) scenarios, LoRA tends to…

Recent works on parameter-efficient transfer learning (PETL) show the potential to adapt a pre-trained Vision Transformer to downstream recognition tasks with only a few learnable parameters. However, since they usually insert new…

Computer Vision and Pattern Recognition · Computer Science 2024-07-16 Taolin Zhang , Jiawang Bai , Zhihe Lu , Dongze Lian , Genping Wang , Xinchao Wang , Shu-Tao Xia

Mixture-of-Experts (MoE) has emerged as a powerful framework for multi-task learning (MTL). However, existing MoE-MTL methods often rely on single-task pretrained backbones and suffer from redundant adaptation and inefficient knowledge…

Computer Vision and Pattern Recognition · Computer Science 2025-10-02 Minghao Yang , Ren Togo , Guang Li , Takahiro Ogawa , Miki Haseyama

Multi-task learning (MTL) aims to improve the generalization of several related tasks by learning them jointly. As a comparison, in addition to the joint training scheme, modern meta-learning allows unseen tasks with limited labels during…

Machine Learning · Computer Science 2021-06-17 Haoxiang Wang , Han Zhao , Bo Li

Self-attention and transformers have been widely used in deep learning. Recent efforts have been devoted to incorporating transformer blocks into different neural architectures, including those with convolutions, leading to various visual…

Computer Vision and Pattern Recognition · Computer Science 2025-07-22 Yancheng Wang , Yingzhen Yang

We propose an approach to Multitask Learning (MTL) to make deep learning models faster and lighter for applications in which multiple tasks need to be solved simultaneously, which is particularly useful in embedded, real-time systems. We…

Computer Vision and Pattern Recognition · Computer Science 2017-11-02 Miquel Martí , Atsuto Maki

Building scalable vision-language models to learn from diverse, multimodal data remains an open challenge. In this paper, we introduce an Efficient Vision-languagE foundation model, namely EVE, which is one unified multimodal Transformer…

Computer Vision and Pattern Recognition · Computer Science 2024-03-04 Junyi Chen , Longteng Guo , Jia Sun , Shuai Shao , Zehuan Yuan , Liang Lin , Dongyu Zhang

Despite recent advancements in offline multi-task reinforcement learning (MTRL) have harnessed the powerful capabilities of the Transformer architecture, most approaches focus on a limited number of tasks, with scaling to extremely massive…

Machine Learning · Computer Science 2025-06-02 Yilun Kong , Guozheng Ma , Qi Zhao , Haoyu Wang , Li Shen , Xueqian Wang , Dacheng Tao

Multi-task learning (MTL) aims to build general-purpose vision systems by training a single network to perform multiple tasks jointly. While promising, its potential is often hindered by "unbalanced optimization", where task interference…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Yihang Guo , Tianyuan Yu , Liang Bai , Yanming Guo , Yirun Ruan , William Li , Weishi Zheng

Parameter-Efficient Fine-Tuning (PEFT) methods like LoRA have significantly improved the adaptation of LLMs to downstream tasks in a resource-efficient manner. However, in multi-task scenarios, challenges such as training imbalance and the…

Computation and Language · Computer Science 2024-10-31 Xujia Wang , Haiyan Zhao , Shuo Wang , Hanqing Wang , Zhiyuan Liu

We propose Tensor-Trained Low-Rank Adaptation Mixture of Experts (TT-LoRA MoE), a novel computational framework integrating Parameter-Efficient Fine-Tuning (PEFT) with sparse MoE routing to address scalability challenges in large model…

Machine Learning · Computer Science 2026-01-27 Pradip Kunwar , Minh N. Vu , Maanak Gupta , Mahmoud Abdelsalam , Manish Bhattarai

Computer vision researchers are embracing two promising paradigms: Vision Transformers (ViTs) and Multi-task Learning (MTL), which both show great performance but are computation-intensive, given the quadratic complexity of self-attention…

Hardware Architecture · Computer Science 2023-09-14 Rishov Sarkar , Hanxue Liang , Zhiwen Fan , Zhangyang Wang , Cong Hao

The combination of Mixture-of-Experts (MoE) and Low-Rank Adaptation (LoRA) has shown significant potential for enhancing the multi-task learning capabilities of Large Language Models. However, existing methods face two primary challenges:…

Computation and Language · Computer Science 2026-04-22 Boyan Shi , Wei Chen , Shuyuan Zhao , Junfeng Shen , Shengnan Guo , Shaojiang Wang , Huaiyu Wan

Multi-task learning (MTL) is a machine learning technique aiming to improve model performance by leveraging information across many tasks. It has been used extensively on various data modalities, including electronic health record (EHR)…

Machine Learning · Computer Science 2020-07-21 Matthew B. A. McDermott , Bret Nestor , Evan Kim , Wancong Zhang , Anna Goldenberg , Peter Szolovits , Marzyeh Ghassemi

Multi-task learning aims to explore task relatedness to improve individual tasks, which is of particular significance in the challenging scenario that only limited data is available for each task. To tackle this challenge, we propose…

Machine Learning · Computer Science 2021-11-10 Jiayi Shen , Xiantong Zhen , Marcel Worring , Ling Shao

Fine-tuning large language models (LLMs) is computationally expensive, and Low-Rank Adaptation (LoRA) provides a cost-effective solution by approximating weight updates through low-rank matrices. In real-world scenarios, LLMs are fine-tuned…

Machine Learning · Computer Science 2025-06-03 Jinda Liu , Yi Chang , Yuan Wu
‹ Prev 1 2 3 10 Next ›