Related papers: Device Tuning for Multi-Task Large Model

CUT: Pruning Pre-Trained Multi-Task Models into Compact Models for Edge Devices

Multi-task learning has garnered widespread attention in the industry due to its efficient data utilization and strong generalization capabilities, making it particularly suitable for providing high-quality intelligent services to users.…

Machine Learning · Computer Science 2026-01-06 Jingxuan Zhou , Weidong Bao , Ji Wang , Zhengyi Zhong

muNet: Evolving Pretrained Deep Neural Networks into Scalable Auto-tuning Multitask Systems

Most uses of machine learning today involve training a model from scratch for a particular task, or sometimes starting with a model pretrained on a related task and then fine-tuning on a downstream task. Both approaches offer limited…

Machine Learning · Computer Science 2022-05-26 Andrea Gesmundo , Jeff Dean

Model Reprogramming: Resource-Efficient Cross-Domain Machine Learning

In data-rich domains such as vision, language, and speech, deep learning prevails to deliver high-performance task-specific models and can even learn general task-agnostic representations for efficient finetuning to downstream tasks.…

Machine Learning · Computer Science 2023-12-07 Pin-Yu Chen

Multi-Task Learning for Dense Prediction Tasks: A Survey

With the advent of deep learning, many dense prediction tasks, i.e. tasks that produce pixel-level predictions, have seen significant performance improvements. The typical approach is to learn these tasks in isolation, that is, a separate…

Computer Vision and Pattern Recognition · Computer Science 2021-01-26 Simon Vandenhende , Stamatios Georgoulis , Wouter Van Gansbeke , Marc Proesmans , Dengxin Dai , Luc Van Gool

Design Perspectives of Multitask Deep Learning Models and Applications

In recent years, multi-task learning has turned out to be of great success in various applications. Though single model training has promised great results throughout these years, it ignores valuable information that might help us estimate…

Machine Learning · Computer Science 2022-09-28 Yeshwant Singh , Anupam Biswas , Angshuman Bora , Debashish Malakar , Subham Chakraborty , Suman Bera

Cost-Effective Training of Deep CNNs with Active Model Adaptation

Deep convolutional neural networks have achieved great success in various applications. However, training an effective DNN model for a specific task is rather challenging because it requires a prior knowledge or experience to design the…

Machine Learning · Computer Science 2018-06-06 Sheng-Jun Huang , Jia-Wei Zhao , Zhao-Yang Liu

Muppet: Massive Multi-task Representations with Pre-Finetuning

We propose pre-finetuning, an additional large-scale learning stage between language model pre-training and fine-tuning. Pre-finetuning is massively multi-task learning (around 50 datasets, over 4.8 million total labeled examples), and is…

Computation and Language · Computer Science 2021-01-28 Armen Aghajanyan , Anchit Gupta , Akshat Shrivastava , Xilun Chen , Luke Zettlemoyer , Sonal Gupta

Efficient Inter-Task Attention for Multitask Transformer Models

In both Computer Vision and the wider Deep Learning field, the Transformer architecture is well-established as state-of-the-art for many applications. For Multitask Learning, however, where there may be many more queries necessary compared…

Computer Vision and Pattern Recognition · Computer Science 2025-08-07 Christian Bohn , Thomas Kurbiel , Klaus Friedrichs , Hasan Tercan , Tobias Meisen

DiTASK: Multi-Task Fine-Tuning with Diffeomorphic Transformations

Pre-trained Vision Transformers now serve as powerful tools for computer vision. Yet, efficiently adapting them for multiple tasks remains a challenge that arises from the need to modify the rich hidden representations encoded by the…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Krishna Sri Ipsit Mantri , Carola-Bibiane Schönlieb , Bruno Ribeiro , Chaim Baskin , Moshe Eliasof

On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning

Intelligent agents should have the ability to leverage knowledge from previously learned tasks in order to learn new ones quickly and efficiently. Meta-learning approaches have emerged as a popular solution to achieve this. However,…

Machine Learning · Computer Science 2023-02-17 Zhao Mandi , Pieter Abbeel , Stephen James

Efficient Conditional Pre-training for Transfer Learning

Almost all the state-of-the-art neural networks for computer vision tasks are trained by (1) pre-training on a large-scale dataset and (2) finetuning on the target dataset. This strategy helps reduce dependence on the target dataset and…

Computer Vision and Pattern Recognition · Computer Science 2021-11-22 Shuvam Chakraborty , Burak Uzkent , Kumar Ayush , Kumar Tanmay , Evan Sheehan , Stefano Ermon

Self-Tuning for Data-Efficient Deep Learning

Deep learning has made revolutionary advances to diverse applications in the presence of large-scale labeled datasets. However, it is prohibitively time-costly and labor-expensive to collect sufficient labeled data in most realistic…

Machine Learning · Computer Science 2021-07-22 Ximei Wang , Jinghan Gao , Mingsheng Long , Jianmin Wang

Continual Learning with Pretrained Backbones by Tuning in the Input Space

The intrinsic difficulty in adapting deep learning models to non-stationary environments limits the applicability of neural networks to real-world tasks. This issue is critical in practical supervised learning settings, such as the ones in…

Machine Learning · Computer Science 2023-06-09 Simone Marullo , Matteo Tiezzi , Marco Gori , Stefano Melacci , Tinne Tuytelaars

ClassyTune: A Performance Auto-Tuner for Systems in the Cloud

Performance tuning can improve the system performance and thus enable the reduction of cloud computing resources needed to support an application. Due to the ever increasing number of parameters and complexity of systems, there is a…

Performance · Computer Science 2019-10-15 Yuqing Zhu , Jianxun Liu

Understanding and Mitigating the Label Noise in Pre-training on Downstream Tasks

Pre-training on large-scale datasets and then fine-tuning on downstream tasks have become a standard practice in deep learning. However, pre-training data often contain label noise that may adversely affect the generalization of the model.…

Machine Learning · Computer Science 2024-03-12 Hao Chen , Jindong Wang , Ankit Shah , Ran Tao , Hongxin Wei , Xing Xie , Masashi Sugiyama , Bhiksha Raj

One Framework to Rule Them All: Unifying Multimodal Tasks with LLM Neural-Tuning

Large-scale models have exhibited remarkable capabilities across diverse domains, including automated medical services and intelligent customer support. However, as most large models are trained on single-modality corpora, enabling them to…

Computer Vision and Pattern Recognition · Computer Science 2025-08-26 Hao Sun , Yu Song , Jiaqing Liu , Jihong Hu , Yen-Wei Chen , Lanfen Lin

SMART: Self-supervised Multi-task pretrAining with contRol Transformers

Self-supervised pretraining has been extensively studied in language and vision domains, where a unified model can be easily adapted to various downstream tasks by pretraining representations without explicit labels. When it comes to…

Machine Learning · Computer Science 2023-01-25 Yanchao Sun , Shuang Ma , Ratnesh Madaan , Rogerio Bonatti , Furong Huang , Ashish Kapoor

Multi-task pre-training of deep neural networks for digital pathology

In this work, we investigate multi-task learning as a way of pre-training models for classification tasks in digital pathology. It is motivated by the fact that many small and medium-size datasets have been released by the community over…

Image and Video Processing · Electrical Eng. & Systems 2020-05-19 Romain Mormont , Pierre Geurts , Raphaël Marée

AnyTaskTune: Advanced Domain-Specific Solutions through Task-Fine-Tuning

The pervasive deployment of Large Language Models-LLMs in various sectors often neglects the nuanced requirements of individuals and small organizations, who benefit more from models precisely tailored to their specific business contexts…

Computation and Language · Computer Science 2024-07-10 Jiaxi Cui , Wentao Zhang , Jing Tang , Xudong Tong , Zhenwei Zhang , Amie , Jing Wen , Rongsheng Wang , Pengfei Wu

Less is More: Selective Layer Finetuning with SubTuning

Finetuning a pretrained model has become a standard approach for training neural networks on novel tasks, resulting in fast convergence and improved performance. In this work, we study an alternative finetuning method, where instead of…

Machine Learning · Computer Science 2023-07-04 Gal Kaplun , Andrey Gurevich , Tal Swisa , Mazor David , Shai Shalev-Shwartz , Eran Malach