Related papers: Deploying Multi-task Online Server with Large Lang…

Exploring Versatile Generative Language Model Via Parameter-Efficient Transfer Learning

Fine-tuning pre-trained generative language models to down-stream language generation tasks has shown promising results. However, this comes with the cost of having a single, large model for each task, which is not ideal in low-memory/power…

Computation and Language · Computer Science 2020-09-22 Zhaojiang Lin, Andrea Madotto, Pascale Fung

Improving Training Efficiency and Reducing Maintenance Costs via Language Specific Model Merging

Fine-tuning a task-specific multilingual large language model (LLM) involves training the model on a multilingual dataset with examples in all the required languages. Updating one or more supported languages with additional data or adding…

Computation and Language · Computer Science 2026-01-26 Alphaeus Dmonte, Vidhi Gupta, Daniel J Perry, Mark Arehart

The Internet of Large Language Models: An Orchestration Framework for LLM Training and Knowledge Exchange Toward Artificial General Intelligence

This paper explores the multi-dimensional challenges faced during the development of Large Language Models (LLMs), including the massive scale of model parameters and file sizes, the complexity of development environment configuration, the…

Artificial Intelligence · Computer Science 2025-01-14 Wilson Wei, Nicholas Chen, Yuxuan Li

Enhancing Subtask Performance of Multi-modal Large Language Model

Multi-modal Large Language Model (MLLM) refers to a model expanded from a Large Language Model (LLM) that possesses the capability to handle and infer multi-modal data. Current MLLMs typically begin by using LLMs to decompose tasks into…

Computation and Language · Computer Science 2023-09-01 Yongqiang Zhao, Zhenyu Li, Feng Zhang, Xinhai Xu, Donghong Liu

Distributed Multi-Task Relationship Learning

Multi-task learning aims to learn multiple tasks jointly by exploiting their relatedness to improve the generalization performance for each task. Traditionally, to perform multi-task learning, one needs to centralize data from all the tasks…

Machine Learning · Computer Science 2017-06-21 Sulin Liu, Sinno Jialin Pan, Qirong Ho

Optimizing Multi-Task Learning for Enhanced Performance in Large Language Models

This study aims to explore the performance improvement method of large language models based on GPT-4 under the multi-task learning framework and conducts experiments on two tasks: text classification and automatic summary generation.…

Computation and Language · Computer Science 2024-12-10 Zhen Qi, Jiajing Chen, Shuo Wang, Bingying Liu, Hongye Zheng, Chihang Wang

A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks

Transfer and multi-task learning have traditionally focused on either a single source-target pair or very few, similar tasks. Ideally, the linguistic levels of morphology, syntax and semantics would benefit each other by being trained in a…

Computation and Language · Computer Science 2017-07-25 Kazuma Hashimoto, Caiming Xiong, Yoshimasa Tsuruoka, Richard Socher

From Large to Super-Tiny: End-to-End Optimization for Cost-Efficient LLMs

Large Language Models (LLMs) have significantly advanced artificial intelligence by optimizing traditional Natural Language Processing (NLP) workflows, facilitating their integration into various systems. Many such NLP systems, including…

Computation and Language · Computer Science 2025-05-13 Jiliang Ni, Jiachen Pu, Zhongyi Yang, Kun Zhou, Hui Wang, Xiaoliang Xiao, Dakui Wang, Xin Li, Jingfeng Luo, Conggang Hu

12-in-1: Multi-Task Vision and Language Representation Learning

Much of vision-and-language research focuses on a small but diverse set of independent tasks and supporting datasets often studied in isolation; however, the visually-grounded language understanding skills required for success at these…

Computer Vision and Pattern Recognition · Computer Science 2020-04-28 Jiasen Lu, Vedanuj Goswami, Marcus Rohrbach, Devi Parikh, Stefan Lee

Tower: An Open Multilingual Large Language Model for Translation-Related Tasks

While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose…

Computation and Language · Computer Science 2024-02-28 Duarte M. Alves, José Pombal, Nuno M. Guerreiro, Pedro H. Martins, João Alves, Amin Farajian, Ben Peters, Ricardo Rei, Patrick Fernandes, Sweta Agrawal, Pierre Colombo, José G. C. de Souza, André F. T. Martins

A multitask deep learning model for real-time deployment in embedded systems

We propose an approach to Multitask Learning (MTL) to make deep learning models faster and lighter for applications in which multiple tasks need to be solved simultaneously, which is particularly useful in embedded, real-time systems. We…

Computer Vision and Pattern Recognition · Computer Science 2017-11-02 Miquel Martí, Atsuto Maki

Making LLMs Better Many-to-Many Speech-to-Text Translators with Curriculum Learning

Multimodal Large Language Models (MLLMs) have achieved significant success in Speech-to-Text Translation (S2TT) tasks. While most existing research has focused on English-centric translation directions, the exploration of many-to-many…

Computation and Language · Computer Science 2025-06-17 Yexing Du, Youcheng Pan, Ziyang Ma, Bo Yang, Yifan Yang, Keqi Deng, Xie Chen, Yang Xiang, Ming Liu, Bing Qin

Multi-Task Learning for Visual Scene Understanding

Despite the recent progress in deep learning, most approaches still go for a silo-like solution, focusing on learning each task in isolation: training a separate neural network for each individual task. Many real-world problems, however,…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Simon Vandenhende

One LLM to Train Them All: Multi-Task Learning Framework for Fact-Checking

Large language models (LLMs) are reshaping automated fact-checking (AFC) by enabling unified, end-to-end verification pipelines rather than isolated components. While large proprietary models achieve strong performance, their closed…

Computation and Language · Computer Science 2026-01-19 Malin Astrid Larsson, Harald Fosen Grunnaleite, Vinay Setty

Small LLMs Are Weak Tool Learners: A Multi-LLM Agent

Large Language Model (LLM) agents significantly extend the capabilities of standalone LLMs, empowering them to interact with external tools (e.g., APIs, functions) and complete various tasks in a self-directed fashion. The challenge of tool…

Artificial Intelligence · Computer Science 2024-02-19 Weizhou Shen, Chenliang Li, Hongzhan Chen, Ming Yan, Xiaojun Quan, Hehong Chen, Ji Zhang, Fei Huang

A Survey on Multi-Task Learning

Multi-Task Learning (MTL) is a learning paradigm in machine learning and its aim is to leverage useful information contained in multiple related tasks to help improve the generalization performance of all the tasks. In this paper, we give a…

Machine Learning · Computer Science 2021-03-30 Yu Zhang, Qiang Yang

Adaptive Scheduling for Multi-Task Learning

To train neural machine translation models simultaneously on multiple tasks (languages), it is common to sample each task uniformly or in proportion to dataset sizes. As these methods offer little control over performance trade-offs, we…

Machine Learning · Computer Science 2019-09-17 Sébastien Jean, Orhan Firat, Melvin Johnson

One Model, Many Skills: Parameter-Efficient Fine-Tuning for Multitask Code Analysis

Large language models have recently surpassed specialized systems on code generation, yet their effectiveness on other code-analysis tasks remains less clear. At the same time, multi-task learning offers a way to unify diverse objectives…

Software Engineering · Computer Science 2026-03-12 Amal Akli, Maxime Cordy, Mike Papadakis, Yves Le Traon

Evolution without Large Models: Training Language Model with Task Principles

A common training approach for language models involves using a large-scale language model to expand a human-provided dataset, which is subsequently used for model training. This method significantly reduces training costs by eliminating the…

Computation and Language · Computer Science 2025-07-09 Minghang Zhu, Shen Gao, Zhengliang Shi, Jiabao Fang, Pengjie Ren, Zhaochun Ren, Zhumin Chen, Shuo Shang

Multi-Task Learning in Natural Language Processing: An Overview

Deep learning approaches have achieved great success in the field of Natural Language Processing (NLP). However, directly training deep neural models often suffers from overfitting and data scarcity problems that are pervasive in NLP tasks.…

Artificial Intelligence · Computer Science 2024-04-30 Shijie Chen, Yu Zhang, Qiang Yang