English
Related papers

Related papers: Deploying Multi-task Online Server with Large Lang…

200 papers

Fine-tuning pre-trained generative language models to down-stream language generation tasks has shown promising results. However, this comes with the cost of having a single, large model for each task, which is not ideal in low-memory/power…

Computation and Language · Computer Science 2020-09-22 Zhaojiang Lin , Andrea Madotto , Pascale Fung

Fine-tuning a task-specific multilingual large language model (LLM) involves training the model on a multilingual dataset with examples in all the required languages. Updating one or more supported languages with additional data or adding…

Computation and Language · Computer Science 2026-01-26 Alphaeus Dmonte , Vidhi Gupta , Daniel J Perry , Mark Arehart

This paper explores the multi-dimensional challenges faced during the development of Large Language Models (LLMs), including the massive scale of model parameters and file sizes, the complexity of development environment configuration, the…

Artificial Intelligence · Computer Science 2025-01-14 Wilson Wei , Nicholas Chen , Yuxuan Li

Multi-modal Large Language Model (MLLM) refers to a model expanded from a Large Language Model (LLM) that possesses the capability to handle and infer multi-modal data. Current MLLMs typically begin by using LLMs to decompose tasks into…

Computation and Language · Computer Science 2023-09-01 Yongqiang Zhao , Zhenyu Li , Feng Zhang , Xinhai Xu , Donghong Liu

Multi-task learning aims to learn multiple tasks jointly by exploiting their relatedness to improve the generalization performance for each task. Traditionally, to perform multi-task learning, one needs to centralize data from all the tasks…

Machine Learning · Computer Science 2017-06-21 Sulin Liu , Sinno Jialin Pan , Qirong Ho

This study aims to explore the performance improvement method of large language models based on GPT-4 under the multi-task learning framework and conducts experiments on two tasks: text classification and automatic summary generation.…

Computation and Language · Computer Science 2024-12-10 Zhen Qi , Jiajing Chen , Shuo Wang , Bingying Liu , Hongye Zheng , Chihang Wang

Transfer and multi-task learning have traditionally focused on either a single source-target pair or very few, similar tasks. Ideally, the linguistic levels of morphology, syntax and semantics would benefit each other by being trained in a…

Computation and Language · Computer Science 2017-07-25 Kazuma Hashimoto , Caiming Xiong , Yoshimasa Tsuruoka , Richard Socher

Large Language Models (LLMs) have significantly advanced artificial intelligence by optimizing traditional Natural Language Processing (NLP) workflows, facilitating their integration into various systems. Many such NLP systems, including…

Computation and Language · Computer Science 2025-05-13 Jiliang Ni , Jiachen Pu , Zhongyi Yang , Kun Zhou , Hui Wang , Xiaoliang Xiao , Dakui Wang , Xin Li , Jingfeng Luo , Conggang Hu

Much of vision-and-language research focuses on a small but diverse set of independent tasks and supporting datasets often studied in isolation; however, the visually-grounded language understanding skills required for success at these…

Computer Vision and Pattern Recognition · Computer Science 2020-04-28 Jiasen Lu , Vedanuj Goswami , Marcus Rohrbach , Devi Parikh , Stefan Lee

While general-purpose large language models (LLMs) demonstrate proficiency on multiple tasks within the domain of translation, approaches based on open LLMs are competitive only when specializing on a single task. In this paper, we propose…

We propose an approach to Multitask Learning (MTL) to make deep learning models faster and lighter for applications in which multiple tasks need to be solved simultaneously, which is particularly useful in embedded, real-time systems. We…

Computer Vision and Pattern Recognition · Computer Science 2017-11-02 Miquel Martí , Atsuto Maki

Multimodal Large Language Models (MLLMs) have achieved significant success in Speech-to-Text Translation (S2TT) tasks. While most existing research has focused on English-centric translation directions, the exploration of many-to-many…

Computation and Language · Computer Science 2025-06-17 Yexing Du , Youcheng Pan , Ziyang Ma , Bo Yang , Yifan Yang , Keqi Deng , Xie Chen , Yang Xiang , Ming Liu , Bing Qin

Despite the recent progress in deep learning, most approaches still go for a silo-like solution, focusing on learning each task in isolation: training a separate neural network for each individual task. Many real-world problems, however,…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Simon Vandenhende

Large language models (LLMs) are reshaping automated fact-checking (AFC) by enabling unified, end-to-end verification pipelines rather than isolated components. While large proprietary models achieve strong performance, their closed…

Computation and Language · Computer Science 2026-01-19 Malin Astrid Larsson , Harald Fosen Grunnaleite , Vinay Setty

Large Language Model (LLM) agents significantly extend the capabilities of standalone LLMs, empowering them to interact with external tools (e.g., APIs, functions) and complete various tasks in a self-directed fashion. The challenge of tool…

Artificial Intelligence · Computer Science 2024-02-19 Weizhou Shen , Chenliang Li , Hongzhan Chen , Ming Yan , Xiaojun Quan , Hehong Chen , Ji Zhang , Fei Huang

Multi-Task Learning (MTL) is a learning paradigm in machine learning and its aim is to leverage useful information contained in multiple related tasks to help improve the generalization performance of all the tasks. In this paper, we give a…

Machine Learning · Computer Science 2021-03-30 Yu Zhang , Qiang Yang

To train neural machine translation models simultaneously on multiple tasks (languages), it is common to sample each task uniformly or in proportion to dataset sizes. As these methods offer little control over performance trade-offs, we…

Machine Learning · Computer Science 2019-09-17 Sébastien Jean , Orhan Firat , Melvin Johnson

Large language models have recently surpassed specialized systems on code generation, yet their effectiveness on other code-analysis tasks remains less clear. At the same time, multi-task learning offers a way to unify diverse objectives…

Software Engineering · Computer Science 2026-03-12 Amal Akli , Maxime Cordy , Mike Papadakis , Yves Le Traon

A common training approach for language models involves using a large-scale language model to expand a human-provided dataset, which is subsequently used for model training.This method significantly reduces training costs by eliminating the…

Computation and Language · Computer Science 2025-07-09 Minghang Zhu , Shen Gao , Zhengliang Shi , Jiabao Fang , Pengjie Ren , Zhaochun Ren , Zhumin Chen , Shuo Shang

Deep learning approaches have achieved great success in the field of Natural Language Processing (NLP). However, directly training deep neural models often suffer from overfitting and data scarcity problems that are pervasive in NLP tasks.…

Artificial Intelligence · Computer Science 2024-04-30 Shijie Chen , Yu Zhang , Qiang Yang
‹ Prev 1 2 3 10 Next ›