Related papers: Improving Multitask Retrieval by Promoting Task Sp…

Multi-task Retrieval for Knowledge-Intensive Tasks

Retrieving relevant contexts from a large corpus is a crucial step for tasks such as open-domain question answering and fact checking. Although neural retrieval outperforms traditional methods like tf-idf and BM25, its performance degrades…

Computation and Language · Computer Science 2021-01-05 Jean Maillard , Vladimir Karpukhin , Fabio Petroni , Wen-tau Yih , Barlas Oğuz , Veselin Stoyanov , Gargi Ghosh

A Multi-Task Embedder For Retrieval Augmented LLMs

LLMs confront inherent limitations in terms of its knowledge, memory, and action. The retrieval augmentation stands as a vital mechanism to address these limitations, which brings in useful information from external sources to augment the…

Information Retrieval · Computer Science 2026-01-06 Peitian Zhang , Shitao Xiao , Zheng Liu , Zhicheng Dou , Jian-Yun Nie

A Unified Generative Retriever for Knowledge-Intensive Language Tasks via Prompt Learning

Knowledge-intensive language tasks (KILTs) benefit from retrieving high-quality relevant contexts from large external knowledge corpora. Learning task-specific retrievers that return relevant contexts at an appropriate level of semantic…

Information Retrieval · Computer Science 2023-05-01 Jiangui Chen , Ruqing Zhang , Jiafeng Guo , Maarten de Rijke , Yiqun Liu , Yixing Fan , Xueqi Cheng

A Survey of Multi-task Learning in Natural Language Processing: Regarding Task Relatedness and Training Methods

Multi-task learning (MTL) has become increasingly popular in natural language processing (NLP) because it improves the performance of related tasks by exploiting their commonalities and differences. Nevertheless, it is still not understood…

Computation and Language · Computer Science 2023-02-16 Zhihan Zhang , Wenhao Yu , Mengxia Yu , Zhichun Guo , Meng Jiang

Training a Utility-based Retriever Through Shared Context Attribution for Retrieval-Augmented Language Models

Retrieval-Augmented Language Models boost task performance, owing to the retriever that provides external knowledge. Although crucial, the retriever primarily focuses on semantics relevance, which may not always be effective for generation.…

Computation and Language · Computer Science 2026-02-03 Yilong Xu , Jinhua Gao , Xiaoming Yu , Yuanhai Xue , Baolong Bi , Huawei Shen , Xueqi Cheng

Multi-task Supervised Learning via Cross-learning

In this paper we consider a problem known as multi-task learning, consisting of fitting a set of classifier or regression functions intended for solving different tasks. In our novel formulation, we couple the parameters of these functions,…

Machine Learning · Computer Science 2021-05-28 Juan Cervino , Juan Andres Bazerque , Miguel Calvo-Fullana , Alejandro Ribeiro

Distributed Multi-Task Relationship Learning

Multi-task learning aims to learn multiple tasks jointly by exploiting their relatedness to improve the generalization performance for each task. Traditionally, to perform multi-task learning, one needs to centralize data from all the tasks…

Machine Learning · Computer Science 2017-06-21 Sulin Liu , Sinno Jialin Pan , Qirong Ho

On the Effectiveness of Fine-tuning Versus Meta-reinforcement Learning

Intelligent agents should have the ability to leverage knowledge from previously learned tasks in order to learn new ones quickly and efficiently. Meta-learning approaches have emerged as a popular solution to achieve this. However,…

Machine Learning · Computer Science 2023-02-17 Zhao Mandi , Pieter Abbeel , Stephen James

Exploring the Benefits of Training Expert Language Models over Instruction Tuning

Recently, Language Models (LMs) instruction-tuned on multiple tasks, also known as multitask-prompted fine-tuning (MT), have shown the capability to generalize to unseen tasks. Previous work has shown that scaling the number of training…

Computation and Language · Computer Science 2023-02-10 Joel Jang , Seungone Kim , Seonghyeon Ye , Doyoung Kim , Lajanugen Logeswaran , Moontae Lee , Kyungjae Lee , Minjoon Seo

NIR-Prompt: A Multi-task Generalized Neural Information Retrieval Training Framework

Information retrieval aims to find information that meets users' needs from the corpus. Different needs correspond to different IR tasks such as document retrieval, open-domain question answering, retrieval-based dialogue, etc., while they…

Information Retrieval · Computer Science 2023-12-19 Shicheng Xu , Liang Pang , Huawei Shen , Xueqi Cheng

Multi-Task Retrieval-Augmented Text Generation with Relevance Sampling

This paper studies multi-task training of retrieval-augmented generation models for knowledge-intensive tasks. We propose to clean the training set by utilizing a distinct property of knowledge-intensive generation: The connection of…

Computation and Language · Computer Science 2022-07-08 Sebastian Hofstätter , Jiecao Chen , Karthik Raman , Hamed Zamani

Multitask Prompt Tuning Enables Parameter-Efficient Transfer Learning

Prompt tuning, in which a base pretrained model is adapted to each task via conditioning on learned prompt vectors, has emerged as a promising approach for efficiently adapting large language models to multiple downstream tasks. However,…

Computation and Language · Computer Science 2023-03-07 Zhen Wang , Rameswar Panda , Leonid Karlinsky , Rogerio Feris , Huan Sun , Yoon Kim

Towards Enabling Meta-Learning from Target Models

Meta-learning can extract an inductive bias from previous learning experience and assist the training of new tasks. It is often realized through optimizing a meta-model with the evaluation loss of task-specific solvers. Most existing…

Machine Learning · Computer Science 2021-12-20 Su Lu , Han-Jia Ye , Le Gan , De-Chuan Zhan

Multi-Task Reinforcement Learning Enables Parameter Scaling

Multi-task reinforcement learning (MTRL) aims to endow a single agent with the ability to perform well on multiple tasks. Recent works have focused on developing novel sophisticated architectures to improve performance, often resulting in…

Machine Learning · Computer Science 2025-03-13 Reginald McLean , Evangelos Chatzaroulas , Jordan Terry , Isaac Woungang , Nariman Farsad , Pablo Samuel Castro

A Joint Many-Task Model: Growing a Neural Network for Multiple NLP Tasks

Transfer and multi-task learning have traditionally focused on either a single source-target pair or very few, similar tasks. Ideally, the linguistic levels of morphology, syntax and semantics would benefit each other by being trained in a…

Computation and Language · Computer Science 2017-07-25 Kazuma Hashimoto , Caiming Xiong , Yoshimasa Tsuruoka , Richard Socher

Preventing Catastrophic Forgetting in Continual Learning of New Natural Language Tasks

Multi-Task Learning (MTL) is widely-accepted in Natural Language Processing as a standard technique for learning multiple related tasks in one model. Training an MTL model requires having the training data for all tasks available at the…

Computation and Language · Computer Science 2023-02-23 Sudipta Kar , Giuseppe Castellucci , Simone Filice , Shervin Malmasi , Oleg Rokhlenko

Parameter-Efficient Prompt Tuning Makes Generalized and Calibrated Neural Text Retrievers

Prompt tuning attempts to update few task-specific parameters in pre-trained models. It has achieved comparable performance to fine-tuning of the full parameter set on both language understanding and generation tasks. In this work, we study…

Computation and Language · Computer Science 2022-07-15 Weng Lam Tam , Xiao Liu , Kaixuan Ji , Lilong Xue , Xingjian Zhang , Yuxiao Dong , Jiahua Liu , Maodi Hu , Jie Tang

Behavior Retrieval: Few-Shot Imitation Learning by Querying Unlabeled Datasets

Enabling robots to learn novel visuomotor skills in a data-efficient manner remains an unsolved problem with myriad challenges. A popular paradigm for tackling this problem is through leveraging large unlabeled datasets that have many…

Robotics · Computer Science 2023-05-16 Maximilian Du , Suraj Nair , Dorsa Sadigh , Chelsea Finn

Generalization Properties of Retrieval-based Models

Many modern high-performing machine learning models such as GPT-3 primarily rely on scaling up models, e.g., transformer networks. Simultaneously, a parallel line of work aims to improve the model performance by augmenting an input instance…

Machine Learning · Computer Science 2022-10-07 Soumya Basu , Ankit Singh Rawat , Manzil Zaheer

Transductive Auxiliary Task Self-Training for Neural Multi-Task Models

Multi-task learning and self-training are two common ways to improve a machine learning model's performance in settings with limited training data. Drawing heavily on ideas from those two approaches, we suggest transductive auxiliary task…

Computation and Language · Computer Science 2019-09-24 Johannes Bjerva , Katharina Kann , Isabelle Augenstein