Related papers: Efficient Retrieval Optimized Multi-task Learning

Optimizing Question Semantic Space for Dynamic Retrieval-Augmented Multi-hop Question Answering

Retrieval-augmented generation (RAG) is usually integrated into large language models (LLMs) to mitigate hallucinations and knowledge obsolescence. Whereas,conventional one-step retrieve-and-read methods are insufficient for multi-hop…

Information Retrieval · Computer Science 2025-06-03 Linhao Ye , Lang Yu , Zhikai Lei , Qin Chen , Jie Zhou , Liang He

Retrieve Fast, Rerank Smart: Cooperative and Joint Approaches for Improved Cross-Modal Retrieval

Current state-of-the-art approaches to cross-modal retrieval process text and visual input jointly, relying on Transformer-based architectures with cross-attention mechanisms that attend over all words and objects in an image. While…

Computer Vision and Pattern Recognition · Computer Science 2022-02-22 Gregor Geigle , Jonas Pfeiffer , Nils Reimers , Ivan Vulić , Iryna Gurevych

Single LLM, Multiple Roles: A Unified Retrieval-Augmented Generation Framework Using Role-Specific Token Optimization

Existing studies have optimized retrieval-augmented generation (RAG) across various sub-tasks, such as query understanding and retrieval refinement, but integrating these optimizations into a unified framework remains challenging. To tackle…

Computation and Language · Computer Science 2025-05-22 Yutao Zhu , Jiajie Jin , Hongjin Qian , Zheng Liu , Zhicheng Dou , Ji-Rong Wen

Augmenting Pre-trained Language Models with QA-Memory for Open-Domain Question Answering

Retrieval augmented language models have recently become the standard for knowledge intensive tasks. Rather than relying purely on latent semantics within the parameters of large neural models, these methods enlist a semi-parametric memory…

Computation and Language · Computer Science 2023-01-24 Wenhu Chen , Pat Verga , Michiel de Jong , John Wieting , William Cohen

Multi-task Retrieval for Knowledge-Intensive Tasks

Retrieving relevant contexts from a large corpus is a crucial step for tasks such as open-domain question answering and fact checking. Although neural retrieval outperforms traditional methods like tf-idf and BM25, its performance degrades…

Computation and Language · Computer Science 2021-01-05 Jean Maillard , Vladimir Karpukhin , Fabio Petroni , Wen-tau Yih , Barlas Oğuz , Veselin Stoyanov , Gargi Ghosh

Enhancing Dual-Encoders with Question and Answer Cross-Embeddings for Answer Retrieval

Dual-Encoders is a promising mechanism for answer retrieval in question answering (QA) systems. Currently most conventional Dual-Encoders learn the semantic representations of questions and answers merely through matching score. Researchers…

Computation and Language · Computer Science 2022-06-08 Yanmeng Wang , Jun Bai , Ye Wang , Jianfei Zhang , Wenge Rong , Zongcheng Ji , Shaojun Wang , Jing Xiao

LamRA: Large Multimodal Model as Your Advanced Retrieval Assistant

With the rapid advancement of multimodal information retrieval, increasingly complex retrieval tasks have emerged. Existing methods predominately rely on task-specific fine-tuning of vision-language models, often those trained with…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Yikun Liu , Pingan Chen , Jiayin Cai , Xiaolong Jiang , Yao Hu , Jiangchao Yao , Yanfeng Wang , Weidi Xie

KROMA: Ontology Matching with Knowledge Retrieval and Large Language Models

Ontology Matching (OM) is a cornerstone task of semantic interoperability, yet existing systems often rely on handcrafted rules or specialized models with limited adaptability. We present KROMA, a novel OM framework that harnesses Large…

Artificial Intelligence · Computer Science 2025-09-12 Lam Nguyen , Erika Barcelos , Roger French , Yinghui Wu

Multilingual Retrieval-Augmented Generation for Knowledge-Intensive Task

Retrieval-augmented generation (RAG) has become a cornerstone of contemporary NLP, enhancing large language models (LLMs) by allowing them to access richer factual contexts through in-context retrieval. While effective in monolingual…

Computation and Language · Computer Science 2026-03-31 Leonardo Ranaldi , Barry Haddow , Alexandra Birch

Self-Organizing Maps for Storage and Transfer of Knowledge in Reinforcement Learning

The idea of reusing or transferring information from previously learned tasks (source tasks) for the learning of new tasks (target tasks) has the potential to significantly improve the sample efficiency of a reinforcement learning agent. In…

Artificial Intelligence · Computer Science 2022-09-28 Thommen George Karimpanal , Roland Bouffanais

RAVEN: Multitask Retrieval Augmented Vision-Language Learning

The scaling of large language models to encode all the world's knowledge in model parameters is unsustainable and has exacerbated resource barriers. Retrieval-Augmented Generation (RAG) presents a potential solution, yet its application to…

Computer Vision and Pattern Recognition · Computer Science 2024-06-28 Varun Nagaraj Rao , Siddharth Choudhary , Aditya Deshpande , Ravi Kumar Satzoda , Srikar Appalaraju

Chain-of-Skills: A Configurable Model for Open-domain Question Answering

The retrieval model is an indispensable component for real-world knowledge-intensive tasks, e.g., open-domain question answering (ODQA). As separate retrieval skills are annotated for different datasets, recent work focuses on customized…

Computation and Language · Computer Science 2023-05-29 Kaixin Ma , Hao Cheng , Yu Zhang , Xiaodong Liu , Eric Nyberg , Jianfeng Gao

Recover and Match: Open-Vocabulary Multi-Label Recognition through Knowledge-Constrained Optimal Transport

Identifying multiple novel classes in an image, known as open-vocabulary multi-label recognition, is a challenging task in computer vision. Recent studies explore the transfer of powerful vision-language models such as CLIP. However, these…

Computer Vision and Pattern Recognition · Computer Science 2025-03-20 Hao Tan , Zichang Tan , Jun Li , Ajian Liu , Jun Wan , Zhen Lei

CorpusLM: Towards a Unified Language Model on Corpus for Knowledge-Intensive Tasks

Large language models (LLMs) have gained significant attention in various fields but prone to hallucination, especially in knowledge-intensive (KI) tasks. To address this, retrieval-augmented generation (RAG) has emerged as a popular…

Computation and Language · Computer Science 2024-04-23 Xiaoxi Li , Zhicheng Dou , Yujia Zhou , Fangchao Liu

Tram: A Token-level Retrieval-augmented Mechanism for Source Code Summarization

Automatically generating human-readable text describing the functionality of a program is the intent of source code summarization. Although neural language models achieve significant performance in this field, they are limited by their…

Artificial Intelligence · Computer Science 2024-04-02 Tong Ye , Lingfei Wu , Tengfei Ma , Xuhong Zhang , Yangkai Du , Peiyu Liu , Shouling Ji , Wenhai Wang

Enhancing Question Answering Precision with Optimized Vector Retrieval and Instructions

Question-answering (QA) is an important application of Information Retrieval (IR) and language models, and the latest trend is toward pre-trained large neural networks with embedding parameters. Augmenting QA performances with these LLMs…

Information Retrieval · Computer Science 2024-11-05 Lixiao Yang , Mengyang Xu , Weimao Ke

OPERA: A Reinforcement Learning--Enhanced Orchestrated Planner-Executor Architecture for Reasoning-Oriented Multi-Hop Retrieval

Recent advances in large language models (LLMs) and dense retrievers have driven significant progress in retrieval-augmented generation (RAG). However, existing approaches face significant challenges in complex reasoning-oriented multi-hop…

Information Retrieval · Computer Science 2026-05-19 Yu Liu , Yanbing Liu , Fangfang Yuan , Cong Cao , Youbang Sun , Kun Peng , Weizhuo Chen , Jianjun Li , Zhiyuan Ma

Unified Multimodal and Multilingual Retrieval via Multi-Task Learning with NLU Integration

Multimodal retrieval systems typically employ Vision Language Models (VLMs) that encode images and text independently into vectors within a shared embedding space. Despite incorporating text encoders, VLMs consistently underperform…

Information Retrieval · Computer Science 2026-01-22 Xinyuan Zhang , Lina Zhang , Lisung Chen , Guangyao Liu , Shuai Nie , Jiaming Xu , Runyu Shi , Ying Huang , Guoquan Zhang

Towards Optimizing a Retrieval Augmented Generation using Large Language Model on Academic Data

Given the growing trend of many organizations integrating Retrieval Augmented Generation (RAG) into their operations, we assess RAG on domain-specific data and test state-of-the-art models across various optimization techniques. We…

Artificial Intelligence · Computer Science 2024-11-14 Anum Afzal , Juraj Vladika , Gentrit Fazlija , Andrei Staradubets , Florian Matthes

Open-RAG: Enhanced Retrieval-Augmented Reasoning with Open-Source Large Language Models

Retrieval-Augmented Generation (RAG) has been shown to enhance the factual accuracy of Large Language Models (LLMs), but existing methods often suffer from limited reasoning capabilities in effectively using the retrieved evidence,…

Computation and Language · Computer Science 2024-10-03 Shayekh Bin Islam , Md Asib Rahman , K S M Tozammel Hossain , Enamul Hoque , Shafiq Joty , Md Rizwan Parvez