English
Related papers

Related papers: Patapasco: A Python Framework for Cross-Language I…

200 papers

Cross-lingual information retrieval (CLIR) addresses the challenge of retrieving relevant documents written in languages different from that of the original query. Research in this area has typically framed the task as monolingual retrieval…

Information Retrieval · Computer Science 2025-10-02 Roksana Goworek , Olivia Macmillan-Scott , Eda B. Özyiğit

A key stumbling block for neural cross-language information retrieval (CLIR) systems has been the paucity of training data. The appearance of the MS MARCO monolingual training set led to significant advances in the state of the art in…

Information Retrieval · Computer Science 2023-05-02 James Mayfield , Eugene Yang , Dawn Lawrie , Samuel Barham , Orion Weller , Marc Mason , Suraj Nair , Scott Miller

The advent of multilingual language models has generated a resurgence of interest in cross-lingual information retrieval (CLIR), which is the task of searching documents in one language with queries from another. However, the rapid pace of…

While recent advancements in Neural Ranking Models have resulted in significant improvements over traditional statistical retrieval models, it is generally acknowledged that the use of large neural architectures and the application of…

Information Retrieval · Computer Science 2025-05-13 Sourav Saha , Harsh Agarwal , V Venktesh , Avishek Anand , Swastik Mohanty , Debapriyo Majumdar , Mandar Mitra

Cross-language information retrieval (CLIR), where queries and documents are in different languages, has of late become one of the major topics within the information retrieval community. This paper proposes a Japanese/English CLIR system,…

Computation and Language · Computer Science 2007-05-23 Atsushi Fujii , Tetsuya Ishikawa

With the increasing accessibility and utilization of multilingual documents, Cross-Lingual Information Retrieval (CLIR) has emerged as an important research area. Conventionally, CLIR tasks have been conducted under settings where the…

Information Retrieval · Computer Science 2026-04-08 Seongtae Hong , Youngjoon Jang , Jungseob Lee , Hyeonseok Moon , Heuiseok Lim

Machine Translation for English Retrieval of Information in Any Language (MATERIAL) is an IARPA initiative targeted to advance the state of cross-lingual information retrieval (CLIR). This report provides a detailed description of…

Computation and Language · Computer Science 2025-11-06 Shantanu Agarwal , Joel Barry , Elizabeth Boschee , Scott Miller

Cross-lingual information retrieval (CLIR) helps users find documents in languages different from their queries. This is especially important in academic search, where key research is often published in non-English languages. We present…

Information Retrieval · Computer Science 2025-11-20 Francisco Valentini , Diego Kozlowski , Vincent Larivière

We propose a fully unsupervised framework for ad-hoc cross-lingual information retrieval (CLIR) which requires no bilingual data at all. The framework leverages shared cross-lingual word embedding spaces in which terms, queries, and…

Computation and Language · Computer Science 2018-05-03 Robert Litschko , Goran Glavaš , Simone Paolo Ponzetto , Ivan Vulić

This paper proposes a Japanese/English cross-language information retrieval (CLIR) system targeting technical documents. Our system first translates a given query containing technical terms into the target language, and then retrieves…

Computation and Language · Computer Science 2007-05-23 Atsushi Fujii , Tetsuya Ishikawa

The advent of deep machine learning platforms such as Tensorflow and Pytorch, developed in expressive high-level languages such as Python, have allowed more expressive representations of deep neural network architectures. We argue that such…

Information Retrieval · Computer Science 2020-07-29 Craig Macdonald , Nicola Tonellotto

Cross-lingual information retrieval (CLIR) enables access to multilingual knowledge but remains challenging due to disparities in resources, scripts, and weak cross-lingual semantic alignment in embedding models. Existing pipelines often…

Information Retrieval · Computer Science 2025-11-25 Roksana Goworek , Olivia Macmillan-Scott , Eda B. Özyiğit

We present the Benchmark of Information Retrieval (IR) tasks with Complex Objectives (BIRCO). BIRCO evaluates the ability of IR systems to retrieve documents given multi-faceted user objectives. The benchmark's complexity and compact size…

Information Retrieval · Computer Science 2024-04-05 Xiaoyue Wang , Jianyou Wang , Weili Cao , Kaicheng Wang , Ramamohan Paturi , Leon Bergen

Recent work in cross-language information retrieval (CLIR), where queries and documents are in different languages, has shown the benefit of the Translate-Distill framework that trains a cross-language neural dual-encoder model using…

Information Retrieval · Computer Science 2024-05-03 Eugene Yang , Dawn Lawrie , James Mayfield

The main issue in Cross Language Information Retrieval (CLIR) is the poor performance of retrieval in terms of average precision when compared to monolingual retrieval performance. The main reasons behind poor performance of CLIR are…

Information Retrieval · Computer Science 2014-01-16 Saurabh Varshney , Jyoti Bajpai

Despite advances in neural machine translation, cross-lingual retrieval tasks in which queries and documents live in different natural language spaces remain challenging. Although neural translation models may provide an intuitive approach…

Information Retrieval · Computer Science 2021-07-30 Zhizhong Chen , Carsten Eickhoff

Two key assumptions shape the usual view of ranked retrieval: (1) that the searcher can choose words for their query that might appear in the documents that they wish to see, and (2) that ranking retrieved documents will suffice because the…

Information Retrieval · Computer Science 2022-06-09 Petra Galuščáková , Douglas W. Oard , Suraj Nair

Prior work on English monolingual retrieval has shown that a cross-encoder trained using a large number of relevance judgments for query-document pairs can be used as a teacher to train more efficient, but similarly effective, dual-encoder…

Information Retrieval · Computer Science 2024-01-11 Eugene Yang , Dawn Lawrie , James Mayfield , Douglas W. Oard , Scott Miller

Although more and more language pairs are covered by machine translation services, there are still many pairs that lack translation resources. Cross-language information retrieval (CLIR) is an application which needs translation…

Computation and Language · Computer Science 2007-05-23 Wessel Kraaij , Jian-Yun Nie , Michel Simard

Cross-lingual information retrieval (CLIR) ~\cite{shi2021cross, asai2021one, jiang2020cross} for example, can find relevant text in any language such as English(high resource) or Telugu (low resource) even when the query is posed in a…

Computation and Language · Computer Science 2025-01-09 Navya Yarrabelly , Saloni Mittal , Ketan Todi , Kimihiro Hasegawa
‹ Prev 1 2 3 10 Next ›