English

Patapasco: A Python Framework for Cross-Language Information Retrieval Experiments

Information Retrieval 2022-01-26 v1

Abstract

While there are high-quality software frameworks for information retrieval experimentation, they do not explicitly support cross-language information retrieval (CLIR). To fill this gap, we have created Patapsco, a Python CLIR framework. This framework specifically addresses the complexity that comes with running experiments in multiple languages. Patapsco is designed to be extensible to many language pairs, to be scalable to large document collections, and to support reproducible experiments driven by a configuration file. We include Patapsco results on standard CLIR collections using multiple settings.

Keywords

Cite

@article{arxiv.2201.09996,
  title  = {Patapasco: A Python Framework for Cross-Language Information Retrieval Experiments},
  author = {Cash Costello and Eugene Yang and Dawn Lawrie and James Mayfield},
  journal= {arXiv preprint arXiv:2201.09996},
  year   = {2022}
}

Comments

5 pages, accepted at ECIR 2022 as a demo paper

R2 v1 2026-06-24T09:01:08.661Z