Evaluating Temporal Persistence Using Replicability Measures

Jüri Keller; Timo Breuer; Philipp Schaer

Evaluating Temporal Persistence Using Replicability Measures

Information Retrieval 2023-08-22 v1

Authors: Jüri Keller , Timo Breuer , Philipp Schaer

Abstract

In real-world Information Retrieval (IR) experiments, the Evaluation Environment (EE) is exposed to constant change. Documents are added, removed, or updated, and the information need and the search behavior of users is evolving. Simultaneously, IR systems are expected to retain a consistent quality. The LongEval Lab seeks to investigate the longitudinal persistence of IR systems, and in this work, we describe our participation. We submitted runs of five advanced retrieval systems, namely a Reciprocal Rank Fusion (RRF) approach, ColBERT, monoT5, Doc2Query, and E5, to both sub-tasks. Further, we cast the longitudinal evaluation as a replicability study to better understand the temporal change observed. As a result, we quantify the persistence of the submitted runs and see great potential in this evaluation method.

Keywords

information retrieval evaluation metrics benchmark evaluation

Cite

@article{arxiv.2308.10549,
  title  = {Evaluating Temporal Persistence Using Replicability Measures},
  author = {Jüri Keller and Timo Breuer and Philipp Schaer},
  journal= {arXiv preprint arXiv:2308.10549},
  year   = {2023}
}

Comments

To be published in Proceedings of the Working Notes of CLEF 2023 - Conference and Labs of the Evaluation Forum, Thessaloniki, Greece 18 - 21, 2023

Evaluating Temporal Persistence Using Replicability Measures

Abstract

Keywords

Cite

Comments

Related papers