English

Improving BERT-based Query-by-Document Retrieval with Multi-Task Optimization

Information Retrieval 2022-05-25 v2

Abstract

Query-by-document (QBD) retrieval is an Information Retrieval task in which a seed document acts as the query and the goal is to retrieve related documents -- it is particular common in professional search tasks. In this work we improve the retrieval effectiveness of the BERT re-ranker, proposing an extension to its fine-tuning step to better exploit the context of queries. To this end, we use an additional document-level representation learning objective besides the ranking objective when fine-tuning the BERT re-ranker. Our experiments on two QBD retrieval benchmarks show that the proposed multi-task optimization significantly improves the ranking effectiveness without changing the BERT re-ranker or using additional training samples. In future work, the generalizability of our approach to other retrieval tasks should be further investigated.

Keywords

Cite

@article{arxiv.2202.00373,
  title  = {Improving BERT-based Query-by-Document Retrieval with Multi-Task Optimization},
  author = {Amin Abolghasemi and Suzan Verberne and Leif Azzopardi},
  journal= {arXiv preprint arXiv:2202.00373},
  year   = {2022}
}

Comments

Accepted for publication in the 44th European Conference on Information Retrieval (ECIR2022)

R2 v1 2026-06-24T09:13:00.956Z