Continuous Active Learning Using Pretrained Transformers

Nima Sadri; Gordon V. Cormack

Continuous Active Learning Using Pretrained Transformers

Information Retrieval 2022-08-16 v1 Artificial Intelligence Machine Learning

Authors: Nima Sadri , Gordon V. Cormack

Abstract

Pre-trained and fine-tuned transformer models like BERT and T5 have improved the state of the art in ad-hoc retrieval and question-answering, but not as yet in high-recall information retrieval, where the objective is to retrieve substantially all relevant documents. We investigate whether the use of transformer-based models for reranking and/or featurization can improve the Baseline Model Implementation of the TREC Total Recall Track, which represents the current state of the art for high-recall information retrieval. We also introduce CALBERT, a model that can be used to continuously fine-tune a BERT-based model based on relevance feedback.

Continuous Active Learning Using Pretrained Transformers

Abstract

Keywords

Cite

Related papers