English

FLERT: Document-Level Features for Named Entity Recognition

Computation and Language 2021-05-17 v2

Abstract

Current state-of-the-art approaches for named entity recognition (NER) typically consider text at the sentence-level and thus do not model information that crosses sentence boundaries. However, the use of transformer-based models for NER offers natural options for capturing document-level features. In this paper, we perform a comparative evaluation of document-level features in the two standard NER architectures commonly considered in the literature, namely "fine-tuning" and "feature-based LSTM-CRF". We evaluate different hyperparameters for document-level features such as context window size and enforcing document-locality. We present experiments from which we derive recommendations for how to model document context and present new state-of-the-art scores on several CoNLL-03 benchmark datasets. Our approach is integrated into the Flair framework to facilitate reproduction of our experiments.

Keywords

Cite

@article{arxiv.2011.06993,
  title  = {FLERT: Document-Level Features for Named Entity Recognition},
  author = {Stefan Schweter and Alan Akbik},
  journal= {arXiv preprint arXiv:2011.06993},
  year   = {2021}
}
R2 v1 2026-06-23T20:11:10.038Z