English

Zero-shot Entity Linking with Efficient Long Range Sequence Modeling

Computation and Language 2022-12-20 v1 Artificial Intelligence

Abstract

This paper considers the problem of zero-shot entity linking, in which a link in the test time may not present in training. Following the prevailing BERT-based research efforts, we find a simple yet effective way is to expand the long-range sequence modeling. Unlike many previous methods, our method does not require expensive pre-training of BERT with long position embedding. Instead, we propose an efficient position embeddings initialization method called Embedding-repeat, which initializes larger position embeddings based on BERT-Base. On Wikia's zero-shot EL dataset, our method improves the SOTA from 76.06% to 79.08%, and for its long data, the corresponding improvement is from 74.57% to 82.14%. Our experiments suggest the effectiveness of long-range sequence modeling without retraining the BERT model.

Cite

@article{arxiv.2010.06065,
  title  = {Zero-shot Entity Linking with Efficient Long Range Sequence Modeling},
  author = {Zonghai Yao and Liangliang Cao and Huapu Pan},
  journal= {arXiv preprint arXiv:2010.06065},
  year   = {2022}
}

Comments

6 pages, 6 figures, Findings of EMNLP2020

R2 v1 2026-06-23T19:17:42.840Z