English

Entity Extraction with Knowledge from Web Scale Corpora

Computation and Language 2019-11-22 v1 Databases Machine Learning

Abstract

Entity extraction is an important task in text mining and natural language processing. A popular method for entity extraction is by comparing substrings from free text against a dictionary of entities. In this paper, we present several techniques as a post-processing step for improving the effectiveness of the existing entity extraction technique. These techniques utilise models trained with the web-scale corpora which makes our techniques robust and versatile. Experiments show that our techniques bring a notable improvement on efficiency and effectiveness.

Keywords

Cite

@article{arxiv.1911.09373,
  title  = {Entity Extraction with Knowledge from Web Scale Corpora},
  author = {Zeyi Wen and Zeyu Huang and Rui Zhang},
  journal= {arXiv preprint arXiv:1911.09373},
  year   = {2019}
}