Data Augmentation for Personal Knowledge Base Population

Information Retrieval; Artificial Intelligence; Computation and Language. 2020-08-20 (v2)

Abstract

Cold-start knowledge base population (KBP) is the problem of populating a knowledge base from unstructured documents. While artificial neural networks have led to significant improvements in the individual tasks that make up KBP, the overall F1 score of the end-to-end system remains quite low. This problem is more acute for personal knowledge bases, which present additional challenges with regard to data protection, fairness, and privacy. In this work, we present a system that uses rule-based annotators and a graph neural network for missing link prediction to populate a more complete, fair, and diverse knowledge base from the TACRED dataset.

Cite

@article{arxiv.2002.10943,
  title  = {Data Augmentation for Personal Knowledge Base Population},
  author = {Lingraj S Vannur and Balaji Ganesan and Lokesh Nagalapatti and Hima Patel and MN Thippeswamy},
  journal= {arXiv preprint arXiv:2002.10943},
  year   = {2020}
}

Comments

8 pages, 9 figures, 6 tables. Under review. arXiv admin note: text overlap with arXiv:2001.08013
