English

Instilling Type Knowledge in Language Models via Multi-Task QA

Computation and Language 2022-05-02 v1 Artificial Intelligence

Abstract

Understanding human language often necessitates understanding entities and their place in a taxonomy of knowledge -- their types. Previous methods to learn entity types rely on training classifiers on datasets with coarse, noisy, and incomplete labels. We introduce a method to instill fine-grained type knowledge in language models with text-to-text pre-training on type-centric questions leveraging knowledge base documents and knowledge graphs. We create the WikiWiki dataset: entities and passages from 10M Wikipedia articles linked to the Wikidata knowledge graph with 41K types. Models trained on WikiWiki achieve state-of-the-art performance in zero-shot dialog state tracking benchmarks, accurately infer entity types in Wikipedia articles, and can discover new types deemed useful by human judges.

Keywords

Cite

@article{arxiv.2204.13796,
  title  = {Instilling Type Knowledge in Language Models via Multi-Task QA},
  author = {Shuyang Li and Mukund Sridhar and Chandana Satya Prakash and Jin Cao and Wael Hamza and Julian McAuley},
  journal= {arXiv preprint arXiv:2204.13796},
  year   = {2022}
}

Comments

Findings of NAACL 2022; dataset link: https://github.com/amazon-research/wikiwiki-dataset