English

Pynsett: A programmable relation extractor

Computation and Language 2020-11-06 v2 Artificial Intelligence

Abstract

This paper proposes a programmable relation extraction method for the English language by parsing texts into semantic graphs. A person can define rules in plain English that act as matching patterns onto the graph representation. These rules are designed to capture the semantic content of the documents, allowing for flexibility and ad-hoc entities. Relation extraction is a complex task that typically requires sizable training corpora. The method proposed here is ideal for extracting specialized ontologies in a limited collection of documents.

Keywords

Cite

@article{arxiv.2007.02100,
  title  = {Pynsett: A programmable relation extractor},
  author = {Alberto Cetoli},
  journal= {arXiv preprint arXiv:2007.02100},
  year   = {2020}
}

Comments

Accepted for publication in SEMAPRO2020