English

Learning Feature Representations for Keyphrase Extraction

Computation and Language 2018-01-08 v1 Artificial Intelligence

Abstract

In supervised approaches for keyphrase extraction, a candidate phrase is encoded with a set of hand-crafted features and machine learning algorithms are trained to discriminate keyphrases from non-keyphrases. Although the manually-designed features have shown to work well in practice, feature engineering is a difficult process that requires expert knowledge and normally does not generalize well. In this paper, we present SurfKE, a feature learning framework that exploits the text itself to automatically discover patterns that keyphrases exhibit. Our model represents the document as a graph and automatically learns feature representation of phrases. The proposed model obtains remarkable improvements in performance over strong baselines.

Keywords

Cite

@article{arxiv.1801.01768,
  title  = {Learning Feature Representations for Keyphrase Extraction},
  author = {Corina Florescu and Wei Jin},
  journal= {arXiv preprint arXiv:1801.01768},
  year   = {2018}
}

Comments

To appear in AAAI 2018 Student Abstract and Poster Program