English

Topically Driven Neural Language Model

Computation and Language 2017-10-16 v2

Abstract

Language models are typically applied at the sentence level, without access to the broader document context. We present a neural language model that incorporates document context in the form of a topic model-like architecture, thus providing a succinct representation of the broader document context outside of the current sentence. Experiments over a range of datasets demonstrate that our model outperforms a pure sentence-based model in terms of language model perplexity, and leads to topics that are potentially more coherent than those produced by a standard LDA topic model. Our model also has the ability to generate related sentences for a topic, providing another way to interpret topics.

Keywords

Cite

@article{arxiv.1704.08012,
  title  = {Topically Driven Neural Language Model},
  author = {Jey Han Lau and Timothy Baldwin and Trevor Cohn},
  journal= {arXiv preprint arXiv:1704.08012},
  year   = {2017}
}

Comments

11 pages, Proceedings of the 55th Annual Meeting of the Association for Computational Linguistics (ACL 2017) (to appear)