Learning Dynamic Author Representations with Temporal Language Models

Edouard Delasalles; Sylvain Lamprier; Ludovic Denoyer

doi:10.1109/ICDM.2019.00022

Learning Dynamic Author Representations with Temporal Language Models

Computation and Language 2020-02-25 v1 Machine Learning Machine Learning

Authors: Edouard Delasalles , Sylvain Lamprier , Ludovic Denoyer

View on arXiv ↗ PDF ↗ DOI ↗

Abstract

Language models are at the heart of numerous works, notably in the text mining and information retrieval communities. These statistical models aim at extracting word distributions, from simple unigram models to recurrent approaches with latent variables that capture subtle dependencies in texts. However, those models are learned from word sequences only, and authors' identities, as well as publication dates, are seldom considered. We propose a neural model, based on recurrent language modeling, which aims at capturing language diffusion tendencies in author communities through time. By conditioning language models with author and temporal vector states, we are able to leverage the latent dependencies between the text contexts. This allows us to beat several temporal and non-temporal language baselines on two real-world corpora, and to learn meaningful author representations that vary through time.

Keywords

language modeling natural language parsing natural language processing

Cite

@article{arxiv.1909.04985,
  title  = {Learning Dynamic Author Representations with Temporal Language Models},
  author = {Edouard Delasalles and Sylvain Lamprier and Ludovic Denoyer},
  journal= {arXiv preprint arXiv:1909.04985},
  year   = {2020}
}

Comments

International Conference on Data Mining, ICDM 2019

Learning Dynamic Author Representations with Temporal Language Models

Abstract

Keywords

Cite

Comments

Related papers