English

Variational Sequential Labelers for Semi-Supervised Learning

Computation and Language 2019-06-25 v1

Abstract

We introduce a family of multitask variational methods for semi-supervised sequence labeling. Our model family consists of a latent-variable generative model and a discriminative labeler. The generative models use latent variables to define the conditional probability of a word given its context, drawing inspiration from word prediction objectives commonly used in learning word embeddings. The labeler helps inject discriminative information into the latent space. We explore several latent variable configurations, including ones with hierarchical structure, which enables the model to account for both label-specific and word-specific information. Our models consistently outperform standard sequential baselines on 8 sequence labeling datasets, and improve further with unlabeled data.

Keywords

Cite

@article{arxiv.1906.09535,
  title  = {Variational Sequential Labelers for Semi-Supervised Learning},
  author = {Mingda Chen and Qingming Tang and Karen Livescu and Kevin Gimpel},
  journal= {arXiv preprint arXiv:1906.09535},
  year   = {2019}
}

Comments

Appeared in EMNLP 2018 Long