Semi-supervised sequence tagging with bidirectional language models

Matthew E. Peters; Waleed Ammar; Chandra Bhagavatula; Russell Power

Semi-supervised sequence tagging with bidirectional language models

Computation and Language 2017-05-02 v1

Authors: Matthew E. Peters , Waleed Ammar , Chandra Bhagavatula , Russell Power

Abstract

Pre-trained word embeddings learned from unlabeled text have become a standard component of neural network architectures for NLP tasks. However, in most cases, the recurrent network that operates on word-level representations to produce context sensitive representations is trained on relatively little labeled data. In this paper, we demonstrate a general semi-supervised approach for adding pre- trained context embeddings from bidirectional language models to NLP systems and apply it to sequence labeling tasks. We evaluate our model on two standard datasets for named entity recognition (NER) and chunking, and in both cases achieve state of the art results, surpassing previous systems that use other forms of transfer or joint learning with additional labeled data and task specific gazetteers.

Keywords

semi-supervised learning natural language parsing word embeddings

Cite

@article{arxiv.1705.00108,
  title  = {Semi-supervised sequence tagging with bidirectional language models},
  author = {Matthew E. Peters and Waleed Ammar and Chandra Bhagavatula and Russell Power},
  journal= {arXiv preprint arXiv:1705.00108},
  year   = {2017}
}

Comments

To appear in ACL 2017

Semi-supervised sequence tagging with bidirectional language models

Abstract

Keywords

Cite

Comments

Related papers