LIMIT-BERT : Linguistic Informed Multi-Task BERT

Junru Zhou; Zhuosheng Zhang; Hai Zhao; Shuailiang Zhang

LIMIT-BERT : Linguistic Informed Multi-Task BERT

Computation and Language 2020-10-07 v2 Machine Learning

Authors: Junru Zhou , Zhuosheng Zhang , Hai Zhao , Shuailiang Zhang

Abstract

In this paper, we present a Linguistic Informed Multi-Task BERT (LIMIT-BERT) for learning language representations across multiple linguistic tasks by Multi-Task Learning (MTL). LIMIT-BERT includes five key linguistic syntax and semantics tasks: Part-Of-Speech (POS) tags, constituent and dependency syntactic parsing, span and dependency semantic role labeling (SRL). Besides, LIMIT-BERT adopts linguistics mask strategy: Syntactic and Semantic Phrase Masking which mask all of the tokens corresponding to a syntactic/semantic phrase. Different from recent Multi-Task Deep Neural Networks (MT-DNN) (Liu et al., 2019), our LIMIT-BERT is linguistically motivated and learning in a semi-supervised method which provides large amounts of linguistic-task data as same as BERT learning corpus. As a result, LIMIT-BERT not only improves linguistic tasks performance but also benefits from a regularization effect and linguistic information that leads to more general representations to help adapt to new tasks and domains. LIMIT-BERT obtains new state-of-the-art or competitive results on both span and dependency semantic parsing on Propbank benchmarks and both dependency and constituent syntactic parsing on Penn Treebank.

Keywords

bert pre-trained language model parsing

Cite

@article{arxiv.1910.14296,
  title  = {LIMIT-BERT : Linguistic Informed Multi-Task BERT},
  author = {Junru Zhou and Zhuosheng Zhang and Hai Zhao and Shuailiang Zhang},
  journal= {arXiv preprint arXiv:1910.14296},
  year   = {2020}
}

Comments

EMNLP 2020, ACL Findings

LIMIT-BERT : Linguistic Informed Multi-Task BERT

Abstract

Keywords

Cite

Comments

Related papers