English

Multitask Pointer Network for Multi-Representational Parsing

Computation and Language 2022-12-26 v2

Abstract

We propose a transition-based approach that, by training a single model, can efficiently parse any input sentence with both constituent and dependency trees, supporting both continuous/projective and discontinuous/non-projective syntactic structures. To that end, we develop a Pointer Network architecture with two separate task-specific decoders and a common encoder, and follow a multitask learning strategy to jointly train them. The resulting quadratic system, not only becomes the first parser that can jointly produce both unrestricted constituent and dependency trees from a single model, but also proves that both syntactic formalisms can benefit from each other during training, achieving state-of-the-art accuracies in several widely-used benchmarks such as the continuous English and Chinese Penn Treebanks, as well as the discontinuous German NEGRA and TIGER datasets.

Keywords

Cite

@article{arxiv.2009.09730,
  title  = {Multitask Pointer Network for Multi-Representational Parsing},
  author = {Daniel Fernández-González and Carlos Gómez-Rodríguez},
  journal= {arXiv preprint arXiv:2009.09730},
  year   = {2022}
}

Comments

Final peer-reviewed manuscript accepted for publication in Knowledge-Based Systems