English

Semi-supervised Autoencoding Projective Dependency Parsing

Computation and Language 2020-11-03 v1 Machine Learning

Abstract

We describe two end-to-end autoencoding models for semi-supervised graph-based projective dependency parsing. The first model is a Locally Autoencoding Parser (LAP) encoding the input using continuous latent variables in a sequential manner; The second model is a Globally Autoencoding Parser (GAP) encoding the input into dependency trees as latent variables, with exact inference. Both models consist of two parts: an encoder enhanced by deep neural networks (DNN) that can utilize the contextual information to encode the input into latent variables, and a decoder which is a generative model able to reconstruct the input. Both LAP and GAP admit a unified structure with different loss functions for labeled and unlabeled data with shared parameters. We conducted experiments on WSJ and UD dependency parsing data sets, showing that our models can exploit the unlabeled data to improve the performance given a limited amount of labeled data, and outperform a previously proposed semi-supervised model.

Keywords

Cite

@article{arxiv.2011.00704,
  title  = {Semi-supervised Autoencoding Projective Dependency Parsing},
  author = {Xiao Zhang and Dan Goldwasser},
  journal= {arXiv preprint arXiv:2011.00704},
  year   = {2020}
}

Comments

COLING 2020

R2 v1 2026-06-23T19:49:53.312Z