Related papers: CPTAM: Constituency Parse Tree Aggregation Method

Constituency Parsing using LLMs

Constituency parsing is a fundamental yet unsolved challenge in natural language processing. In this paper, we examine the potential of recent large language models (LLMs) to address this challenge. We reformat constituency parsing as a…

Computation and Language · Computer Science 2025-09-29 Xuefeng Bai , Jialong Wu , Yulong Chen , Zhongqing Wang , Kehai Chen , Min Zhang , Yue Zhang

Straight to the Tree: Constituency Parsing with Neural Syntactic Distance

In this work, we propose a novel constituency parsing scheme. The model predicts a vector of real-valued scalars, named syntactic distances, for each split position in the input sentence. The syntactic distances specify the order in which…

Computation and Language · Computer Science 2018-06-13 Yikang Shen , Zhouhan Lin , Athul Paul Jacob , Alessandro Sordoni , Aaron Courville , Yoshua Bengio

Contextual Distortion Reveals Constituency: Masked Language Models are Implicit Parsers

Recent advancements in pre-trained language models (PLMs) have demonstrated that these models possess some degree of syntactic awareness. To leverage this knowledge, we propose a novel chart-based method for extracting parse trees from…

Computation and Language · Computer Science 2023-06-02 Jiaxi Li , Wei Lu

Consensus Tree Estimation with False Discovery Rate Control via Partially Ordered Sets

Connected acyclic graphs (trees) are data objects that hierarchically organize categories. Collections of trees arise in a diverse variety of fields, including evolutionary biology, public health, machine learning, social sciences and…

Methodology · Statistics 2025-12-01 Maria Alejandra Valdez Cabrera , Amy D Willis , Armeen Taeb

Tree Transformers are an Ineffective Model of Syntactic Constituency

Linguists have long held that a key aspect of natural language syntax is the recursive organization of language units into constituent structures, and research has suggested that current state-of-the-art language models lack an inherent…

Computation and Language · Computer Science 2024-11-27 Michael Ginn

Efficient Constituency Parsing by Pointing

We propose a novel constituency parsing model that casts the parsing problem into a series of pointing tasks. Specifically, our model estimates the likelihood of a span being a legitimate tree constituent via the pointing score…

Computation and Language · Computer Science 2020-06-25 Thanh-Tung Nguyen , Xuan-Phi Nguyen , Shafiq Joty , Xiaoli Li

Structured Tree Alignment for Evaluation of (Speech) Constituency Parsing

We present the structured average intersection-over-union ratio (STRUCT-IOU), a similarity metric between constituency parse trees motivated by the problem of evaluating speech parsers. STRUCT-IOU enables comparison between a constituency…

Computation and Language · Computer Science 2024-06-21 Freda Shi , Kevin Gimpel , Karen Livescu

Ensemble Distillation for Unsupervised Constituency Parsing

We investigate the unsupervised constituency parsing task, which organizes words and phrases of a sentence into a hierarchical structure without using linguistically annotated data. We observe that existing unsupervised parsers capture…

Computation and Language · Computer Science 2024-04-29 Behzad Shayegh , Yanshuai Cao , Xiaodan Zhu , Jackie C. K. Cheung , Lili Mou

Revisiting the Practical Effectiveness of Constituency Parse Extraction from Pre-trained Language Models

Constituency Parse Extraction from Pre-trained Language Models (CPE-PLM) is a recent paradigm that attempts to induce constituency parse trees relying only on the internal knowledge of pre-trained language models. While attractive in the…

Computation and Language · Computer Science 2022-11-02 Taeuk Kim

Tree-Averaging Algorithms for Ensemble-Based Unsupervised Discontinuous Constituency Parsing

We address unsupervised discontinuous constituency parsing, where we observe a high variance in the performance of the only previous model in the literature. We propose to build an ensemble of different runs of the existing discontinuous…

Computation and Language · Computer Science 2024-11-07 Behzad Shayegh , Yuqiao Wen , Lili Mou

Constituent Parsing as Sequence Labeling

We introduce a method to reduce constituent parsing to sequence labeling. For each word w_t, it generates a label that encodes: (1) the number of ancestors in the tree that the words w_t and w_{t+1} have in common, and (2) the nonterminal…

Computation and Language · Computer Science 2019-09-18 Carlos Gómez-Rodríguez , David Vilares

Linear Aggregation in Tree-based Estimators

Regression trees and their ensemble methods are popular methods for nonparametric regression: they combine strong predictive performance with interpretable estimators. To improve their utility for locally smooth response surfaces, we study…

Methodology · Statistics 2021-09-13 Sören R. Künzel , Theo F. Saarinen , Edward W. Liu , Jasjeet S. Sekhon

Constituency Parsing with a Self-Attentive Encoder

We demonstrate that replacing an LSTM encoder with a self-attentive architecture can lead to improvements to a state-of-the-art discriminative constituency parser. The use of attention makes explicit the manner in which information is…

Computation and Language · Computer Science 2018-05-04 Nikita Kitaev , Dan Klein

Unsupervised Parsing via Constituency Tests

We propose a method for unsupervised parsing based on the linguistic notion of a constituency test. One type of constituency test involves modifying the sentence via some transformation (e.g. replacing the span with a pronoun) and then…

Computation and Language · Computer Science 2020-10-08 Steven Cao , Nikita Kitaev , Dan Klein

Cascading and Direct Approaches to Unsupervised Constituency Parsing on Spoken Sentences

Past work on unsupervised parsing is constrained to written form. In this paper, we present the first study on unsupervised spoken constituency parsing given unlabeled spoken sentences and unpaired textual data. The goal is to determine the…

Computation and Language · Computer Science 2023-05-10 Yuan Tseng , Cheng-I Lai , Hung-yi Lee

Bottom-Up Constituency Parsing and Nested Named Entity Recognition with Pointer Networks

Constituency parsing and nested named entity recognition (NER) are similar tasks since they both aim to predict a collection of nested and non-crossing spans. In this work, we cast nested NER to constituency parsing and propose a novel…

Computation and Language · Computer Science 2022-03-10 Songlin Yang , Kewei Tu

Exploiting Syntactic Features in a Parsed Tree to Improve End-to-End TTS

The end-to-end TTS, which can predict speech directly from a given sequence of graphemes or phonemes, has shown improved performance over the conventional TTS. However, its predicting capability is still limited by the acoustic/phonetic…

Computation and Language · Computer Science 2019-04-10 Haohan Guo , Frank K. Soong , Lei He , Lei Xie

A Conditional Splitting Framework for Efficient Constituency Parsing

We introduce a generic seq2seq parsing framework that casts constituency parsing problems (syntactic and discourse parsing) into a series of conditional splitting decisions. Our parsing model estimates the conditional probability…

Computation and Language · Computer Science 2021-07-01 Thanh-Tung Nguyen , Xuan-Phi Nguyen , Shafiq Joty , Xiaoli Li

Encoding Syntactic Constituency Paths for Frame-Semantic Parsing with Graph Convolutional Networks

We study the problem of integrating syntactic information from constituency trees into a neural model in Frame-semantic parsing sub-tasks, namely Target Identification (TI), FrameIdentification (FI), and Semantic Role Labeling (SRL). We use…

Computation and Language · Computer Science 2020-11-30 Emanuele Bastianelli , Andrea Vanzo , Oliver Lemon

Compacting the Penn Treebank Grammar

Treebanks, such as the Penn Treebank (PTB), offer a simple approach to obtaining a broad coverage grammar: one can simply read the grammar off the parse trees in the treebank. While such a grammar is easy to obtain, a square-root rate of…

Computation and Language · Computer Science 2007-05-23 Alexander Krotov , Mark Hepple , Robert Gaizauskas , Yorick Wilks