Related papers: Linguistic Structure Induction from Language Model…

Active Use of Latent Constituency Representation in both Humans and Large Language Models

Understanding how sentences are internally represented in the human brain, as well as in large language models (LLMs) such as ChatGPT, is a major challenge for cognitive science. Classic linguistic theories propose that the brain represents…

Computation and Language · Computer Science 2024-05-29 Wei Liu, Ming Xiang, Nai Ding

A Systematic Study of Compositional Syntactic Transformer Language Models

Syntactic language models (SLMs) enhance Transformers by incorporating syntactic biases through the modeling of linearized syntactic parse trees alongside surface sentences. This paper focuses on compositional SLMs that are based on…

Computation and Language · Computer Science 2025-07-01 Yida Zhao, Hao Xve, Xiang Hu, Kewei Tu

Probabilistic Transformer: A Probabilistic Dependency Model for Contextual Word Representation

Syntactic structures used to play a vital role in natural language processing (NLP), but since the deep learning revolution, NLP has been gradually dominated by neural models that do not consider syntactic structures in their design. One…

Computation and Language · Computer Science 2023-11-28 Haoyi Wu, Kewei Tu

Syntactic Inductive Biases for Deep Learning Methods

In this thesis, we try to build a connection between the two schools by introducing syntactic inductive biases for deep learning models. We propose two families of inductive biases, one for constituency structure and another one for…

Machine Learning · Computer Science 2022-06-13 Yikang Shen

Syntactic Perturbations Reveal Representational Correlates of Hierarchical Phrase Structure in Pretrained Language Models

While vector-based language representations from pretrained language models have set a new standard for many NLP tasks, there is not yet a complete accounting of their inner workings. In particular, it is not entirely clear what aspects of…

Computation and Language · Computer Science 2021-04-16 Matteo Alleman, Jonathan Mamou, Miguel A Del Rio, Hanlin Tang, Yoon Kim, SueYeon Chung

StructFormer: Joint Unsupervised Induction of Dependency and Constituency Structure from Masked Language Modeling

There are two major classes of natural language grammar -- the dependency grammar that models one-to-one correspondences between words and the constituency grammar that models the assembly of one or several corresponded words. While…

Computation and Language · Computer Science 2021-07-13 Yikang Shen, Yi Tay, Che Zheng, Dara Bahri, Donald Metzler, Aaron Courville

Probing for Constituency Structure in Neural Language Models

In this paper, we investigate to which extent contextual neural language models (LMs) implicitly learn syntactic structure. More concretely, we focus on constituent structure as represented in the Penn Treebank (PTB). Using standard probing…

Computation and Language · Computer Science 2022-04-14 David Arps, Younes Samih, Laura Kallmeyer, Hassan Sajjad

Using Priming to Uncover the Organization of Syntactic Representations in Neural Language Models

Neural language models (LMs) perform well on tasks that require sensitivity to syntactic structure. Drawing on the syntactic priming paradigm from psycholinguistics, we propose a novel technique to analyze the representations that enable…

Computation and Language · Computer Science 2019-09-25 Grusha Prasad, Marten van Schijndel, Tal Linzen

A Self-supervised Representation Learning of Sentence Structure for Authorship Attribution

Syntactic structure of sentences in a document substantially informs about its authorial writing style. Sentence representation learning has been widely explored in recent years and it has been shown that it improves the generalization of…

Computation and Language · Computer Science 2022-02-25 Fereshteh Jafariakinabad, Kien A. Hua

Tree Transformers are an Ineffective Model of Syntactic Constituency

Linguists have long held that a key aspect of natural language syntax is the recursive organization of language units into constituent structures, and research has suggested that current state-of-the-art language models lack an inherent…

Computation and Language · Computer Science 2024-11-27 Michael Ginn

Syntactic Structure Processing in the Brain while Listening

Syntactic parsing is the task of assigning a syntactic structure to a sentence. There are two popular syntactic parsing methods: constituency and dependency parsing. Recent works have used syntactic embeddings based on constituency trees,…

Computation and Language · Computer Science 2023-02-20 Subba Reddy Oota, Mounika Marreddy, Manish Gupta, Bapi Raju Surampudi

Understanding Syntactic Generalization in Structure-inducing Language Models

Structure-inducing Language Models (SiLM) are trained on a self-supervised language modeling task, and induce a hierarchical sentence representation as a byproduct when processing an input. SiLMs couple strong syntactic generalization…

Computation and Language · Computer Science 2025-12-09 David Arps, Hassan Sajjad, Laura Kallmeyer

Representations of Syntax [MASK] Useful: Effects of Constituency and Dependency Structure in Recursive LSTMs

Sequence-based neural networks show significant sensitivity to syntactic structure, but they still perform less well on syntactic tasks than tree-based networks. Such tree-based networks can be provided with a constituency parse, a…

Computation and Language · Computer Science 2020-05-04 Michael A. Lepori, Tal Linzen, R. Thomas McCoy

Assessment of Pre-Trained Models Across Languages and Grammars

We present an approach for assessing how multilingual large language models (LLMs) learn syntax in terms of multi-formalism syntactic structures. We aim to recover constituent and dependency structures by casting parsing as sequence…

Computation and Language · Computer Science 2023-09-21 Alberto Muñoz-Ortiz, David Vilares, Carlos Gómez-Rodríguez

Word Representations, Tree Models and Syntactic Functions

Word representations induced from models with discrete latent variables (e.g. HMMs) have been shown to be beneficial in many NLP applications. In this work, we exploit labeled syntactic dependency trees and formalize the induction problem…

Computation and Language · Computer Science 2016-02-08 Simon Šuster, Gertjan van Noord, Ivan Titov

Structural Persistence in Language Models: Priming as a Window into Abstract Language Representations

We investigate the extent to which modern, neural language models are susceptible to structural priming, the phenomenon whereby the structure of a sentence makes the same structure more probable in a follow-up sentence. We explore how…

Computation and Language · Computer Science 2022-06-30 Arabella Sinclair, Jaap Jumelet, Willem Zuidema, Raquel Fernández

Multilingual Syntax-aware Language Modeling through Dependency Tree Conversion

Incorporating stronger syntactic biases into neural language models (LMs) is a long-standing goal, but research in this area often focuses on modeling English text, where constituent treebanks are readily available. Extending constituent…

Computation and Language · Computer Science 2022-04-20 Shunsuke Kando, Hiroshi Noji, Yusuke Miyao

Hierarchical Representation in Neural Language Models: Suppression and Recovery of Expectations

Deep learning sequence models have led to a marked increase in performance for a range of Natural Language Processing tasks, but it remains an open question whether they are able to induce proper hierarchical generalizations for…

Computation and Language · Computer Science 2019-06-11 Ethan Wilcox, Roger Levy, Richard Futrell

Jointly Learning Sentence Embeddings and Syntax with Unsupervised Tree-LSTMs

We introduce a neural network that represents sentences by composing their words according to induced binary parse trees. We use Tree-LSTM as our composition function, applied along a tree structure found by a fully differentiable natural…

Computation and Language · Computer Science 2020-01-16 Jean Maillard, Stephen Clark, Dani Yogatama

Word Interdependence Exposes How LSTMs Compose Representations

Recent work in NLP shows that LSTM language models capture compositional structure in language data. For a closer look at how these representations are composed hierarchically, we present a novel measure of interdependence between word…

Computation and Language · Computer Science 2020-04-29 Naomi Saphra, Adam Lopez