Related papers: Linguistic Structure Induction from Language Model…

200 papers

Understanding how sentences are internally represented in the human brain, as well as in large language models (LLMs) such as ChatGPT, is a major challenge for cognitive science. Classic linguistic theories propose that the brain represents…

Computation and Language · Computer Science 2024-05-29 Wei Liu, Ming Xiang, Nai Ding

Syntactic language models (SLMs) enhance Transformers by incorporating syntactic biases through the modeling of linearized syntactic parse trees alongside surface sentences. This paper focuses on compositional SLMs that are based on…

Computation and Language · Computer Science 2025-07-01 Yida Zhao, Hao Xve, Xiang Hu, Kewei Tu

Syntactic structures used to play a vital role in natural language processing (NLP), but since the deep learning revolution, NLP has been gradually dominated by neural models that do not consider syntactic structures in their design. One…

Computation and Language · Computer Science 2023-11-28 Haoyi Wu, Kewei Tu

In this thesis, we try to build a connection between the two schools by introducing syntactic inductive biases for deep learning models. We propose two families of inductive biases, one for constituency structure and another one for…

Machine Learning · Computer Science 2022-06-13 Yikang Shen

While vector-based language representations from pretrained language models have set a new standard for many NLP tasks, there is not yet a complete accounting of their inner workings. In particular, it is not entirely clear what aspects of…

Computation and Language · Computer Science 2021-04-16 Matteo Alleman, Jonathan Mamou, Miguel A Del Rio, Hanlin Tang, Yoon Kim, SueYeon Chung

There are two major classes of natural language grammar -- the dependency grammar that models one-to-one correspondences between words and the constituency grammar that models the assembly of one or several corresponded words. While…

Computation and Language · Computer Science 2021-07-13 Yikang Shen, Yi Tay, Che Zheng, Dara Bahri, Donald Metzler, Aaron Courville

In this paper, we investigate to which extent contextual neural language models (LMs) implicitly learn syntactic structure. More concretely, we focus on constituent structure as represented in the Penn Treebank (PTB). Using standard probing…

Computation and Language · Computer Science 2022-04-14 David Arps, Younes Samih, Laura Kallmeyer, Hassan Sajjad

Neural language models (LMs) perform well on tasks that require sensitivity to syntactic structure. Drawing on the syntactic priming paradigm from psycholinguistics, we propose a novel technique to analyze the representations that enable…

Computation and Language · Computer Science 2019-09-25 Grusha Prasad, Marten van Schijndel, Tal Linzen

Syntactic structure of sentences in a document substantially informs about its authorial writing style. Sentence representation learning has been widely explored in recent years and it has been shown that it improves the generalization of…

Computation and Language · Computer Science 2022-02-25 Fereshteh Jafariakinabad, Kien A. Hua

Linguists have long held that a key aspect of natural language syntax is the recursive organization of language units into constituent structures, and research has suggested that current state-of-the-art language models lack an inherent…

Computation and Language · Computer Science 2024-11-27 Michael Ginn

Syntactic parsing is the task of assigning a syntactic structure to a sentence. There are two popular syntactic parsing methods: constituency and dependency parsing. Recent works have used syntactic embeddings based on constituency trees,…

Computation and Language · Computer Science 2023-02-20 Subba Reddy Oota, Mounika Marreddy, Manish Gupta, Bapi Raju Surampud

Structure-inducing Language Models (SiLM) are trained on a self-supervised language modeling task, and induce a hierarchical sentence representation as a byproduct when processing an input. SiLMs couple strong syntactic generalization…

Computation and Language · Computer Science 2025-12-09 David Arps, Hassan Sajjad, Laura Kallmeyer

Sequence-based neural networks show significant sensitivity to syntactic structure, but they still perform less well on syntactic tasks than tree-based networks. Such tree-based networks can be provided with a constituency parse, a…

Computation and Language · Computer Science 2020-05-04 Michael A. Lepori, Tal Linzen, R. Thomas McCoy

We present an approach for assessing how multilingual large language models (LLMs) learn syntax in terms of multi-formalism syntactic structures. We aim to recover constituent and dependency structures by casting parsing as sequence…

Computation and Language · Computer Science 2023-09-21 Alberto Muñoz-Ortiz, David Vilares, Carlos Gómez-Rodríguez

Word representations induced from models with discrete latent variables (e.g. HMMs) have been shown to be beneficial in many NLP applications. In this work, we exploit labeled syntactic dependency trees and formalize the induction problem…

Computation and Language · Computer Science 2016-02-08 Simon Šuster, Gertjan van Noord, Ivan Titov

We investigate the extent to which modern, neural language models are susceptible to structural priming, the phenomenon whereby the structure of a sentence makes the same structure more probable in a follow-up sentence. We explore how…

Computation and Language · Computer Science 2022-06-30 Arabella Sinclair, Jaap Jumelet, Willem Zuidema, Raquel Fernández

Incorporating stronger syntactic biases into neural language models (LMs) is a long-standing goal, but research in this area often focuses on modeling English text, where constituent treebanks are readily available. Extending constituent…

Computation and Language · Computer Science 2022-04-20 Shunsuke Kando, Hiroshi Noji, Yusuke Miyao

Deep learning sequence models have led to a marked increase in performance for a range of Natural Language Processing tasks, but it remains an open question whether they are able to induce proper hierarchical generalizations for…

Computation and Language · Computer Science 2019-06-11 Ethan Wilcox, Roger Levy, Richard Futrell

We introduce a neural network that represents sentences by composing their words according to induced binary parse trees. We use Tree-LSTM as our composition function, applied along a tree structure found by a fully differentiable natural…

Computation and Language · Computer Science 2020-01-16 Jean Maillard, Stephen Clark, Dani Yogatama

Recent work in NLP shows that LSTM language models capture compositional structure in language data. For a closer look at how these representations are composed hierarchically, we present a novel measure of interdependence between word…

Computation and Language · Computer Science 2020-04-29 Naomi Saphra, Adam Lopez