Related papers: Multilingual Syntax-aware Language Modeling throug…

Assessment of Pre-Trained Models Across Languages and Grammars

We present an approach for assessing how multilingual large language models (LLMs) learn syntax in terms of multi-formalism syntactic structures. We aim to recover constituent and dependency structures by casting parsing as sequence…

Computation and Language · Computer Science 2023-09-21 Alberto Muñoz-Ortiz , David Vilares , Carlos Gómez-Rodríguez

Inducing Grammars with and for Neural Machine Translation

Machine translation systems require semantic knowledge and grammatical understanding. Neural machine translation (NMT) systems often assume this information is captured by an attention mechanism and a decoder that ensures fluency. Recent…

Computation and Language · Computer Science 2018-05-29 Ke Tran , Yonatan Bisk

Tree-Planted Transformers: Unidirectional Transformer Language Models with Implicit Syntactic Supervision

Syntactic Language Models (SLMs) can be trained efficiently to reach relatively high performance; however, they have trouble with inference efficiency due to the explicit generation of syntactic structures. In this paper, we propose a new…

Computation and Language · Computer Science 2025-08-20 Ryo Yoshida , Taiga Someya , Yohei Oseki

Cross-Linguistic Syntactic Evaluation of Word Prediction Models

A range of studies have concluded that neural word prediction models can distinguish grammatical from ungrammatical sentences with high accuracy. However, these studies are based primarily on monolingual evidence from English. To…

Computation and Language · Computer Science 2020-05-22 Aaron Mueller , Garrett Nicolai , Panayiota Petrou-Zeniou , Natalia Talmina , Tal Linzen

Understanding Cross-Lingual Syntactic Transfer in Multilingual Recurrent Neural Networks

It is now established that modern neural language models can be successfully trained on multiple languages simultaneously without changes to the underlying architecture. But what kind of knowledge is really shared among languages within…

Computation and Language · Computer Science 2021-04-15 Prajit Dhar , Arianna Bisazza

Richer Syntactic Dependencies for Structured Language Modeling

The paper investigates the use of richer syntactic dependencies in the structured language model (SLM). We present two simple methods of enriching the dependencies in the syntactic parse trees used for intializing the SLM. We evaluate the…

Computation and Language · Computer Science 2007-05-23 Ciprian Chelba , Peng Xu

Sneaking Syntax into Transformer Language Models with Tree Regularization

While compositional accounts of human language understanding are based on a hierarchical tree-like process, neural models like transformers lack a direct inductive bias for such tree structures. Introducing syntactic inductive biases could…

Computation and Language · Computer Science 2025-03-25 Ananjan Nandi , Christopher D. Manning , Shikhar Murty

A Systematic Study of Compositional Syntactic Transformer Language Models

Syntactic language models (SLMs) enhance Transformers by incorporating syntactic biases through the modeling of linearized syntactic parse trees alongside surface sentences. This paper focuses on compositional SLMs that are based on…

Computation and Language · Computer Science 2025-07-01 Yida Zhao , Hao Xve , Xiang Hu , Kewei Tu

Dependency-based Mixture Language Models

Various models have been proposed to incorporate knowledge of syntactic structures into neural language models. However, previous works have relied heavily on elaborate components for a specific language model, usually recurrent neural…

Computation and Language · Computer Science 2022-03-22 Zhixian Yang , Xiaojun Wan

Cross-Lingual Dependency Parsing Using Code-Mixed TreeBank

Treebank translation is a promising method for cross-lingual transfer of syntactic dependency knowledge. The basic idea is to map dependency arcs from a source treebank to its target translation according to word alignments. This method,…

Computation and Language · Computer Science 2019-09-06 Zhang Meishan , Zhang Yue , Fu Guohong

Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies

The success of long short-term memory (LSTM) neural networks in language processing is typically attributed to their ability to capture long-distance statistical regularities. Linguistic regularities are often sensitive to syntactic…

Computation and Language · Computer Science 2016-11-07 Tal Linzen , Emmanuel Dupoux , Yoav Goldberg

Syntax-Aware Language Modeling with Recurrent Neural Networks

Neural language models (LMs) are typically trained using only lexical features, such as surface forms of words. In this paper, we argue this deprives the LM of crucial syntactic signals that can be detected at high confidence using existing…

Computation and Language · Computer Science 2018-03-13 Duncan Blythe , Alan Akbik , Roland Vollgraf

Representations of Syntax [MASK] Useful: Effects of Constituency and Dependency Structure in Recursive LSTMs

Sequence-based neural networks show significant sensitivity to syntactic structure, but they still perform less well on syntactic tasks than tree-based networks. Such tree-based networks can be provided with a constituency parse, a…

Computation and Language · Computer Science 2020-05-04 Michael A. Lepori , Tal Linzen , R. Thomas McCoy

Structural Supervision Improves Learning of Non-Local Grammatical Dependencies

State-of-the-art LSTM language models trained on large corpora learn sequential contingencies in impressive detail and have been shown to acquire a number of non-local grammatical dependencies with some success. Here we investigate whether…

Computation and Language · Computer Science 2019-04-09 Ethan Wilcox , Peng Qian , Richard Futrell , Miguel Ballesteros , Roger Levy

Exploring Design Choices for Building Language-Specific LLMs

Despite rapid progress in large language models (LLMs), their performance on a vast majority of languages remains unsatisfactory. In this paper, we study building language-specific LLMs by adapting monolingual and multilingual LLMs. We…

Computation and Language · Computer Science 2024-10-31 Atula Tejaswi , Nilesh Gupta , Eunsol Choi

A Tree-based Decoder for Neural Machine Translation

Recent advances in Neural Machine Translation (NMT) show that adding syntactic information to NMT systems can improve the quality of their translations. Most existing work utilizes some specific types of linguistically-inspired tree…

Computation and Language · Computer Science 2018-08-29 Xinyi Wang , Hieu Pham , Pengcheng Yin , Graham Neubig

Exploiting Syntactic Structure for Better Language Modeling: A Syntactic Distance Approach

It is commonly believed that knowledge of syntactic structure should improve language modeling. However, effectively and computationally efficiently incorporating syntactic structure into neural language models has been a challenging topic.…

Computation and Language · Computer Science 2020-05-13 Wenyu Du , Zhouhan Lin , Yikang Shen , Timothy J. O'Donnell , Yoshua Bengio , Yue Zhang

Top-down string-to-dependency Neural Machine Translation

Most of modern neural machine translation (NMT) models are based on an encoder-decoder framework with an attention mechanism. While they perform well on standard datasets, they can have trouble in translation of long inputs that are rare or…

Computation and Language · Computer Science 2026-03-31 Shuhei Kondo , Katsuhito Sudoh , Yuji Matsumoto

Schr\"odinger's Tree -- On Syntax and Neural Language Models

In the last half-decade, the field of natural language processing (NLP) has undergone two major transitions: the switch to neural networks as the primary modeling paradigm and the homogenization of the training regime (pre-train, then…

Computation and Language · Computer Science 2021-10-19 Artur Kulmizev , Joakim Nivre

Dependency Recurrent Neural Language Models for Sentence Completion

Recent work on language modelling has shifted focus from count-based models to neural models. In these works, the words in each sentence are always considered in a left-to-right order. In this paper we show how we can improve the…

Computation and Language · Computer Science 2015-07-07 Piotr Mirowski , Andreas Vlachos