English
Related papers

Related papers: Multilingual Syntax-aware Language Modeling throug…

200 papers

We present an approach for assessing how multilingual large language models (LLMs) learn syntax in terms of multi-formalism syntactic structures. We aim to recover constituent and dependency structures by casting parsing as sequence…

Computation and Language · Computer Science 2023-09-21 Alberto Muñoz-Ortiz , David Vilares , Carlos Gómez-Rodríguez

Machine translation systems require semantic knowledge and grammatical understanding. Neural machine translation (NMT) systems often assume this information is captured by an attention mechanism and a decoder that ensures fluency. Recent…

Computation and Language · Computer Science 2018-05-29 Ke Tran , Yonatan Bisk

Syntactic Language Models (SLMs) can be trained efficiently to reach relatively high performance; however, they have trouble with inference efficiency due to the explicit generation of syntactic structures. In this paper, we propose a new…

Computation and Language · Computer Science 2025-08-20 Ryo Yoshida , Taiga Someya , Yohei Oseki

A range of studies have concluded that neural word prediction models can distinguish grammatical from ungrammatical sentences with high accuracy. However, these studies are based primarily on monolingual evidence from English. To…

Computation and Language · Computer Science 2020-05-22 Aaron Mueller , Garrett Nicolai , Panayiota Petrou-Zeniou , Natalia Talmina , Tal Linzen

It is now established that modern neural language models can be successfully trained on multiple languages simultaneously without changes to the underlying architecture. But what kind of knowledge is really shared among languages within…

Computation and Language · Computer Science 2021-04-15 Prajit Dhar , Arianna Bisazza

The paper investigates the use of richer syntactic dependencies in the structured language model (SLM). We present two simple methods of enriching the dependencies in the syntactic parse trees used for intializing the SLM. We evaluate the…

Computation and Language · Computer Science 2007-05-23 Ciprian Chelba , Peng Xu

While compositional accounts of human language understanding are based on a hierarchical tree-like process, neural models like transformers lack a direct inductive bias for such tree structures. Introducing syntactic inductive biases could…

Computation and Language · Computer Science 2025-03-25 Ananjan Nandi , Christopher D. Manning , Shikhar Murty

Syntactic language models (SLMs) enhance Transformers by incorporating syntactic biases through the modeling of linearized syntactic parse trees alongside surface sentences. This paper focuses on compositional SLMs that are based on…

Computation and Language · Computer Science 2025-07-01 Yida Zhao , Hao Xve , Xiang Hu , Kewei Tu

Various models have been proposed to incorporate knowledge of syntactic structures into neural language models. However, previous works have relied heavily on elaborate components for a specific language model, usually recurrent neural…

Computation and Language · Computer Science 2022-03-22 Zhixian Yang , Xiaojun Wan

Treebank translation is a promising method for cross-lingual transfer of syntactic dependency knowledge. The basic idea is to map dependency arcs from a source treebank to its target translation according to word alignments. This method,…

Computation and Language · Computer Science 2019-09-06 Zhang Meishan , Zhang Yue , Fu Guohong

The success of long short-term memory (LSTM) neural networks in language processing is typically attributed to their ability to capture long-distance statistical regularities. Linguistic regularities are often sensitive to syntactic…

Computation and Language · Computer Science 2016-11-07 Tal Linzen , Emmanuel Dupoux , Yoav Goldberg

Neural language models (LMs) are typically trained using only lexical features, such as surface forms of words. In this paper, we argue this deprives the LM of crucial syntactic signals that can be detected at high confidence using existing…

Computation and Language · Computer Science 2018-03-13 Duncan Blythe , Alan Akbik , Roland Vollgraf

Sequence-based neural networks show significant sensitivity to syntactic structure, but they still perform less well on syntactic tasks than tree-based networks. Such tree-based networks can be provided with a constituency parse, a…

Computation and Language · Computer Science 2020-05-04 Michael A. Lepori , Tal Linzen , R. Thomas McCoy

State-of-the-art LSTM language models trained on large corpora learn sequential contingencies in impressive detail and have been shown to acquire a number of non-local grammatical dependencies with some success. Here we investigate whether…

Computation and Language · Computer Science 2019-04-09 Ethan Wilcox , Peng Qian , Richard Futrell , Miguel Ballesteros , Roger Levy

Despite rapid progress in large language models (LLMs), their performance on a vast majority of languages remains unsatisfactory. In this paper, we study building language-specific LLMs by adapting monolingual and multilingual LLMs. We…

Computation and Language · Computer Science 2024-10-31 Atula Tejaswi , Nilesh Gupta , Eunsol Choi

Recent advances in Neural Machine Translation (NMT) show that adding syntactic information to NMT systems can improve the quality of their translations. Most existing work utilizes some specific types of linguistically-inspired tree…

Computation and Language · Computer Science 2018-08-29 Xinyi Wang , Hieu Pham , Pengcheng Yin , Graham Neubig

It is commonly believed that knowledge of syntactic structure should improve language modeling. However, effectively and computationally efficiently incorporating syntactic structure into neural language models has been a challenging topic.…

Computation and Language · Computer Science 2020-05-13 Wenyu Du , Zhouhan Lin , Yikang Shen , Timothy J. O'Donnell , Yoshua Bengio , Yue Zhang

Most of modern neural machine translation (NMT) models are based on an encoder-decoder framework with an attention mechanism. While they perform well on standard datasets, they can have trouble in translation of long inputs that are rare or…

Computation and Language · Computer Science 2026-03-31 Shuhei Kondo , Katsuhito Sudoh , Yuji Matsumoto

In the last half-decade, the field of natural language processing (NLP) has undergone two major transitions: the switch to neural networks as the primary modeling paradigm and the homogenization of the training regime (pre-train, then…

Computation and Language · Computer Science 2021-10-19 Artur Kulmizev , Joakim Nivre

Recent work on language modelling has shifted focus from count-based models to neural models. In these works, the words in each sentence are always considered in a left-to-right order. In this paper we show how we can improve the…

Computation and Language · Computer Science 2015-07-07 Piotr Mirowski , Andreas Vlachos
‹ Prev 1 2 3 10 Next ›