Related papers: Word Ordering Without Syntax

N-gram Language Modeling using Recurrent Neural Network Estimation

We investigate the effective memory depth of RNN models by using them for $n$-gram language model (LM) smoothing. Experiments on a small corpus (UPenn Treebank, one million words of training data and 10k vocabulary) have found the LSTM cell…

Computation and Language · Computer Science 2017-06-21 Ciprian Chelba , Mohammad Norouzi , Samy Bengio

Neural language modeling of free word order argument structure

Neural language models trained with a predictive or masked objective have proven successful at capturing short and long distance syntactic dependencies. Here, we focus on verb argument structure in German, which has the interesting property…

Computation and Language · Computer Science 2021-03-05 Charlotte Rochereau , Benoît Sagot , Emmanuel Dupoux

Neural Transition-based Syntactic Linearization

The task of linearization is to find a grammatical order given a set of words. Traditional models use statistical methods. Syntactic linearization systems, which generate a sentence along with its syntactic tree, have shown state-of-the-art…

Computation and Language · Computer Science 2018-10-24 Linfeng Song , Yue Zhang , Daniel Gildea

Evaluating the Ability of LSTMs to Learn Context-Free Grammars

While long short-term memory (LSTM) neural net architectures are designed to capture sequence information, human language is generally composed of hierarchical structures. This raises the question as to whether LSTMs can learn hierarchical…

Computation and Language · Computer Science 2018-11-08 Luzi Sennhauser , Robert C. Berwick

Using Priming to Uncover the Organization of Syntactic Representations in Neural Language Models

Neural language models (LMs) perform well on tasks that require sensitivity to syntactic structure. Drawing on the syntactic priming paradigm from psycholinguistics, we propose a novel technique to analyze the representations that enable…

Computation and Language · Computer Science 2019-09-25 Grusha Prasad , Marten van Schijndel , Tal Linzen

Exploiting Syntactic Structure for Natural Language Modeling

The thesis presents an attempt at using the syntactic structure in natural language for improved language models for speech recognition. The structured language model merges techniques in automatic parsing and language modeling using an…

Computation and Language · Computer Science 2007-05-23 Ciprian Chelba

How Order-Sensitive Are LLMs? OrderProbe for Deterministic Structural Reconstruction

Large language models (LLMs) excel at semantic understanding, yet their ability to reconstruct internal structure from scrambled inputs remains underexplored. Sentence-level restoration is ill-posed for automated evaluation because multiple…

Computation and Language · Computer Science 2026-01-21 Yingjie He , Zhaolu Kang , Kehan Jiang , Qianyuan Zhang , Jiachen Qian , Chunlei Meng , Yujie Feng , Yuan Wang , Jiabao Dou , Aming Wu , Leqi Zheng , Pengxiang Zhao , Jiaxin Liu , Zeyu Zhang , Lei Wang , Guansu Wang , Qishi Zhan , Xiaomin He , Meisheng Zhang , Jianyuan Ni

On the Role of Pre-trained Language Models in Word Ordering: A Case Study with BART

Word ordering is a constrained language generation task taking unordered words as input. Existing work uses linear models and neural networks for the task, yet pre-trained language models have not been studied in word ordering, let alone…

Computation and Language · Computer Science 2022-10-31 Zebin Ou , Meishan Zhang , Yue Zhang

Combining semantic and syntactic structure for language modeling

Structured language models for speech recognition have been shown to remedy the weaknesses of n-gram models. All current structured language models are, however, limited in that they do not take into account dependencies between…

Computation and Language · Computer Science 2007-05-23 Rens Bod

Overestimation of Syntactic Representationin Neural Language Models

With the advent of powerful neural language models over the last few years, research attention has increasingly focused on what aspects of language they represent that make them so successful. Several testing methodologies have been…

Computation and Language · Computer Science 2023-05-26 Jordan Kodner , Nitish Gupta

Explicitly Modeling Syntax in Language Models with Incremental Parsing and a Dynamic Oracle

Syntax is fundamental to our thinking about language. Failing to capture the structure of input language could lead to generalization problems and over-parametrization. In the present work, we propose a new syntax-aware language model:…

Computation and Language · Computer Science 2021-05-12 Yikang Shen , Shawn Tan , Alessandro Sordoni , Siva Reddy , Aaron Courville

Masked Language Modeling and the Distributional Hypothesis: Order Word Matters Pre-training for Little

A possible explanation for the impressive performance of masked language model (MLM) pre-training is that such models have learned to represent the syntactic structures prevalent in classical NLP pipelines. In this paper, we propose a…

Computation and Language · Computer Science 2021-09-13 Koustuv Sinha , Robin Jia , Dieuwke Hupkes , Joelle Pineau , Adina Williams , Douwe Kiela

OrderSum: Semantic Sentence Ordering for Extractive Summarization

There are two main approaches to recent extractive summarization: the sentence-level framework, which selects sentences to include in a summary individually, and the summary-level framework, which generates multiple candidate summaries and…

Computation and Language · Computer Science 2025-02-25 Taewan Kwon , Sangyong Lee

Assessing the Ability of LSTMs to Learn Syntax-Sensitive Dependencies

The success of long short-term memory (LSTM) neural networks in language processing is typically attributed to their ability to capture long-distance statistical regularities. Linguistic regularities are often sensitive to syntactic…

Computation and Language · Computer Science 2016-11-07 Tal Linzen , Emmanuel Dupoux , Yoav Goldberg

Semantically Conditioned LSTM-based Natural Language Generation for Spoken Dialogue Systems

Natural language generation (NLG) is a critical component of spoken dialogue and it has a significant impact both on usability and perceived quality. Most NLG systems in common use employ rules and heuristics and tend to generate rigid and…

Computation and Language · Computer Science 2015-08-27 Tsung-Hsien Wen , Milica Gasic , Nikola Mrksic , Pei-Hao Su , David Vandyke , Steve Young

Ordered Neurons: Integrating Tree Structures into Recurrent Neural Networks

Natural language is hierarchically structured: smaller units (e.g., phrases) are nested within larger units (e.g., clauses). When a larger constituent ends, all of the smaller constituents that are nested within it must also be closed.…

Computation and Language · Computer Science 2019-05-09 Yikang Shen , Shawn Tan , Alessandro Sordoni , Aaron Courville

A Comparison of Neural Models for Word Ordering

We compare several language models for the word-ordering task and propose a new bag-to-sequence neural model based on attention-based sequence-to-sequence models. We evaluate the model on a large German WMT data set where it significantly…

Computation and Language · Computer Science 2017-08-08 Eva Hasler , Felix Stahlberg , Marcus Tomalin , Adri`a de Gispert , Bill Byrne

Lattice Rescoring Strategies for Long Short Term Memory Language Models in Speech Recognition

Recurrent neural network (RNN) language models (LMs) and Long Short Term Memory (LSTM) LMs, a variant of RNN LMs, have been shown to outperform traditional N-gram LMs on speech recognition tasks. However, these models are computationally…

Machine Learning · Statistics 2017-11-16 Shankar Kumar , Michael Nirschl , Daniel Holtmann-Rice , Hank Liao , Ananda Theertha Suresh , Felix Yu

Evaluating LLMs' Reasoning Over Ordered Procedural Steps

Reasoning over procedural sequences, where the order of steps directly impacts outcomes, is a critical capability for large language models (LLMs). In this work, we study the task of reconstructing globally ordered sequences from shuffled…

Computation and Language · Computer Science 2025-11-18 Adrita Anika , Md Messal Monem Miah

Expoiting Syntactic Structure for Language Modeling

The paper presents a language model that develops syntactic structure and uses it to extract meaningful information from the word history, thus enabling the use of long distance dependencies. The model assigns probability to every joint…

Computation and Language · Computer Science 2007-05-23 Ciprian Chelba , Frederick Jelinek