Related papers: Learning Context-Free Languages with Nondeterminis…

Nondeterministic Stacks in Neural Networks

Human language is full of compositional syntactic structures, and although neural networks have contributed to groundbreaking improvements in computer systems that process language, widely-used neural network architectures still exhibit…

Computation and Language · Computer Science 2023-05-19 Brian DuSell

Learning Hierarchical Structures with Differentiable Nondeterministic Stacks

Learning hierarchical structures in sequential data -- from simple algorithmic patterns to natural language -- in a reliable, generalizable way remains a challenging problem for neural language models. Past work has shown that recurrent…

Computation and Language · Computer Science 2022-12-01 Brian DuSell , David Chiang

The Surprising Computational Power of Nondeterministic Stack RNNs

Traditional recurrent neural networks (RNNs) have a fixed, finite number of memory cells. In theory (assuming bounded range and precision), this limits their formal language recognition power to regular languages, and in practice, RNNs have…

Computation and Language · Computer Science 2023-03-14 Brian DuSell , David Chiang

Context-Free Transductions with Neural Stacks

This paper analyzes the behavior of stack-augmented recurrent neural network (RNN) models. Due to the architectural similarity between stack RNNs and pushdown transducers, we train stack RNN models on a number of tasks, including string…

Neural and Evolutionary Computing · Computer Science 2018-09-11 Yiding Hao , William Merrill , Dana Angluin , Robert Frank , Noah Amsel , Andrew Benz , Simon Mendelsohn

An automata characterisation for multiple context-free languages

We introduce tree stack automata as a new class of automata with storage and identify a restricted form of tree stack automata that recognises exactly the multiple context-free languages.

Formal Languages and Automata Theory · Computer Science 2016-09-26 Tobias Denkinger

Bearing Syntactic Fruit with Stack-Augmented Neural Networks

Any finite set of training data is consistent with an infinite number of hypothetical algorithms that could have generated it. Studies have shown that when human children learn language, they consistently favor hypotheses based on…

Computation and Language · Computer Science 2025-11-06 Brian DuSell , Ryan Cotterell

Reservoir Stack Machines

Memory-augmented neural networks equip a recurrent neural network with an explicit memory to support tasks that require information storage without interference over long times. A key motivation for such research is to perform classic…

Neural and Evolutionary Computing · Computer Science 2021-07-27 Benjamin Paaßen , Alexander Schulz , Barbara Hammer

Randomized Deep Structured Prediction for Discourse-Level Processing

Expressive text encoders such as RNNs and Transformer Networks have been at the center of NLP models in recent work. Most of the effort has focused on sentence-level tasks, capturing the dependencies between words in a single sentence, or…

Computation and Language · Computer Science 2021-09-15 Manuel Widmoser , Maria Leonor Pacheco , Jean Honorio , Dan Goldwasser

Finding Syntactic Representations in Neural Stacks

Neural network architectures have been augmented with differentiable stacks in order to introduce a bias toward learning hierarchy-sensitive regularities. It has, however, proven difficult to assess the degree to which such a bias is…

Computation and Language · Computer Science 2019-06-05 William Merrill , Lenny Khazan , Noah Amsel , Yiding Hao , Simon Mendelsohn , Robert Frank

A Transformer with Stack Attention

Natural languages are believed to be (mildly) context-sensitive. Despite underpinning remarkably capable large language models, transformers are unable to model many context-free language tasks. In an attempt to address this limitation in…

Computation and Language · Computer Science 2024-05-15 Jiaoda Li , Jennifer C. White , Mrinmaya Sachan , Ryan Cotterell

Deformable Stacked Structure for Named Entity Recognition

Neural architecture for named entity recognition has achieved great success in the field of natural language processing. Currently, the dominating architecture consists of a bi-directional recurrent neural network (RNN) as the encoder and a…

Computation and Language · Computer Science 2018-10-01 Shuyang Cao , Xipeng Qiu , Xuanjing Huang

The Neural Network Pushdown Automaton: Model, Stack and Learning Simulations

In order for neural networks to learn complex languages or grammars, they must have sufficient computational power or resources to recognize or generate such languages. Though many approaches have been discussed, one ob- vious approach to…

Artificial Intelligence · Computer Science 2017-11-17 G. Z. Sun , C. L. Giles , H. H. Chen , Y. C. Lee

Contextualized Non-local Neural Networks for Sequence Learning

Recently, a large number of neural mechanisms and models have been proposed for sequence learning, of which self-attention, as exemplified by the Transformer model, and graph neural networks (GNNs) have attracted much attention. In this…

Computation and Language · Computer Science 2018-11-22 Pengfei Liu , Shuaichen Chang , Xuanjing Huang , Jian Tang , Jackie Chi Kit Cheung

Recurrent Neural Networks with Mixed Hierarchical Structures for Natural Language Processing

Hierarchical structures exist in both linguistics and Natural Language Processing (NLP) tasks. How to design RNNs to learn hierarchical representations of natural languages remains a long-standing challenge. In this paper, we define two…

Computation and Language · Computer Science 2021-06-07 Zhaoxin Luo , Michael Zhu

Deep Structured Output Learning for Unconstrained Text Recognition

We develop a representation suitable for the unconstrained recognition of words in natural images: the general case of no fixed lexicon and unknown length. To this end we propose a convolutional neural network (CNN) based architecture which…

Computer Vision and Pattern Recognition · Computer Science 2015-04-13 Max Jaderberg , Karen Simonyan , Andrea Vedaldi , Andrew Zisserman

The Neural State Pushdown Automata

In order to learn complex grammars, recurrent neural networks (RNNs) require sufficient computational resources to ensure correct grammar recognition. A widely-used approach to expand model capacity would be to couple an RNN to an external…

Neural and Evolutionary Computing · Computer Science 2019-09-23 Ankur Mali , Alexander Ororbia , C. Lee Giles

Learning Operations on a Stack with Neural Turing Machines

Multiple extensions of Recurrent Neural Networks (RNNs) have been proposed recently to address the difficulty of storing information over long time periods. In this paper, we experiment with the capacity of Neural Turing Machines (NTMs) to…

Machine Learning · Computer Science 2016-12-05 Tristan Deleu , Joseph Dureau

Generative and Discriminative Text Classification with Recurrent Neural Networks

We empirically characterize the performance of discriminative and generative LSTM models for text classification. We find that although RNN-based generative models are more powerful than their bag-of-words ancestors (e.g., they account for…

Machine Learning · Statistics 2017-05-29 Dani Yogatama , Chris Dyer , Wang Ling , Phil Blunsom

Speeding up Context-based Sentence Representation Learning with Non-autoregressive Convolutional Decoding

Context plays an important role in human language understanding, thus it may also be useful for machines learning vector representations of language. In this paper, we explore an asymmetric encoder-decoder structure for unsupervised…

Neural and Evolutionary Computing · Computer Science 2018-06-04 Shuai Tang , Hailin Jin , Chen Fang , Zhaowen Wang , Virginia R. de Sa

Recurrent Hierarchical Topic-Guided RNN for Language Generation

To simultaneously capture syntax and global semantics from a text corpus, we propose a new larger-context recurrent neural network (RNN) based language model, which extracts recurrent hierarchical semantic structure via a dynamic deep topic…

Computation and Language · Computer Science 2020-06-30 Dandan Guo , Bo Chen , Ruiying Lu , Mingyuan Zhou