Related papers: Subregular Complexity and Deep Learning

On Evaluating the Generalization of LSTM Models in Formal Languages

Recurrent Neural Networks (RNNs) are theoretically Turing-complete and established themselves as a dominant model for language processing. Yet, there still remains an uncertainty regarding their language learning capabilities. In this…

Computation and Language · Computer Science 2018-11-05 Mirac Suzgun , Yonatan Belinkov , Stuart M. Shieber

RNNs as psycholinguistic subjects: Syntactic state and grammatical dependency

Recurrent neural networks (RNNs) are the state of the art in sequence modeling for natural language. However, it remains poorly understood what grammatical characteristics of natural language they implicitly learn and represent as a…

Computation and Language · Computer Science 2018-09-06 Richard Futrell , Ethan Wilcox , Takashi Morita , Roger Levy

Training Neural Networks as Recognizers of Formal Languages

Characterizing the computational power of neural network architectures in terms of formal language theory remains a crucial line of research, as it describes lower and upper bounds on the reasoning capabilities of modern AI. However, when…

Computation and Language · Computer Science 2025-04-15 Alexandra Butoi , Ghazal Khalighinejad , Anej Svete , Josef Valvoda , Ryan Cotterell , Brian DuSell

Long Short-Term Memory Based Recurrent Neural Network Architectures for Large Vocabulary Speech Recognition

Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) architecture that has been designed to address the vanishing and exploding gradient problems of conventional RNNs. Unlike feedforward neural networks, RNNs have cyclic…

Neural and Evolutionary Computing · Computer Science 2014-02-06 Haşim Sak , Andrew Senior , Françoise Beaufays

Recurrent babbling: evaluating the acquisition of grammar from limited input data

Recurrent Neural Networks (RNNs) have been shown to capture various aspects of syntax from raw linguistic input. In most previous experiments, however, learning happens over unrealistic corpora, which do not reflect the type and amount of…

Computation and Language · Computer Science 2024-11-12 Ludovica Pannitto , Aurélie Herbelot

Visualizing and Understanding Recurrent Networks

Recurrent Neural Networks (RNNs), and specifically a variant with Long Short-Term Memory (LSTM), are enjoying renewed interest as a result of successful applications in a wide range of machine learning problems that involve sequential data.…

Machine Learning · Computer Science 2015-11-18 Andrej Karpathy , Justin Johnson , Li Fei-Fei

Learning Over Long Time Lags

The advantage of recurrent neural networks (RNNs) in learning dependencies between time-series data has distinguished RNNs from other deep learning models. Recently, many advances are proposed in this emerging field. However, there is a…

Neural and Evolutionary Computing · Computer Science 2016-02-16 Hojjat Salehinejad

Long Short-Term Memory Spiking Networks and Their Applications

Recent advances in event-based neuromorphic systems have resulted in significant interest in the use and development of spiking neural networks (SNNs). However, the non-differentiable nature of spiking neurons makes SNNs incompatible with…

Neural and Evolutionary Computing · Computer Science 2020-07-10 Ali Lotfi Rezaabad , Sriram Vishwanath

How much complexity does an RNN architecture need to learn syntax-sensitive dependencies?

Long short-term memory (LSTM) networks and their variants are capable of encapsulating long-range dependencies, which is evident from their performance on a variety of linguistic tasks. On the other hand, simple recurrent networks (SRNs),…

Computation and Language · Computer Science 2020-05-26 Gantavya Bhatt , Hritik Bansal , Rishubh Singh , Sumeet Agarwal

Learning to Execute

Recurrent Neural Networks (RNNs) with Long Short-Term Memory units (LSTM) are widely used because they are expressive and are easy to train. Our interest lies in empirically evaluating the expressiveness and the learnability of LSTMs in the…

Neural and Evolutionary Computing · Computer Science 2015-11-24 Wojciech Zaremba , Ilya Sutskever

Constructing Long Short-Term Memory based Deep Recurrent Neural Networks for Large Vocabulary Speech Recognition

Long short-term memory (LSTM) based acoustic modeling methods have recently been shown to give state-of-the-art performance on some speech recognition tasks. To achieve a further performance improvement, in this research, deep extensions on…

Computation and Language · Computer Science 2015-05-12 Xiangang Li , Xihong Wu

Deep LSTM for Large Vocabulary Continuous Speech Recognition

Recurrent neural networks (RNNs), especially long short-term memory (LSTM) RNNs, are effective network for sequential task like speech recognition. Deeper LSTM models perform well on large vocabulary continuous speech recognition, because…

Computation and Language · Computer Science 2017-03-22 Xu Tian , Jun Zhang , Zejun Ma , Yi He , Juan Wei , Peihao Wu , Wenchang Situ , Shuai Li , Yang Zhang

Resurrecting Recurrent Neural Networks for Long Sequences

Recurrent Neural Networks (RNNs) offer fast inference on long sequences but are hard to optimize and slow to train. Deep state-space models (SSMs) have recently been shown to perform remarkably well on long sequence modeling tasks, and have…

Machine Learning · Computer Science 2023-03-14 Antonio Orvieto , Samuel L Smith , Albert Gu , Anushan Fernando , Caglar Gulcehre , Razvan Pascanu , Soham De

Coverage Guided Testing for Recurrent Neural Networks

Recurrent neural networks (RNNs) have been applied to a broad range of applications, including natural language processing, drug discovery, and video recognition. Their vulnerability to input perturbation is also known. Aligning with a view…

Machine Learning · Computer Science 2021-05-14 Wei Huang , Youcheng Sun , Xingyu Zhao , James Sharp , Wenjie Ruan , Jie Meng , Xiaowei Huang

Understanding LSTM -- a tutorial into Long Short-Term Memory Recurrent Neural Networks

Long Short-Term Memory Recurrent Neural Networks (LSTM-RNN) are one of the most powerful dynamic classifiers publicly known. The network itself and the related learning algorithms are reasonably well documented to get an idea how it works.…

Neural and Evolutionary Computing · Computer Science 2019-09-23 Ralf C. Staudemeyer , Eric Rothstein Morris

Assessing the Unitary RNN as an End-to-End Compositional Model of Syntax

We show that both an LSTM and a unitary-evolution recurrent neural network (URN) can achieve encouraging accuracy on two types of syntactic patterns: context-free long distance agreement, and mildly context-sensitive cross serial…

Computation and Language · Computer Science 2022-08-12 Jean-Philippe Bernardy , Shalom Lappin

Evaluating Recurrent Neural Network Explanations

Recently, several methods have been proposed to explain the predictions of recurrent neural networks (RNNs), in particular of LSTMs. The goal of these methods is to understand the network's decisions by assigning to each input variable,…

Machine Learning · Computer Science 2019-06-05 Leila Arras , Ahmed Osman , Klaus-Robert Müller , Wojciech Samek

Depth-Adaptive Graph Recurrent Network for Text Classification

The Sentence-State LSTM (S-LSTM) is a powerful and high efficient graph recurrent network, which views words as nodes and performs layer-wise recurrent steps between them simultaneously. Despite its successes on text representations, the…

Computation and Language · Computer Science 2020-03-03 Yijin Liu , Fandong Meng , Yufeng Chen , Jinan Xu , Jie Zhou

Using LSTMs to Model the Java Programming Language

Recurrent neural networks (RNNs), specifically long-short term memory networks (LSTMs), can model natural language effectively. This research investigates the ability for these same LSTMs to perform next "word" prediction on the Java…

Software Engineering · Computer Science 2019-09-02 Brendon Boldt

Achieving Online Regression Performance of LSTMs with Simple RNNs

Recurrent Neural Networks (RNNs) are widely used for online regression due to their ability to generalize nonlinear temporal dependencies. As an RNN model, Long-Short-Term-Memory Networks (LSTMs) are commonly preferred in practice, as these…

Machine Learning · Computer Science 2021-06-01 N. Mert Vural , Fatih Ilhan , Selim F. Yilmaz , Salih Ergüt , Suleyman S. Kozat