English
Related papers

Related papers: Recurrent Memory Array Structures

200 papers

We introduce multiplicative LSTM (mLSTM), a recurrent neural network architecture for sequence modelling that combines the long short-term memory (LSTM) and multiplicative recurrent neural network architectures. mLSTM is characterised by…

Neural and Evolutionary Computing · Computer Science 2017-10-13 Ben Krause , Liang Lu , Iain Murray , Steve Renals

Long Short-Term Memory (LSTM) is the primary recurrent neural networks architecture for acoustic modeling in automatic speech recognition systems. Residual learning is an efficient method to help neural networks converge easier and faster.…

Computation and Language · Computer Science 2017-08-21 Lu Huang , Jiasong Sun , Ji Xu , Yi Yang

Long short-term memory (LSTM) based acoustic modeling methods have recently been shown to give state-of-the-art performance on some speech recognition tasks. To achieve a further performance improvement, in this research, deep extensions on…

Computation and Language · Computer Science 2015-05-12 Xiangang Li , Xihong Wu

This paper is based on a machine learning project at the Norwegian University of Science and Technology, fall 2020. The project was initiated with a literature review on the latest developments within time-series forecasting methods in the…

Machine Learning · Computer Science 2021-05-17 Christian Bakke Vennerød , Adrian Kjærran , Erling Stray Bugge

Long Short-Term Memory (LSTM) units have the ability to memorise and use long-term dependencies between inputs to generate predictions on time series data. We introduce the concept of modifying the cell state (memory) of LSTMs using…

Machine Learning · Computer Science 2021-05-04 Vlad Velici , Adam Prügel-Bennett

Long Short-Term Memory (LSTM) is a popular approach to boosting the ability of Recurrent Neural Networks to store longer term temporal information. The capacity of an LSTM network can be increased by widening and adding layers. However,…

Machine Learning · Statistics 2017-12-14 Zhen He , Shaobing Gao , Liang Xiao , Daxue Liu , Hangen He , David Barber

This paper introduces Grid Long Short-Term Memory, a network of LSTM cells arranged in a multidimensional grid that can be applied to vectors, sequences or higher dimensional data such as images. The network differs from existing deep LSTM…

Neural and Evolutionary Computing · Computer Science 2016-01-08 Nal Kalchbrenner , Ivo Danihelka , Alex Graves

Long Short-Term Memory (LSTM) neural network models have become the cornerstone for sequential data modeling in numerous applications, ranging from natural language processing to time series forecasting. Despite their success, the problem…

Machine Learning · Statistics 2026-05-26 Fahad Mostafa

Because of their superior ability to preserve sequence information over time, Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have obtained strong results on a variety of…

Computation and Language · Computer Science 2015-06-02 Kai Sheng Tai , Richard Socher , Christopher D. Manning

The chain-structured long short-term memory (LSTM) has showed to be effective in a wide range of problems such as speech recognition and machine translation. In this paper, we propose to extend it to tree structures, in which a memory cell…

Computation and Language · Computer Science 2015-03-18 Xiaodan Zhu , Parinaz Sobhani , Hongyu Guo

We present five variants of the standard Long Short-term Memory (LSTM) recurrent neural networks by uniformly reducing blocks of adaptive parameters in the gating mechanisms. For simplicity, we refer to these models as LSTM1, LSTM2, LSTM3,…

Neural and Evolutionary Computing · Computer Science 2017-07-18 Atra Akandeh , Fathi M. Salem

The standard LSTM recurrent neural networks while very powerful in long-range dependency sequence applications have highly complex structure and relatively large (adaptive) parameters. In this work, we present empirical comparison between…

Neural and Evolutionary Computing · Computer Science 2017-01-13 Yuzhen Lu , Fathi M. Salem

Recurrent Neural Networks (RNNs) with Long Short-Term Memory units (LSTM) are widely used because they are expressive and are easy to train. Our interest lies in empirically evaluating the expressiveness and the learnability of LSTMs in the…

Neural and Evolutionary Computing · Computer Science 2015-11-24 Wojciech Zaremba , Ilya Sutskever

The standard LSTM, although it succeeds in the modeling long-range dependences, suffers from a highly complex structure that can be simplified through modifications to its gate units. This paper was to perform an empirical comparison…

Neural and Evolutionary Computing · Computer Science 2016-12-13 Yuzhen Lu

Recurrent Neural Networks (RNNs), and specifically a variant with Long Short-Term Memory (LSTM), are enjoying renewed interest as a result of successful applications in a wide range of machine learning problems that involve sequential data.…

Machine Learning · Computer Science 2015-11-18 Andrej Karpathy , Justin Johnson , Li Fei-Fei

LSTMs and other RNN variants have shown strong performance on character-level language modeling. These models are typically trained using truncated backpropagation through time, and it is common to assume that their success stems from their…

Computation and Language · Computer Science 2018-12-11 Rami Al-Rfou , Dokook Choe , Noah Constant , Mandy Guo , Llion Jones

We propose Nested LSTMs (NLSTM), a novel RNN architecture with multiple levels of memory. Nested LSTMs add depth to LSTMs via nesting as opposed to stacking. The value of a memory cell in an NLSTM is computed by an LSTM cell, which has its…

Computation and Language · Computer Science 2018-02-01 Joel Ruben Antony Moniz , David Krueger

Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) architecture that has been designed to address the vanishing and exploding gradient problems of conventional RNNs. Unlike feedforward neural networks, RNNs have cyclic…

Neural and Evolutionary Computing · Computer Science 2014-02-06 Haşim Sak , Andrew Senior , Françoise Beaufays

Long Short-Term Memory (LSTM) is one of the most widely used recurrent structures in sequence modeling. It aims to use gates to control information flow (e.g., whether to skip some information or not) in the recurrent computations, although…

Machine Learning · Computer Science 2018-06-11 Zhuohan Li , Di He , Fei Tian , Wei Chen , Tao Qin , Liwei Wang , Tie-Yan Liu

The transcription of handwritten text on images is one task in machine learning and one solution to solve it is using multi-dimensional recurrent neural networks (MDRNN) with connectionist temporal classification (CTC). The RNNs can contain…

Artificial Intelligence · Computer Science 2019-08-28 G. Leifert , T. Strauß , T. Grüning , R. Labahn
‹ Prev 1 2 3 10 Next ›