Related papers: Nested LSTMs

200 papers

We propose a method of stacking multiple long short-term memory (LSTM) layers for modeling sentences. In contrast to the conventional stacked LSTMs where only hidden states are fed as input to the next layer, the suggested architecture…

Computation and Language · Computer Science 2019-11-04 Jihun Choi, Taeuk Kim, Sang-goo Lee

Language models, being at the heart of many NLP problems, are always of great interest to researchers. Neural language models come with the advantage of distributed representations and long range contexts. With its particular dynamics that…

Neural and Evolutionary Computing · Computer Science 2018-11-19 Thomas Cherian, Akshay Badola, Vineet Padmanabhan

In this short note, we present an extension of long short-term memory (LSTM) neural networks to using a depth gate to connect memory cells of adjacent layers. Doing so introduces a linear dependence between lower and upper layer recurrent…

Neural and Evolutionary Computing · Computer Science 2015-08-26 Kaisheng Yao, Trevor Cohn, Katerina Vylomova, Kevin Duh, Chris Dyer

Long Short-Term Memory (LSTM) is a popular approach to boosting the ability of Recurrent Neural Networks to store longer term temporal information. The capacity of an LSTM network can be increased by widening and adding layers. However,…

Machine Learning · Statistics 2017-12-14 Zhen He, Shaobing Gao, Liang Xiao, Daxue Liu, Hangen He, David Barber

Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) architecture that has been designed to address the vanishing and exploding gradient problems of conventional RNNs. Unlike feedforward neural networks, RNNs have cyclic…

Neural and Evolutionary Computing · Computer Science 2014-02-06 Haşim Sak, Andrew Senior, Françoise Beaufays

Long Short-Term Memory (LSTM) is a prominent recurrent neural network for extracting dependencies from sequential data such as time-series and multi-view data, having achieved impressive results for different visual recognition tasks. A…

Computer Vision and Pattern Recognition · Computer Science 2020-06-03 Alireza Sepas-Moghaddam, Ali Etemad, Fernando Pereira, Paulo Lobato Correia

Previous RNN architectures have largely been superseded by LSTM, or "Long Short-Term Memory". Since its introduction, there have been many variations on this simple design. However, it is still widely used and we are not aware of a…

Neural and Evolutionary Computing · Computer Science 2017-04-03 Andrew Pulver, Siwei Lyu

The chain-structured long short-term memory (LSTM) has been shown to be effective in a wide range of problems such as speech recognition and machine translation. In this paper, we propose to extend it to tree structures, in which a memory cell…

Computation and Language · Computer Science 2015-03-18 Xiaodan Zhu, Parinaz Sobhani, Hongyu Guo

Recently, machine learning methods have provided a broad spectrum of original and efficient algorithms based on Deep Neural Networks (DNN) to automatically predict an outcome with respect to a sequence of inputs. Recurrent hidden cells…

Machine Learning · Computer Science 2017-02-15 Mohamed Bouaziz, Mohamed Morchid, Richard Dufour, Georges Linarès, Renato De Mori

The Long Short-Term Memory (LSTM) layer is an important advancement in the field of neural networks and machine learning, allowing for effective training and impressive inference performance. LSTM-based neural networks have been…

Neural and Evolutionary Computing · Computer Science 2019-01-04 Daniel Kent, Fathi M. Salem

Recurrent Neural Networks (RNN), Long Short-Term Memory Networks (LSTM), and Memory Networks which contain memory are popularly used to learn patterns in sequential data. Sequential data has long sequences that hold relationships. RNN can…

Computation and Language · Computer Science 2019-04-22 Anupiya Nugaliyadde, Kok Wai Wong, Ferdous Sohel, Hong Xie

We present a novel LSTM cell architecture capable of learning both intra- and inter-perspective relationships available in visual sequences captured from multiple perspectives. Our architecture adopts a novel recurrent joint learning…

Computer Vision and Pattern Recognition · Computer Science 2021-05-07 Alireza Sepas-Moghaddam, Fernando Pereira, Paulo Lobato Correia, Ali Etemad

Recently, neural networks have achieved great success on sentiment classification due to their ability to alleviate feature engineering. However, one of the remaining challenges is to model long texts in document-level sentiment…

Computation and Language · Computer Science 2016-10-18 Jiacheng Xu, Danlu Chen, Xipeng Qiu, Xuangjing Huang

Long short-term memory (LSTM) is one of the robust recurrent neural network architectures for learning sequential data. However, it requires considerable computational power to learn and implement both software and hardware aspects. This…

Machine Learning · Computer Science 2023-01-13 Nelly Elsayed, Zag ElSayed, Anthony S. Maida

We introduce multiplicative LSTM (mLSTM), a recurrent neural network architecture for sequence modelling that combines the long short-term memory (LSTM) and multiplicative recurrent neural network architectures. mLSTM is characterised by…

Neural and Evolutionary Computing · Computer Science 2017-10-13 Ben Krause, Liang Lu, Iain Murray, Steve Renals

Long short-term memory (LSTM) based acoustic modeling methods have recently been shown to give state-of-the-art performance on some speech recognition tasks. To achieve a further performance improvement, in this research, deep extensions on…

Computation and Language · Computer Science 2015-05-12 Xiangang Li, Xihong Wu

Recurrent Neural Network (RNN) and one of its specific architectures, Long Short-Term Memory (LSTM), have been widely used for sequence labeling. In this paper, we first enhance LSTM-based sequence labeling to explicitly model label…

Computation and Language · Computer Science 2016-09-01 Gakuto Kurata, Bing Xiang, Bowen Zhou, Mo Yu

Long Short-Term Memory (LSTM) networks, a type of recurrent neural network with a more complex computational unit, have been successfully applied to a variety of sequence modeling tasks. In this paper we develop Tree Long Short-Term Memory…

Computation and Language · Computer Science 2016-04-05 Xingxing Zhang, Liang Lu, Mirella Lapata

Recurrent Neural Networks (RNNs), and specifically a variant with Long Short-Term Memory (LSTM), are enjoying renewed interest as a result of successful applications in a wide range of machine learning problems that involve sequential data.…

Machine Learning · Computer Science 2015-11-18 Andrej Karpathy, Justin Johnson, Li Fei-Fei

Recently, there has been interest in multiplicative recurrent neural networks for language modeling. Indeed, simple Recurrent Neural Networks (RNNs) encounter difficulties recovering from past mistakes when generating sequences due to high…

Machine Learning · Computer Science 2019-07-02 Diego Maupomé, Marie-Jean Meurs