English
Related papers

Related papers: Parallel Long Short-Term Memory for Multi-stream C…

200 papers

Recurrent neural networks (RNNs) have shown outstanding performance on processing sequence data. However, they suffer from long training time, which demands parallel implementations of the training procedure. Parallelization of the training…

Neural and Evolutionary Computing · Computer Science 2015-11-25 Kyuyeon Hwang , Wonyong Sung

Long Short-Term Memory (LSTM) is a recurrent neural network (RNN) architecture that has been designed to address the vanishing and exploding gradient problems of conventional RNNs. Unlike feedforward neural networks, RNNs have cyclic…

Neural and Evolutionary Computing · Computer Science 2014-02-06 Haşim Sak , Andrew Senior , Françoise Beaufays

Long short-term memory (LSTM) recurrent neural networks (RNNs) have been shown to give state-of-the-art performance on many speech recognition tasks, as they are able to provide the learned dynamically changing contextual window of all…

Computation and Language · Computer Science 2016-10-12 Xiangang Li , Xihong Wu

Long short-term memory (LSTM) based acoustic modeling methods have recently been shown to give state-of-the-art performance on some speech recognition tasks. To achieve a further performance improvement, in this research, deep extensions on…

Computation and Language · Computer Science 2015-05-12 Xiangang Li , Xihong Wu

Language models, being at the heart of many NLP problems, are always of great interest to researchers. Neural language models come with the advantage of distributed representations and long range contexts. With its particular dynamics that…

Neural and Evolutionary Computing · Computer Science 2018-11-19 Thomas Cherian , Akshay Badola , Vineet Padmanabhan

Modern recurrent architectures, such as xLSTM and Mamba, have recently challenged the Transformer in language modeling. However, their structure constrains their applicability to sequences only or requires processing multi-dimensional data…

Machine Learning · Computer Science 2025-06-16 Korbinian Pöppel , Richard Freinschlag , Thomas Schmied , Wei Lin , Sepp Hochreiter

Time series prediction can be generalized as a process that extracts useful information from historical records and then determines future values. Learning long-range dependencies that are embedded in time series is often an obstacle for…

Neural and Evolutionary Computing · Computer Science 2018-10-25 Yuxiu Hua , Zhifeng Zhao , Rongpeng Li , Xianfu Chen , Zhiming Liu , Honggang Zhang

Recent advances in event-based neuromorphic systems have resulted in significant interest in the use and development of spiking neural networks (SNNs). However, the non-differentiable nature of spiking neurons makes SNNs incompatible with…

Neural and Evolutionary Computing · Computer Science 2020-07-10 Ali Lotfi Rezaabad , Sriram Vishwanath

Recurrent Neural Networks (RNNs) have become the state-of-the-art choice for extracting patterns from temporal sequences. However, current RNN models are ill-suited to process irregularly sampled data triggered by events generated in…

Machine Learning · Computer Science 2016-11-01 Daniel Neil , Michael Pfeiffer , Shih-Chii Liu

The Long Short-Term Memory (LSTM) recurrent neural network is capable of processing complex sequential information since it utilizes special gating schemes for learning representations from long input sequences. It has the potential to…

Computer Vision and Pattern Recognition · Computer Science 2019-05-14 Naifan Zhuang , Guo-Jun Qi , The Duc Kieu , Kien A. Hua

Network Traffic Matrix (TM) prediction is defined as the problem of estimating future network traffic from the previous and achieved network traffic data. It is widely used in network planning, resource management and network security. Long…

Networking and Internet Architecture · Computer Science 2017-06-12 Abdelhadi Azzouni , Guy Pujolle

Clinical medical data, especially in the intensive care unit (ICU), consist of multivariate time series of observations. For each patient visit (or episode), sensor data and lab test results are recorded in the patient's Electronic Health…

Machine Learning · Computer Science 2017-03-23 Zachary C. Lipton , David C. Kale , Charles Elkan , Randall Wetzel

The recurrent neural network and its variants have shown great success in processing sequences in recent years. However, this deep neural network has not aroused much attention in anomaly detection through predictively process monitoring.…

Machine Learning · Computer Science 2023-09-06 Jiaqi Qiu , Yu Lin , Inez Zwetsloot

Recurrent neural networks like long short-term memory (LSTM) are important architectures for sequential prediction tasks. LSTMs (and RNNs in general) model sequences along the forward time direction. Bidirectional LSTMs (Bi-LSTMs) on the…

Machine Learning · Statistics 2017-11-16 Samira Shabanian , Devansh Arpit , Adam Trischler , Yoshua Bengio

Data-driven approaches to automated machine condition monitoring are gaining popularity due to advancements made in sensing technologies and computing algorithms. This paper proposes the use of a deep learning model, based on Long…

Signal Processing · Electrical Eng. & Systems 2019-07-30 Jianlei Zhang , Binil Starly

Long Short-Term Memory (LSTM) is a prominent recurrent neural network for extracting dependencies from sequential data such as time-series and multi-view data, having achieved impressive results for different visual recognition tasks. A…

Computer Vision and Pattern Recognition · Computer Science 2020-06-03 Alireza Sepas-Moghaddam , Ali Etemad , Fernando Pereira , Paulo Lobato Correia

Traditional Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM) units operate on discrete time steps, often failing to capture the fluid temporal dynamics of real-world physical processes. Liquid Neural Networks (LNNs),…

Machine Learning · Computer Science 2026-05-28 Ye Kyaw Thu , Thazin Myint Oo , Thepchai Supnithi

Long short-term memory (LSTM) and recurrent neural network (RNN) has achieved great successes on time-series prediction. In this paper, a methodology of using LSTM-based deep-RNN for two-phase flow regime prediction is proposed, motivated…

Computer Vision and Pattern Recognition · Computer Science 2020-10-07 Zhuoran Dang , Mamoru Ishii

Transformers have become the dominant architecture for sequence modeling by using self-attention to enable expressive and highly parallel processing. However, the resulting quadratic time and memory costs limit efficiency in long-context…

Machine Learning · Computer Science 2026-05-19 Tristan Gaudreault , Yongyi Mao

Deep-learning models such as Convolutional Neural Networks (CNN) and Long Short-Term Memory (LSTM) have been successfully used for process-mining tasks. They have achieved better performance for different predictive tasks than traditional…

Machine Learning · Computer Science 2021-05-04 Ishwar Venugopal , Jessica Töllich , Michael Fairbank , Ansgar Scherp
‹ Prev 1 2 3 10 Next ›