English
Related papers

Related papers: Parallelizing Legendre Memory Unit Training

200 papers

Recurrent neural networks (RNNs) have shown outstanding performance on processing sequence data. However, they suffer from long training time, which demands parallel implementations of the training procedure. Parallelization of the training…

Neural and Evolutionary Computing · Computer Science 2015-11-25 Kyuyeon Hwang , Wonyong Sung

Recurrent neural networks (RNNs) are widely used to model sequential data but their non-linear dependencies between sequence elements prevent parallelizing training over sequence length. We show the training of RNNs with only linear…

Neural and Evolutionary Computing · Computer Science 2018-02-23 Eric Martin , Chris Cundy

With the emergence of massively parallel processing units, parallelization has become a desirable property for new sequence models. The ability to parallelize the processing of sequences with respect to the sequence length during training…

Machine Learning · Computer Science 2026-05-19 Florent De Geeter , Gaspard Lambrechts , Damien Ernst , Guillaume Drion

Transformer models have demonstrated high accuracy in numerous applications but have high complexity and lack sequential processing capability making them ill-suited for many streaming applications at the edge where devices are heavily…

Neural and Evolutionary Computing · Computer Science 2024-02-08 Zeyu Liu , Gourav Datta , Anni Li , Peter Anthony Beerel

This is part II of three-part work. Here, we present a second set of inter-related five variants of simplified Long Short-term Memory (LSTM) recurrent neural networks by further reducing adaptive parameters. Two of these models have been…

Neural and Evolutionary Computing · Computer Science 2017-07-18 Atra Akandeh , Fathi M. Salem

This review aims to conduct a comparative analysis of liquid neural networks (LNNs) and traditional recurrent neural networks (RNNs) and their variants, such as long short-term memory networks (LSTMs) and gated recurrent units (GRUs). The…

Machine Learning · Computer Science 2025-10-10 Shilong Zong , Alex Bierly , Almuatazbellah Boker , Hoda Eldardiry

Recurrent neural networks (RNN) have been successfully applied to various sequential decision-making tasks, natural language processing applications, and time-series predictions. Such networks are usually trained through back-propagation…

Machine Learning · Computer Science 2019-12-02 Julia El Zini , Yara Rizk , Mariette Awad

Recurrent neural networks (RNNs) have represented for years the state of the art in neural machine translation. Recently, new architectures have been proposed, which can leverage parallel computation on GPUs better than classical RNNs.…

Computation and Language · Computer Science 2018-05-14 Mattia Antonino Di Gangi , Marcello Federico

An efficient algorithm for recurrent neural network training is presented. The approach increases the training speed for tasks where a length of the input sequence may vary significantly. The proposed approach is based on the optimal batch…

Machine Learning · Computer Science 2017-08-21 Viacheslav Khomenko , Oleg Shyshkov , Olga Radyvonenko , Kostiantyn Bokhan

In our previous work we have shown that resistive cross point devices, so called Resistive Processing Unit (RPU) devices, can provide significant power and speed benefits when training deep fully connected networks as well as convolutional…

Machine Learning · Computer Science 2023-02-17 Tayfun Gokmen , Malte Rasch , Wilfried Haensch

As neural network algorithms show high performance in many applications, their efficient inference on mobile and embedded systems are of great interests. When a single stream recurrent neural network (RNN) is executed for a personal user in…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-04-02 Wonyong Sung , Jinhwan Park

Recurrent neural networks have achieved great success in many NLP tasks. However, they have difficulty in parallelization because of the recurrent structure, so it takes much time to train RNNs. In this paper, we introduce sliced recurrent…

Computation and Language · Computer Science 2018-07-09 Zeping Yu , Gongshen Liu

We present five variants of the standard Long Short-term Memory (LSTM) recurrent neural networks by uniformly reducing blocks of adaptive parameters in the gating mechanisms. For simplicity, we refer to these models as LSTM1, LSTM2, LSTM3,…

Neural and Evolutionary Computing · Computer Science 2017-07-18 Atra Akandeh , Fathi M. Salem

Recurrent Neural Network (RNN) and its variations such as Long Short-Term Memory (LSTM) and Gated Recurrent Unit (GRU), have become standard building blocks for learning online data of sequential nature in many research areas, including…

Computation and Language · Computer Science 2020-05-12 Enmao Diao , Jie Ding , Vahid Tarokh

Recently, machine learning methods have provided a broad spectrum of original and efficient algorithms based on Deep Neural Networks (DNN) to automatically predict an outcome with respect to a sequence of inputs. Recurrent hidden cells…

Machine Learning · Computer Science 2017-02-15 Mohamed Bouaziz , Mohamed Morchid , Richard Dufour , Georges Linarès , Renato De Mori

Traditional Recurrent Neural Networks (RNNs) and Long Short-Term Memory (LSTM) units operate on discrete time steps, often failing to capture the fluid temporal dynamics of real-world physical processes. Liquid Neural Networks (LNNs),…

Machine Learning · Computer Science 2026-05-28 Ye Kyaw Thu , Thazin Myint Oo , Thepchai Supnithi

We have shown previously that our parameter-reduced variants of Long Short-Term Memory (LSTM) Recurrent Neural Networks (RNN) are comparable in performance to the standard LSTM RNN on the MNIST dataset. In this study, we show that this is…

Neural and Evolutionary Computing · Computer Science 2019-01-23 Atra Akandeh , Fathi M. Salem

Advanced deep learning architectures, particularly recurrent neural networks (RNNs), have been widely applied in audio, bioacoustic, and biomedical signal analysis, especially in data-scarce environments. While gated RNNs remain effective,…

Sound · Computer Science 2025-12-02 Pengfei Sun , Wenyu Jiang , Paul Devos , Dick Botteldooren

Common recurrent neural architectures scale poorly due to the intrinsic difficulty in parallelizing their state computations. In this work, we propose the Simple Recurrent Unit (SRU), a light recurrent unit that balances model capacity and…

Computation and Language · Computer Science 2018-09-10 Tao Lei , Yu Zhang , Sida I. Wang , Hui Dai , Yoav Artzi

Despite the great successes of deep learning, the effectiveness of deep neural networks has not been understood at any theoretical depth. This work is motivated by the thrust of developing a deeper understanding of recurrent neural…

Machine Learning · Computer Science 2018-02-12 Dingkun Long , Richong Zhang , Yongyi Mao
‹ Prev 1 2 3 10 Next ›