English
Related papers

Related papers: Parallelizing Linear Recurrent Neural Nets Over Se…

200 papers

Recurrent neural networks (RNNs) have shown outstanding performance on processing sequence data. However, they suffer from long training time, which demands parallel implementations of the training procedure. Parallelization of the training…

Neural and Evolutionary Computing · Computer Science 2015-11-25 Kyuyeon Hwang , Wonyong Sung

Recurrent neural networks are a powerful tool for modeling sequential data, but the dependence of each timestep's computation on the previous timestep's output limits parallelism and makes RNNs unwieldy for very long sequences. We introduce…

Neural and Evolutionary Computing · Computer Science 2016-11-22 James Bradbury , Stephen Merity , Caiming Xiong , Richard Socher

Recurrent neural networks have achieved great success in many NLP tasks. However, they have difficulty in parallelization because of the recurrent structure, so it takes much time to train RNNs. In this paper, we introduce sliced recurrent…

Computation and Language · Computer Science 2018-07-09 Zeping Yu , Gongshen Liu

Recurrent Neural Networks (RNNs) laid the foundation for sequence modeling, but their intrinsic sequential nature restricts parallel computation, creating a fundamental barrier to scaling. This has led to the dominance of parallelizable…

Machine Learning · Computer Science 2025-11-04 Federico Danieli , Pau Rodriguez , Miguel Sarabia , Xavier Suau , Luca Zappella

Sequential models, such as Recurrent Neural Networks and Neural Ordinary Differential Equations, have long suffered from slow training due to their inherent sequential nature. For many years this bottleneck has persisted, as many thought…

Machine Learning · Computer Science 2024-01-17 Yi Heng Lim , Qi Zhu , Joshua Selfridge , Muhammad Firmansyah Kasim

Recently, a new recurrent neural network (RNN) named the Legendre Memory Unit (LMU) was proposed and shown to achieve state-of-the-art performance on several benchmark datasets. Here we leverage the linear time-invariant (LTI) memory…

Machine Learning · Computer Science 2021-05-12 Narsimha Chilkuri , Chris Eliasmith

The recurrent network architecture is a widely used model in sequence modeling, but its serial dependency hinders the computation parallelization, which makes the operation inefficient. The same problem was encountered in serial adder at…

Machine Learning · Computer Science 2021-08-25 Haowei Jiang , Feiwei Qin , Jin Cao , Yong Peng , Yanli Shao

This review aims to conduct a comparative analysis of liquid neural networks (LNNs) and traditional recurrent neural networks (RNNs) and their variants, such as long short-term memory networks (LSTMs) and gated recurrent units (GRUs). The…

Machine Learning · Computer Science 2025-10-10 Shilong Zong , Alex Bierly , Almuatazbellah Boker , Hoda Eldardiry

The advantage of recurrent neural networks (RNNs) in learning dependencies between time-series data has distinguished RNNs from other deep learning models. Recently, many advances are proposed in this emerging field. However, there is a…

Neural and Evolutionary Computing · Computer Science 2016-02-16 Hojjat Salehinejad

Recurrent Neural Networks (RNNs) offer fast inference on long sequences but are hard to optimize and slow to train. Deep state-space models (SSMs) have recently been shown to perform remarkably well on long sequence modeling tasks, and have…

Machine Learning · Computer Science 2023-03-14 Antonio Orvieto , Samuel L Smith , Albert Gu , Anushan Fernando , Caglar Gulcehre , Razvan Pascanu , Soham De

Recurrent neural networks (RNNs) are widely used as a memory model for sequence-related problems. Many variants of RNN have been proposed to solve the gradient problems of training RNNs and process long sequences. Although some classical…

Neural and Evolutionary Computing · Computer Science 2020-05-29 Chenpeng Zhang , Shuai Li , Mao Ye , Ce Zhu , Xue Li

As neural network algorithms show high performance in many applications, their efficient inference on mobile and embedded systems are of great interests. When a single stream recurrent neural network (RNN) is executed for a personal user in…

Distributed, Parallel, and Cluster Computing · Computer Science 2018-04-02 Wonyong Sung , Jinhwan Park

Recurrent neural networks (RNN) have been successfully applied to various sequential decision-making tasks, natural language processing applications, and time-series predictions. Such networks are usually trained through back-propagation…

Machine Learning · Computer Science 2019-12-02 Julia El Zini , Yara Rizk , Mariette Awad

Learning with recurrent neural networks (RNNs) on long sequences is a notoriously difficult task. There are three major challenges: 1) complex dependencies, 2) vanishing and exploding gradients, and 3) efficient parallelization. In this…

Artificial Intelligence · Computer Science 2017-11-03 Shiyu Chang , Yang Zhang , Wei Han , Mo Yu , Xiaoxiao Guo , Wei Tan , Xiaodong Cui , Michael Witbrock , Mark Hasegawa-Johnson , Thomas S. Huang

Recurrent neural networks (RNNs) are capable of learning features and long term dependencies from sequential and time-series data. The RNNs have a stack of non-linear units where at least one connection between units forms a directed cycle.…

Neural and Evolutionary Computing · Computer Science 2018-02-26 Hojjat Salehinejad , Sharan Sankar , Joseph Barfett , Errol Colak , Shahrokh Valaee

Recurrent Neural Networks (RNNs) have the ability to retain memory and learn data sequences. Due to the recurrent nature of RNNs, it is sometimes hard to parallelize all its computations on conventional hardware. CPUs do not currently offer…

Neural and Evolutionary Computing · Computer Science 2016-03-07 Andre Xian Ming Chang , Berin Martini , Eugenio Culurciello

Clinical medical data, especially in the intensive care unit (ICU), consist of multivariate time series of observations. For each patient visit (or episode), sensor data and lab test results are recorded in the patient's Electronic Health…

Machine Learning · Computer Science 2017-03-23 Zachary C. Lipton , David C. Kale , Charles Elkan , Randall Wetzel

Over the long history of machine learning, which dates back several decades, recurrent neural networks (RNNs) have been used mainly for sequential data and time series and generally with 1D information. Even in some rare studies on 2D…

Computer Vision and Pattern Recognition · Computer Science 2021-03-05 Nguyen Huu Phong , Bernardete Ribeiro

Countless learning tasks require dealing with sequential data. Image captioning, speech synthesis, and music generation all require that a model produce outputs that are sequences. In other domains, such as time series prediction, video…

Machine Learning · Computer Science 2015-10-20 Zachary C. Lipton , John Berkowitz , Charles Elkan

Recurrent neural networks (RNNs) have been widely used for processing sequential data. However, RNNs are commonly difficult to train due to the well-known gradient vanishing and exploding problems and hard to learn long-term patterns. Long…

Computer Vision and Pattern Recognition · Computer Science 2019-10-15 Shuai Li , Wanqing Li , Chris Cook , Ce Zhu , Yanbo Gao
‹ Prev 1 2 3 10 Next ›