English
Related papers

Related papers: Efficient Sparse Selective-Update RNNs for Long-Ra…

200 papers

Recurrent Neural Networks (RNNs) continue to show outstanding performance in sequence modeling tasks. However, training RNNs on long sequences often face challenges like slow inference, vanishing gradients and difficulty in capturing long…

Artificial Intelligence · Computer Science 2018-02-06 Victor Campos , Brendan Jou , Xavier Giro-i-Nieto , Jordi Torres , Shih-Fu Chang

Countless learning tasks require dealing with sequential data. Image captioning, speech synthesis, and music generation all require that a model produce outputs that are sequences. In other domains, such as time series prediction, video…

Machine Learning · Computer Science 2015-10-20 Zachary C. Lipton , John Berkowitz , Charles Elkan

Recurrent Neural Networks (RNNs) offer fast inference on long sequences but are hard to optimize and slow to train. Deep state-space models (SSMs) have recently been shown to perform remarkably well on long sequence modeling tasks, and have…

Machine Learning · Computer Science 2023-03-14 Antonio Orvieto , Samuel L Smith , Albert Gu , Anushan Fernando , Caglar Gulcehre , Razvan Pascanu , Soham De

Recurrent neural networks (RNNs) are omnipresent in sequence modeling tasks. Practical models usually consist of several layers of hundreds or thousands of neurons which are fully connected. This places a heavy computational and memory…

Machine Learning · Computer Science 2019-05-30 Matthijs Van Keirsbilck , Alexander Keller , Xiaodong Yang

Recurrent neural networks (RNNs) have been used extensively and with increasing success to model various types of sequential data. Much of this progress has been achieved through devising recurrent units and architectures with the…

Machine Learning · Statistics 2017-03-06 Yacine Jernite , Edouard Grave , Armand Joulin , Tomas Mikolov

Recurrent Networks are one of the most powerful and promising artificial neural network algorithms to processing the sequential data such as natural languages, sound, time series data. Unlike traditional feed-forward network, Recurrent…

Machine Learning · Computer Science 2018-07-11 Pushparaja Murugan

A longstanding challenge for the Machine Learning community is the one of developing models that are capable of processing and learning from very long sequences of data. The outstanding results of Transformers-based networks (e.g., Large…

Machine Learning · Computer Science 2024-02-15 Matteo Tiezzi , Michele Casoni , Alessandro Betti , Tommaso Guidi , Marco Gori , Stefano Melacci

Recurrent neural networks (RNNs) provide state-of-the-art performance in processing sequential data but are memory intensive to train, limiting the flexibility of RNN models which can be trained. Reversible RNNs---RNNs for which the…

Machine Learning · Computer Science 2018-10-26 Matthew MacKay , Paul Vicol , Jimmy Ba , Roger Grosse

Recurrent neural networks (RNNs) are widely used as a memory model for sequence-related problems. Many variants of RNN have been proposed to solve the gradient problems of training RNNs and process long sequences. Although some classical…

Neural and Evolutionary Computing · Computer Science 2020-05-29 Chenpeng Zhang , Shuai Li , Mao Ye , Ce Zhu , Xue Li

Short-term memory is essential for cognitive processing, yet our understanding of its neural mechanisms remains unclear. Neuroscience has long focused on how sequential activity patterns, where neurons fire one after another within large…

Recurrent Neural Networks (RNNs) are used in state-of-the-art models in domains such as speech recognition, machine translation, and language modelling. Sparsity is a technique to reduce compute and memory requirements of deep learning…

Machine Learning · Computer Science 2017-11-09 Sharan Narang , Eric Undersander , Gregory Diamos

Recurrent neural networks (RNNs) are well suited for solving sequence tasks in resource-constrained systems due to their expressivity and low computational requirements. However, there is still a need to bridge the gap between what RNNs are…

Machine Learning · Computer Science 2023-03-13 Anand Subramoney , Khaleelulla Khan Nazeer , Mark Schöne , Christian Mayr , David Kappel

Recurrent neural networks (RNNs) have drawn interest from machine learning researchers because of their effectiveness at preserving past inputs for time-varying data processing tasks. To understand the success and limitations of RNNs, it is…

Information Theory · Computer Science 2017-01-30 Adam Charles , Dong Yin , Christopher Rozell

Sequence models assign probabilities to variable-length sequences such as natural language texts. The ability of sequence models to capture temporal dependence can be characterized by the temporal scaling of correlation and mutual…

Machine Learning · Computer Science 2019-05-13 Huitao Shen

Recommender systems objectives can be broadly characterized as modeling user preferences over short-or long-term time horizon. A large body of previous research studied long-term recommendation through dimensionality reduction techniques…

Information Retrieval · Computer Science 2018-07-25 Kiewan Villatel , Elena Smirnova , Jérémie Mary , Philippe Preux

In this paper, we study novel neural network structures to better model long term dependency in sequential data. We propose to use more memory units to keep track of more preceding states in recurrent neural networks (RNNs), which are all…

Neural and Evolutionary Computing · Computer Science 2016-05-03 Rohollah Soltani , Hui Jiang

The effectiveness of recurrent neural networks can be largely influenced by their ability to store into their dynamical memory information extracted from input sequences at different frequencies and timescales. Such a feature can be…

Machine Learning · Computer Science 2020-07-01 Antonio Carta , Alessandro Sperduti , Davide Bacciu

Recurrent Neural Networks (RNNs) are powerful tools for solving sequence-based problems, but their efficacy and execution time are dependent on the size of the network. Following recent work in simplifying these networks with model pruning…

Neural and Evolutionary Computing · Computer Science 2018-04-30 Feiwen Zhu , Jeff Pool , Michael Andersch , Jeremy Appleyard , Fung Xie

Recurrent neural networks have achieved great success in many NLP tasks. However, they have difficulty in parallelization because of the recurrent structure, so it takes much time to train RNNs. In this paper, we introduce sliced recurrent…

Computation and Language · Computer Science 2018-07-09 Zeping Yu , Gongshen Liu

Recurrent Neural Networks (RNNs) are popular models of brain function. The typical training strategy is to adjust their input-output behavior so that it matches that of the biological circuit of interest. Even though this strategy ensures…

Neurons and Cognition · Quantitative Biology 2020-11-09 Alessandro Salatiello , Martin A. Giese
‹ Prev 1 2 3 10 Next ›