Related papers: A Basic Recurrent Neural Network Model

200 papers

Circuits of biological neurons, such as in the functional parts of the brain, can be modeled as networks of coupled oscillators. Inspired by the ability of these systems to express a rich set of outputs while keeping (gradients of) state…

Machine Learning · Computer Science 2021-03-16 T. Konstantin Rusch, Siddhartha Mishra

The design of recurrent neural networks (RNNs) to accurately process sequential inputs with long-time dependencies is very challenging on account of the exploding and vanishing gradient problem. To overcome this, we propose a novel RNN…

Machine Learning · Computer Science 2021-08-19 T. Konstantin Rusch, Siddhartha Mishra

Recurrent neural networks (RNNs) have recently demonstrated strong performance and faster inference than Transformers at comparable parameter budgets. However, the recursive gradient computation with the backpropagation through time (or…

Machine Learning · Computer Science 2025-04-01 Paul Caillon, Erwan Fagnou, Alexandre Allauzen

Several variants of recurrent neural networks (RNNs) with orthogonal or unitary recurrent matrices have recently been developed to mitigate the vanishing/exploding gradient problem and to model long-term dependencies of sequences. However,…

Machine Learning · Computer Science 2019-11-20 Kyle Helfrich, Qiang Ye

Recursive Neural Networks (RvNNs), which compose sequences according to their underlying hierarchical syntactic structure, have performed well in several natural language processing tasks compared to similar models without structural…

Computation and Language · Computer Science 2021-06-14 Jishnu Ray Chowdhury, Cornelia Caragea

Time series prediction is essential for human activities in diverse areas. A common approach to this task is to harness Recurrent Neural Networks (RNNs). However, while their predictions are quite accurate, their learning process is complex…

Quantum Physics · Physics 2025-05-30 Michał Siemaszko, Adam Buraczewski, Bertrand Le Saux, Magdalena Stobińska

The recurrent neural network is a powerful model that learns temporal patterns in sequential data. For a long time, it was believed that recurrent networks are difficult to train using simple optimizers, such as stochastic gradient descent, due…

Neural and Evolutionary Computing · Computer Science 2015-04-20 Tomas Mikolov, Armand Joulin, Sumit Chopra, Michael Mathieu, Marc'Aurelio Ranzato

We introduce the "NoBackTrack" algorithm to train the parameters of dynamical systems such as recurrent neural networks. This algorithm works in an online, memoryless setting, thus requiring no backpropagation through time, and is scalable,…

Neural and Evolutionary Computing · Computer Science 2015-11-24 Yann Ollivier, Corentin Tallec, Guillaume Charpiat

We introduce a deep residual recurrent neural network (DR-RNN) as an efficient model reduction technique for nonlinear dynamical systems. The developed DR-RNN is inspired by the iterative steps of line search methods in finding the residual…

Computational Engineering, Finance, and Science · Computer Science 2017-09-05 J. Nagoor Kani, Ahmed H. Elsheikh

The vanishing (and exploding) gradients effect is a common problem for recurrent neural networks with nonlinear activation functions that use the backpropagation method to calculate derivatives. Deep feedforward neural networks with many…

Neural and Evolutionary Computing · Computer Science 2017-02-15 Artem Chernodub, Dimitri Nowicki

A core technology that has emerged from the artificial intelligence revolution is the recurrent neural network (RNN). Its unique sequence-based architecture provides a tractable likelihood estimate with stable training paradigms, a…

Disordered Systems and Neural Networks · Physics 2020-07-01 Mohamed Hibat-Allah, Martin Ganahl, Lauren E. Hayward, Roger G. Melko, Juan Carrasquilla

We study the learning ability of linear recurrent neural networks with Gradient Descent. We prove the first theoretical guarantee on linear RNNs to learn any stable linear dynamic system using a large class of loss functions. For an…

Machine Learning · Computer Science 2023-10-24 Lifu Wang, Tianyu Wang, Shengwei Yi, Bo Shen, Bo Hu, Xing Cao

This paper proposes a Fast Graph Convolutional Neural Network (FGRNN) architecture to predict sequences with an underlying graph structure. The proposed architecture addresses the limitations of the standard recurrent neural network (RNN),…

Signal Processing · Electrical Eng. & Systems 2020-01-28 Sai Kiran Kadambari, Sundeep Prabhakar Chepuri

Autoregressive and recurrent networks have achieved remarkable progress across various fields, from weather forecasting to molecular generation and Large Language Models. Despite their strong predictive capabilities, these models lack a…

Machine Learning · Computer Science 2025-07-22 Dario Coscia, Max Welling, Nicola Demo, Gianluigi Rozza

Recurrent neural networks (RNNs) notoriously struggle to learn long-term memories, primarily due to vanishing and exploding gradients. The recent success of state-space models (SSMs), a subclass of RNNs, to overcome such difficulties…

Machine Learning · Computer Science 2024-11-06 Nicolas Zucchet, Antonio Orvieto

In this work we explore a straightforward variational Bayes scheme for Recurrent Neural Networks. Firstly, we show that a simple adaptation of truncated backpropagation through time can yield good quality uncertainty estimates and superior…

Machine Learning · Computer Science 2019-05-13 Meire Fortunato, Charles Blundell, Oriol Vinyals

Linear recurrent neural networks, such as State Space Models (SSMs) and Linear Recurrent Units (LRUs), have recently shown state-of-the-art performance on long sequence modelling benchmarks. Despite their success, their empirical…

Machine Learning · Computer Science 2024-10-08 Kai Biegun, Rares Dolga, Jake Cunningham, David Barber

Recurrent neural networks are the foundation of many sequence-to-sequence models in machine learning, such as machine translation and speech synthesis. In contrast, applied quantum computing is in its infancy. Nevertheless there already…

Machine Learning · Computer Science 2020-10-01 Johannes Bausch

Recurrent neural networks (RNNs) have shown promising performance for language modeling. However, traditional training of RNNs using back-propagation through time often suffers from overfitting. One reason for this is that stochastic…

Computation and Language · Computer Science 2017-04-25 Zhe Gan, Chunyuan Li, Changyou Chen, Yunchen Pu, Qinliang Su, Lawrence Carin

In this work, we present the Grounded Recurrent Neural Network (GRNN), a recurrent neural network architecture for multi-label prediction which explicitly ties labels to specific dimensions of the recurrent hidden state (we call this…

Machine Learning · Statistics 2017-05-25 Ankit Vani, Yacine Jernite, David Sontag