Related papers: The Statistical Recurrent Unit

Deep Gate Recurrent Neural Network

This paper introduces two recurrent neural network structures called Simple Gated Unit (SGU) and Deep Simple Gated Unit (DSGU), which are general structures for learning long term dependencies. Compared to traditional Long Short-Term Memory…

Neural and Evolutionary Computing · Computer Science 2016-05-16 Yuan Gao, Dorota Glowacka

Empirical Evaluation of Gated Recurrent Neural Networks on Sequence Modeling

In this paper we compare different types of recurrent units in recurrent neural networks (RNNs). Especially, we focus on more sophisticated units that implement a gating mechanism, such as a long short-term memory (LSTM) unit and a recently…

Neural and Evolutionary Computing · Computer Science 2014-12-12 Junyoung Chung, Caglar Gulcehre, KyungHyun Cho, Yoshua Bengio

Improving speech recognition by revising gated recurrent units

Speech recognition is largely taking advantage of deep learning, showing that substantial benefits can be obtained by modern Recurrent Neural Networks (RNNs). The most popular RNNs are Long Short-Term Memory (LSTMs), which typically reach…

Computation and Language · Computer Science 2017-10-03 Mirco Ravanelli, Philemon Brakel, Maurizio Omologo, Yoshua Bengio

A comparison of LSTM and GRU networks for learning symbolic sequences

We explore the architecture of recurrent neural networks (RNNs) by studying the complexity of string sequences it is able to memorize. Symbolic sequences of different complexity are generated to simulate RNN training and study parameter…

Machine Learning · Computer Science 2023-11-17 Roberto Cahuantzi, Xinye Chen, Stefan Güttel

Gated recurrent units viewed through the lens of continuous time dynamical systems

Gated recurrent units (GRUs) are specialized memory elements for building recurrent neural networks. Despite their incredible success on various tasks, including extracting dynamics underlying neural data, little is understood about the…

Machine Learning · Computer Science 2021-07-30 Ian D. Jordan, Piotr Aleksander Sokol, Il Memming Park

Can recurrent neural networks warp time?

Successful recurrent models such as long short-term memories (LSTMs) and gated recurrent units (GRUs) use ad hoc gating mechanisms. Empirically these models have been found to improve the learning of medium to long term temporal…

Machine Learning · Computer Science 2018-05-01 Corentin Tallec, Yann Ollivier

Gated Orthogonal Recurrent Units: On Learning to Forget

We present a novel recurrent neural network (RNN) based model that combines the remembering ability of unitary RNNs with the ability of gated RNNs to effectively forget redundant/irrelevant information in its memory. We achieve this by…

Machine Learning · Computer Science 2017-10-26 Li Jing, Caglar Gulcehre, John Peurifoy, Yichen Shen, Max Tegmark, Marin Soljačić, Yoshua Bengio

Memory Visualization for Gated Recurrent Neural Networks in Speech Recognition

Recurrent neural networks (RNNs) have shown clear superiority in sequence modeling, particularly the ones with gated units, such as long short-term memory (LSTM) and gated recurrent unit (GRU). However, the dynamic properties behind the…

Machine Learning · Computer Science 2017-02-28 Zhiyuan Tang, Ying Shi, Dong Wang, Yang Feng, Shiyue Zhang

Minimal Gated Unit for Recurrent Neural Networks

Recently, recurrent neural networks (RNNs) have been very successful in handling sequence data. However, understanding RNNs and finding the best practices for RNNs is a difficult task, partly because there are many competing and complex hidden…

Neural and Evolutionary Computing · Computer Science 2016-04-01 Guo-Bing Zhou, Jianxin Wu, Chen-Lin Zhang, Zhi-Hua Zhou

Gated Recurrent Networks for Seizure Detection

Recurrent Neural Networks (RNNs) with sophisticated units that implement a gating mechanism have emerged as a powerful technique for modeling sequential signals such as speech or electroencephalography (EEG). The latter is the focus of this…

Signal Processing · Electrical Eng. & Systems 2018-01-09 Meysam Golmohammadi, Saeedeh Ziyabari, Vinit Shah, Eva Von Weltin, Christopher Campbell, Iyad Obeid, Joseph Picone

Lattice Recurrent Unit: Improving Convergence and Statistical Efficiency for Sequence Modeling

Recurrent neural networks have shown remarkable success in modeling sequences. However, low-resource situations still adversely affect the generalizability of these models. We introduce a new family of models, called Lattice Recurrent Units…

Machine Learning · Computer Science 2017-11-23 Chaitanya Ahuja, Louis-Philippe Morency

Simple Recurrent Units for Highly Parallelizable Recurrence

Common recurrent neural architectures scale poorly due to the intrinsic difficulty in parallelizing their state computations. In this work, we propose the Simple Recurrent Unit (SRU), a light recurrent unit that balances model capacity and…

Computation and Language · Computer Science 2018-09-10 Tao Lei, Yu Zhang, Sida I. Wang, Hui Dai, Yoav Artzi

SGRU: A High-Performance Structured Gated Recurrent Unit for Traffic Flow Prediction

Traffic flow prediction is an essential task in constructing smart cities and is a typical Multivariate Time Series (MTS) Problem. Recent research has abandoned Gated Recurrent Units (GRU) and utilized dilated convolutions or temporal…

Artificial Intelligence · Computer Science 2024-04-19 Wenfeng Zhang, Xin Li, Anqi Li, Xiaoting Huang, Ti Wang, Honglei Gao

A recurrent neural network without chaos

We introduce an exceptionally simple gated recurrent neural network (RNN) that achieves performance comparable to well-known gated architectures, such as LSTMs and GRUs, on the word-level language modeling task. We prove that our model has…

Neural and Evolutionary Computing · Computer Science 2016-12-20 Thomas Laurent, James von Brecht

Gates Are Not What You Need in RNNs

Recurrent neural networks have flourished in many areas. Consequently, we can see new RNN cells being developed continuously, usually by creating or using gates in a new, original way. But what if we told you that gates in RNNs are…

Machine Learning · Computer Science 2023-11-23 Ronalds Zakovskis, Andis Draguns, Eliza Gaile, Emils Ozolins, Karlis Freivalds

Stabilising and accelerating light gated recurrent units for automatic speech recognition

The light gated recurrent unit (Li-GRU) is well known for achieving impressive results in automatic speech recognition (ASR) tasks while being lighter and faster to train than a standard gated recurrent unit (GRU). However, the unbounded…

Audio and Speech Processing · Electrical Eng. & Systems 2023-02-21 Adel Moumen, Titouan Parcollet

SiTGRU: Single-Tunnelled Gated Recurrent Unit for Abnormality Detection

Abnormality detection is a challenging task due to the dependence on a specific context and the unconstrained variability of practical scenarios. In recent years, it has benefited from the powerful features learnt by deep neural networks,…

Computer Vision and Pattern Recognition · Computer Science 2020-06-29 Habtamu Fanta, Zhiwen Shao, Lizhuang Ma

Gated Recurrent Neural Networks with Weighted Time-Delay Feedback

In this paper, we present a novel approach to modeling long-term dependencies in sequential data by introducing a gated recurrent unit (GRU) with a weighted time-delay feedback mechanism. Our proposed model, named $\tau$-GRU, is a…

Machine Learning · Computer Science 2025-05-21 N. Benjamin Erichson, Soon Hoe Lim, Michael W. Mahoney

Towards Non-saturating Recurrent Units for Modelling Long-term Dependencies

Modelling long-term dependencies is a challenge for recurrent neural networks. This is primarily because gradients vanish during training as the sequence length increases. Gradients can be attenuated by transition operators…

Neural and Evolutionary Computing · Computer Science 2019-02-19 Sarath Chandar, Chinnadhurai Sankar, Eugene Vorontsov, Samira Ebrahimi Kahou, Yoshua Bengio

Contrast-Enhanced Gating in GRUs for Robust Low-Data Sequence Learning

Activation functions govern how recurrent networks regulate and transmit information across temporal dependencies. Despite advances in sequence modelling, gated recurrent units (GRUs) still depend on the standard sigmoid and tanh…

Machine Learning · Computer Science 2026-04-29 Barathi Subramanian, Rathinaraja Jeyaraj, Anand Paul