Related papers: Maelstrom Networks

Quantifying the vanishing gradient and long distance dependency problem in recursive neural networks and recursive LSTMs

Recursive neural networks (RNN) and their recently proposed extension recursive long short term memory networks (RLSTM) are models that compute representations for sentences, by recursively combining word embeddings according to an…

Artificial Intelligence · Computer Science 2016-03-02 Phong Le, Willem Zuidema

Synthesis of recurrent neural networks for dynamical system simulation

We review several of the most widely used techniques for training recurrent neural networks to approximate dynamical systems, then describe a novel algorithm for this task. The algorithm is based on an earlier theoretical result that…

Neural and Evolutionary Computing · Computer Science 2016-06-09 Adam Trischler, Gabriele MT D'Eleuterio

Hierarchically Structured Task-Agnostic Continual Learning

One notable weakness of current machine learning algorithms is the poor ability of models to solve new problems without forgetting previously acquired knowledge. The Continual Learning paradigm has emerged as a protocol to systematically…

Machine Learning · Computer Science 2022-11-16 Heinke Hihn, Daniel A. Braun

Working Memory Networks: Augmenting Memory Networks with a Relational Reasoning Module

During the last years, there has been a lot of interest in achieving some kind of complex reasoning using deep neural networks. To do that, models like Memory Networks (MemNNs) have combined external memory storages and attention…

Computation and Language · Computer Science 2018-05-25 Juan Pavez, Héctor Allende, Héctor Allende-Cid

Higher Order Recurrent Neural Networks

In this paper, we study novel neural network structures to better model long term dependency in sequential data. We propose to use more memory units to keep track of more preceding states in recurrent neural networks (RNNs), which are all…

Neural and Evolutionary Computing · Computer Science 2016-05-03 Rohollah Soltani, Hui Jiang

Fortuitous Forgetting in Connectionist Networks

Forgetting is often seen as an unwanted characteristic in both human and machine learning. However, we propose that forgetting can in fact be favorable to learning. We introduce "forget-and-relearn" as a powerful paradigm for shaping the…

Machine Learning · Computer Science 2022-02-02 Hattie Zhou, Ankit Vani, Hugo Larochelle, Aaron Courville

Neural Stored-program Memory

Neural networks powered with external memory simulate computer behaviors. These models, which use the memory to store data for a neural controller, can learn algorithms and other complex tasks. In this paper, we introduce a new memory to…

Neural and Evolutionary Computing · Computer Science 2019-12-30 Hung Le, Truyen Tran, Svetha Venkatesh

Increasing Depth of Neural Networks for Life-long Learning

Purpose: We propose a novel method for continual learning based on the increasing depth of neural networks. This work explores whether extending neural network depth may be beneficial in a life-long learning setting. Methods: We propose a…

Machine Learning · Computer Science 2023-05-09 Jędrzej Kozal, Michał Woźniak

Deep Recurrent Neural Networks for Time Series Prediction

Ability of deep networks to extract high level features and of recurrent networks to perform time-series inference have been studied. In view of universality of one hidden layer network at approximating functions under weak constraints, the…

Neural and Evolutionary Computing · Computer Science 2014-12-19 Sharat C. Prasad, Piyush Prasad

Measuring Catastrophic Forgetting in Neural Networks

Deep neural networks are used in many state-of-the-art systems for machine perception. Once a network is trained to do a specific task, e.g., bird classification, it cannot easily be trained to do new tasks, e.g., incrementally learning to…

Artificial Intelligence · Computer Science 2017-11-10 Ronald Kemker, Marc McClure, Angelina Abitino, Tyler Hayes, Christopher Kanan

Teasing Apart Architecture and Initial Weights as Sources of Inductive Bias in Neural Networks

Artificial neural networks can acquire many aspects of human knowledge from data, making them promising as models of human learning. But what those networks can learn depends upon their inductive biases -- the factors other than the data…

Machine Learning · Computer Science 2025-02-28 Gianluca Bencomo, Max Gupta, Ioana Marinescu, R. Thomas McCoy, Thomas L. Griffiths

Robust training of recurrent neural networks to handle missing data for disease progression modeling

Disease progression modeling (DPM) using longitudinal data is a challenging task in machine learning for healthcare that can provide clinicians with better tools for diagnosis and monitoring of disease. Existing DPM algorithms neglect…

Computer Vision and Pattern Recognition · Computer Science 2018-08-17 Mostafa Mehdipour Ghazi, Mads Nielsen, Akshay Pai, M. Jorge Cardoso, Marc Modat, Sebastien Ourselin, Lauge Sørensen

Meta-learnt priors slow down catastrophic forgetting in neural networks

Current training regimes for deep learning usually involve exposure to a single task / dataset at a time. Here we start from the observation that in this context the trained model is not given any knowledge of anything outside its…

Artificial Intelligence · Computer Science 2020-02-11 Giacomo Spigler

Learning continually with representational drift

Deep artificial neural networks famously struggle to learn from non-stationary streams of data. Without dedicated mitigation strategies, continual learning is associated with continuous forgetting of previous tasks and a progressive loss of…

Neurons and Cognition · Quantitative Biology 2025-12-29 Suzanne van der Veldt, Gido M. van de Ven, Sanne Moorman, Guillaume Etter

State-Space Modeling in Long Sequence Processing: A Survey on Recurrence in the Transformer Era

Effectively learning from sequential data is a longstanding goal of Artificial Intelligence, especially in the case of long sequences. From the dawn of Machine Learning, several researchers have pursued algorithms and architectures capable…

Machine Learning · Computer Science 2025-08-19 Matteo Tiezzi, Michele Casoni, Alessandro Betti, Marco Gori, Stefano Melacci

Tree Memory Networks for Modelling Long-term Temporal Dependencies

In the domain of sequence modelling, Recurrent Neural Networks (RNN) have been capable of achieving impressive results in a variety of application areas including visual question answering, part-of-speech tagging and machine translation.…

Machine Learning · Computer Science 2018-05-22 Tharindu Fernando, Simon Denman, Aaron McFadyen, Sridha Sridharan, Clinton Fookes

An Empirical Evaluation of Generic Convolutional and Recurrent Networks for Sequence Modeling

For most deep learning practitioners, sequence modeling is synonymous with recurrent networks. Yet recent results indicate that convolutional architectures can outperform recurrent networks on tasks such as audio synthesis and machine…

Machine Learning · Computer Science 2018-04-20 Shaojie Bai, J. Zico Kolter, Vladlen Koltun

One step back, two steps forward: interference and learning in recurrent neural networks

Artificial neural networks, trained to perform cognitive tasks, have recently been used as models for neural recordings from animals performing these tasks. While some progress has been made in performing such comparisons, the evolution of…

Neurons and Cognition · Quantitative Biology 2019-05-03 Chen Beer, Omri Barak

Learning to Continually Learn with the Bayesian Principle

In the present era of deep learning, continual learning research is mainly focused on mitigating forgetting when training a neural network with stochastic gradient descent on a non-stationary stream of data. On the other hand, in the more…

Machine Learning · Computer Science 2024-05-30 Soochan Lee, Hyeonseong Jeon, Jaehyeon Son, Gunhee Kim

Continual Learning: Tackling Catastrophic Forgetting in Deep Neural Networks with Replay Processes

Humans learn all their life long. They accumulate knowledge from a sequence of learning experiences and remember the essential concepts without forgetting what they have learned previously. Artificial neural networks struggle to learn…

Machine Learning · Computer Science 2020-12-09 Timothée Lesort