Related papers: Maelstrom Networks

A continual learning survey: Defying forgetting in classification tasks

Artificial neural networks thrive in solving the classification problem for a particular rigid task, acquiring knowledge through generalized learning behaviour from a distinct training phase. The resulting network resembles a static entity…

Computer Vision and Pattern Recognition · Computer Science 2021-04-19 Matthias De Lange , Rahaf Aljundi , Marc Masana , Sarah Parisot , Xu Jia , Ales Leonardis , Gregory Slabaugh , Tinne Tuytelaars

Tensor-Train Recurrent Neural Networks for Video Classification

The Recurrent Neural Networks and their variants have shown promising performances in sequence modeling tasks such as Natural Language Processing. These models, however, turn out to be impractical and difficult to train when exposed to very…

Computer Vision and Pattern Recognition · Computer Science 2017-07-07 Yinchong Yang , Denis Krompass , Volker Tresp

Deep Generative Dual Memory Network for Continual Learning

Despite advances in deep learning, neural networks can only learn multiple tasks when trained on them jointly. When tasks arrive sequentially, they lose performance on previously learnt tasks. This phenomenon called catastrophic forgetting…

Machine Learning · Computer Science 2018-05-29 Nitin Kamra , Umang Gupta , Yan Liu

Memory and Information Processing in Recurrent Neural Networks

Recurrent neural networks (RNN) are simple dynamical systems whose computational power has been attributed to their short-term memory. Short-term memory of RNNs has been previously studied analytically only for the case of orthogonal…

Neural and Evolutionary Computing · Computer Science 2016-04-26 Alireza Goudarzi , Sarah Marzen , Peter Banda , Guy Feldman , Christof Teuscher , Darko Stefanovic

Memoria: Resolving Fateful Forgetting Problem through Human-Inspired Memory Architecture

Making neural networks remember over the long term has been a longstanding issue. Although several external memory techniques have been introduced, most focus on retaining recent information in the short term. Regardless of its importance,…

Machine Learning · Computer Science 2024-07-19 Sangjun Park , JinYeong Bak

ResNets Are Deeper Than You Think

Residual connections remain ubiquitous in modern neural network architectures nearly a decade after their introduction. Their widespread adoption is often credited to their dramatically improved trainability: residual networks train faster,…

Machine Learning · Computer Science 2025-06-18 Christian H. X. Ali Mehmeti-Göpel , Michael Wand

Continual Learning with Self-Organizing Maps

Despite remarkable successes achieved by modern neural networks in a wide range of applications, these networks perform best in domain-specific stationary environments where they are trained only once on large-scale controlled data…

Neural and Evolutionary Computing · Computer Science 2019-04-23 Pouya Bashivan , Martin Schrimpf , Robert Ajemian , Irina Rish , Matthew Riemer , Yuhai Tu

Triple Memory Networks: a Brain-Inspired Method for Continual Learning

Continual acquisition of novel experience without interfering previously learned knowledge, i.e. continual learning, is critical for artificial neural networks, but limited by catastrophic forgetting. A neural network adjusts its parameters…

Machine Learning · Computer Science 2022-02-15 Liyuan Wang , Bo Lei , Qian Li , Hang Su , Jun Zhu , Yi Zhong

On the algorithmic construction of deep ReLU networks

It is difficult to describe in mathematical terms what a neural network trained on data represents. On the other hand, there is a growing mathematical understanding of what neural networks are in principle capable of representing.…

Machine Learning · Computer Science 2025-06-25 Daan Huybrechs

A Novel Framework for Recurrent Neural Networks with Enhancing Information Processing and Transmission between Units

This paper proposes a novel framework for recurrent neural networks (RNNs) inspired by the human memory models in the field of cognitive neuroscience to enhance information processing and transmission between adjacent RNNs' units. The…

Neural and Evolutionary Computing · Computer Science 2018-06-05 Xi Chen , Zhihong Deng , Gehui Shen , Ting Huang

Incremental Training of a Recurrent Neural Network Exploiting a Multi-Scale Dynamic Memory

The effectiveness of recurrent neural networks can be largely influenced by their ability to store into their dynamical memory information extracted from input sequences at different frequencies and timescales. Such a feature can be…

Machine Learning · Computer Science 2020-07-01 Antonio Carta , Alessandro Sperduti , Davide Bacciu

Reversible Recurrent Neural Networks

Recurrent neural networks (RNNs) provide state-of-the-art performance in processing sequential data but are memory intensive to train, limiting the flexibility of RNN models which can be trained. Reversible RNNs---RNNs for which the…

Machine Learning · Computer Science 2018-10-26 Matthew MacKay , Paul Vicol , Jimmy Ba , Roger Grosse

Memory Networks

We describe a new class of learning models called memory networks. Memory networks reason with inference components combined with a long-term memory component; they learn how to use these jointly. The long-term memory can be read and…

Artificial Intelligence · Computer Science 2015-12-01 Jason Weston , Sumit Chopra , Antoine Bordes

Rethinking Attention: Exploring Shallow Feed-Forward Neural Networks as an Alternative to Attention Layers in Transformers

This work presents an analysis of the effectiveness of using standard shallow feed-forward networks to mimic the behavior of the attention mechanism in the original Transformer model, a state-of-the-art architecture for sequence-to-sequence…

Computation and Language · Computer Science 2024-02-06 Vukasin Bozic , Danilo Dordevic , Daniele Coppola , Joseph Thommes , Sidak Pal Singh

Influence-aware Memory Architectures for Deep Reinforcement Learning

Due to its perceptual limitations, an agent may have too little information about the state of the environment to act optimally. In such cases, it is important to keep track of the observation history to uncover hidden state. Recent deep…

Machine Learning · Computer Science 2021-02-18 Miguel Suau , Jinke He , Elena Congeduti , Rolf A. N. Starre , Aleksander Czechowski , Frans A. Oliehoek

Survey of reasoning using Neural networks

Reason and inference require process as well as memory skills by humans. Neural networks are able to process tasks like image recognition (better than humans) but in memory aspects are still limited (by attention mechanism, size). Recurrent…

Machine Learning · Computer Science 2017-03-03 Amit Sahu

Forward Thinking: Building and Training Neural Networks One Layer at a Time

We present a general framework for training deep neural networks without backpropagation. This substantially decreases training time and also allows for construction of deep networks with many sorts of learners, including networks whose…

Machine Learning · Statistics 2017-06-09 Chris Hettinger , Tanner Christensen , Ben Ehlert , Jeffrey Humpherys , Tyler Jarvis , Sean Wade

Efficient and robust multi-task learning in the brain with modular latent primitives

Biological agents do not have infinite resources to learn new things. For this reason, a central aspect of human learning is the ability to recycle previously acquired knowledge in a way that allows for faster, less resource-intensive…

Artificial Intelligence · Computer Science 2022-05-27 Christian David Márton , Léo Gagnon , Guillaume Lajoie , Kanaka Rajan

Reframing Neural Networks: Deep Structure in Overcomplete Representations

In comparison to classical shallow representation learning techniques, deep neural networks have achieved superior performance in nearly every application benchmark. But despite their clear empirical advantages, it is still not well…

Machine Learning · Computer Science 2022-01-11 Calvin Murdock , George Cazenavette , Simon Lucey

Meta Continual Learning

Using neural networks in practical settings would benefit from the ability of the networks to learn new tasks throughout their lifetimes without forgetting the previous tasks. This ability is limited in the current deep neural networks by a…

Machine Learning · Computer Science 2018-06-20 Risto Vuorio , Dong-Yeon Cho , Daejoong Kim , Jiwon Kim