Related papers: Maelstrom Networks

Feed-forward approximations to dynamic recurrent network architectures

Recurrent neural network architectures can have useful computational properties, with complex temporal dynamics and input-sensitive attractor states. However, evaluation of recurrent dynamic architectures requires solution of systems of…

Neural and Evolutionary Computing · Computer Science 2019-11-18 Dylan Richard Muir

Learning by Active Forgetting for Neural Networks

Remembering and forgetting mechanisms are two sides of the same coin in a human learning-memory system. Inspired by human brain memory mechanisms, modern machine learning systems have been working to endow machine with lifelong learning…

Machine Learning · Computer Science 2021-11-23 Jian Peng, Xian Sun, Min Deng, Chao Tao, Bo Tang, Wenbo Li, Guohua Wu, QingZhu, Yu Liu, Tao Lin, Haifeng Li

Neural Networks and Continuous Time

The fields of neural computation and artificial neural networks have developed much in the last decades. Most of the works in these fields focus on implementing and/or learning discrete functions or behavior. However, technical, physical,…

Neural and Evolutionary Computing · Computer Science 2016-06-15 Frieder Stolzenburg, Florian Ruh

Neural Network Retraining for Model Serving

We propose incremental (re)training of a neural network model to cope with a continuous flow of new data in inference during model serving. As such, this is a life-long learning process. We address two challenges of life-long retraining:…

Machine Learning · Computer Science 2020-04-30 Diego Klabjan, Xiaofeng Zhu

Concept Learning through Deep Reinforcement Learning with Memory-Augmented Neural Networks

Deep neural networks have shown superior performance in many regimes to remember familiar patterns with large amounts of data. However, the standard supervised deep learning paradigm is still limited when facing the need to learn new…

Machine Learning · Computer Science 2018-11-16 Jing Shi, Jiaming Xu, Yiqun Yao, Bo Xu

Memory-based Parameter Adaptation

Deep neural networks have excelled on a wide range of problems, from vision to language and game playing. Neural networks very gradually incorporate information into weights as they process data, requiring very low learning rates. If the…

Machine Learning · Statistics 2018-03-01 Pablo Sprechmann, Siddhant M. Jayakumar, Jack W. Rae, Alexander Pritzel, Adrià Puigdomènech Badia, Benigno Uria, Oriol Vinyals, Demis Hassabis, Razvan Pascanu, Charles Blundell

Beneficial Perturbation Network for designing general adaptive artificial intelligence systems

The human brain is the gold standard of adaptive learning. It not only can learn and benefit from experience, but also can adapt to new situations. In contrast, deep neural networks only learn one sophisticated but fixed mapping from inputs…

Computer Vision and Pattern Recognition · Computer Science 2021-03-18 Shixian Wen, Amanda Rios, Yunhao Ge, Laurent Itti

A Survey of Retentive Network

Retentive Network (RetNet) represents a significant advancement in neural network architecture, offering an efficient alternative to the Transformer. While Transformers rely on self-attention to model dependencies, they suffer from high…

Computation and Language · Computer Science 2025-06-10 Haiqi Yang, Zhiyuan Li, Yi Chang, Yuan Wu

Progressive Neural Networks

Learning to solve complex sequences of tasks--while both leveraging transfer and avoiding catastrophic forgetting--remains a key obstacle to achieving human-level intelligence. The progressive networks approach represents a step forward in…

Machine Learning · Computer Science 2022-10-25 Andrei A. Rusu, Neil C. Rabinowitz, Guillaume Desjardins, Hubert Soyer, James Kirkpatrick, Koray Kavukcuoglu, Razvan Pascanu, Raia Hadsell

Metalearned Neural Memory

We augment recurrent neural networks with an external memory mechanism that builds upon recent progress in metalearning. We conceptualize this memory as a rapidly adaptable function that we parameterize as a deep neural network. Reading…

Neural and Evolutionary Computing · Computer Science 2019-12-04 Tsendsuren Munkhdalai, Alessandro Sordoni, Tong Wang, Adam Trischler

Anti-Retroactive Interference for Lifelong Learning

Humans can continuously learn new knowledge. However, machine learning models suffer from drastic dropping in performance on previous tasks after learning new tasks. Cognitive science points out that the competition of similar knowledge is…

Computer Vision and Pattern Recognition · Computer Science 2022-11-01 Runqi Wang, Yuxiang Bao, Baochang Zhang, Jianzhuang Liu, Wentao Zhu, Guodong Guo

The Multiple Subnetwork Hypothesis: Enabling Multidomain Learning by Isolating Task-Specific Subnetworks in Feedforward Neural Networks

Neural networks have seen an explosion of usage and research in the past decade, particularly within the domains of computer vision and natural language processing. However, only recently have advancements in neural networks yielded…

Machine Learning · Computer Science 2022-07-20 Jacob Renn, Ian Sotnek, Benjamin Harvey, Brian Caffo

On the Long-Term Memory of Deep Recurrent Networks

A key attribute that drives the unprecedented success of modern Recurrent Neural Networks (RNNs) on learning tasks which involve sequential data, is their ability to model intricate long-term temporal dependencies. However, a well…

Machine Learning · Computer Science 2018-06-07 Yoav Levine, Or Sharir, Alon Ziv, Amnon Shashua

Recurrent Network Models Of Sequence Generation And Memory

Sequential activation of neurons is a common feature of network activity during a variety of behaviors, including working memory and decision making. Previous network models for sequences and memory emphasized specialized architectures in…

Neurons and Cognition · Quantitative Biology 2016-03-16 Kanaka Rajan, Christopher D Harvey, David W Tank

Memory Efficient Continual Learning with Transformers

In many real-world scenarios, data to train machine learning models becomes available over time. Unfortunately, these models struggle to continually learn new concepts without forgetting what has been learnt in the past. This phenomenon is…

Computation and Language · Computer Science 2023-01-16 Beyza Ermis, Giovanni Zappella, Martin Wistuba, Aditya Rawal, Cedric Archambeau

Learning to Continually Learn

Continual lifelong learning requires an agent or model to learn many sequentially ordered tasks, building on previous knowledge without catastrophically forgetting it. Much work has gone towards preventing the default tendency of machine…

Machine Learning · Computer Science 2020-03-05 Shawn Beaulieu, Lapo Frati, Thomas Miconi, Joel Lehman, Kenneth O. Stanley, Jeff Clune, Nick Cheney

AntisymmetricRNN: A Dynamical System View on Recurrent Neural Networks

Recurrent neural networks have gained widespread use in modeling sequential data. Learning long-term dependencies using these models remains difficult though, due to exploding or vanishing gradients. In this paper, we draw connections…

Machine Learning · Statistics 2019-02-27 Bo Chang, Minmin Chen, Eldad Haber, Ed H. Chi

All-In-One: Artificial Association Neural Networks

Most deep learning models are limited to specific datasets or tasks because of network structures using fixed layers. In this paper, we discuss the differences between existing neural networks and real human neurons, propose association…

Artificial Intelligence · Computer Science 2023-01-31 Seokjun Kim, Jaeeun Jang, Hyeoncheol Kim

Continual learning with hypernetworks

Artificial neural networks suffer from catastrophic forgetting when they are sequentially trained on multiple tasks. To overcome this problem, we present a novel approach based on task-conditioned hypernetworks, i.e., networks that generate…

Machine Learning · Computer Science 2022-04-12 Johannes von Oswald, Christian Henning, Benjamin F. Grewe, João Sacramento

Multigrid Neural Memory

We introduce a novel approach to endowing neural networks with emergent, long-term, large-scale memory. Distinct from strategies that connect neural networks to external memory banks via intricately crafted controllers and hand-designed…

Machine Learning · Computer Science 2020-08-18 Tri Huynh, Michael Maire, Matthew R. Walter