Related papers: Sparse Distributed Memory is a Continual Learner

Learning to Learn Neural Networks

Meta-learning consists in learning learning algorithms. We use a Long Short Term Memory (LSTM) based network to learn to compute on-line updates of the parameters of another neural network. These parameters are stored in the cell state of…

Machine Learning · Computer Science 2016-10-20 Tom Bosc

A Meta-Learned Neuron model for Continual Learning

Continual learning is the ability to acquire new knowledge without forgetting the previously learned one, assuming no further access to past training data. Neural network approximators trained with gradient descent are known to fail in this…

Machine Learning · Computer Science 2021-11-05 Rodrigue Siry

Distillation-Guided Structural Transfer for Continual Learning Beyond Sparse Distributed Memory

Sparse neural systems are gaining traction for efficient continual learning due to their modularity and low interference. Architectures such as Sparse Distributed Memory Multi-Layer Perceptrons (SDMLP) construct task-specific subnetworks…

Machine Learning · Computer Science 2025-12-18 Huiyan Xue, Xuming Ran, Yaxin Li, Qi Xu, Enhui Li, Yi Xu, Qiang Zhang

Sparse evolutionary Deep Learning with over one million artificial neurons on commodity hardware

Artificial Neural Networks (ANNs) have emerged as hot topics in the research community. Despite the success of ANNs, it is challenging to train and deploy modern ANNs on commodity hardware due to the ever-increasing model size and the…

Neural and Evolutionary Computing · Computer Science 2021-01-19 Shiwei Liu, Decebal Constantin Mocanu, Amarsagar Reddy Ramapuram Matavalam, Yulong Pei, Mykola Pechenizkiy

Use it or Lose it: Selective Memory and Forgetting in a Perpetual Learning Machine

In a recent article we described a new type of deep neural network - a Perpetual Learning Machine (PLM) - which is capable of learning 'on the fly' like a brain by existing in a state of Perpetual Stochastic Gradient Descent (PSGD). Here,…

Machine Learning · Computer Science 2015-09-30 Andrew J. R. Simpson

Sparse Meta Networks for Sequential Adaptation and its Application to Adaptive Language Modelling

Training a deep neural network requires a large amount of single-task data and involves a long time-consuming optimization phase. This is not scalable to complex, realistic environments with new unexpected changes. Humans can perform fast…

Neural and Evolutionary Computing · Computer Science 2020-09-04 Tsendsuren Munkhdalai

Memorized Sparse Backpropagation

Neural network learning is usually time-consuming since backpropagation needs to compute full gradients and backpropagate them across multiple layers. Despite its success of existing works in accelerating propagation through sparseness, the…

Machine Learning · Computer Science 2020-10-28 Zhiyuan Zhang, Pengcheng Yang, Xuancheng Ren, Qi Su, Xu Sun

Embarrassingly Parallel Independent Training of Multi-Layer Perceptrons with Heterogeneous Architectures

The definition of a Neural Network architecture is one of the most critical and challenging tasks to perform. In this paper, we propose ParallelMLPs. ParallelMLPs is a procedure to enable the training of several independent Multilayer…

Machine Learning · Computer Science 2022-06-20 Felipe Costa Farias, Teresa Bernarda Ludermir, Carmelo Jose Albanez Bastos-Filho

Transfer Learning with Sparse Associative Memories

In this paper, we introduce a novel layer designed to be used as the output of pre-trained neural networks in the context of classification. Based on Associative Memories, this layer can help design Deep Neural Networks which support…

Machine Learning · Computer Science 2019-09-20 Quentin Jodelet, Vincent Gripon, Masafumi Hagiwara

Learning Neural Networks with Sparse Activations

A core component present in many successful neural network architectures, is an MLP block of two fully connected layers with a non-linear activation in between. An intriguing phenomenon observed empirically, including in transformer…

Machine Learning · Computer Science 2024-06-27 Pranjal Awasthi, Nishanth Dikkala, Pritish Kamath, Raghu Meka

Scaling Memory-Augmented Neural Networks with Sparse Reads and Writes

Neural networks augmented with external memory have the ability to learn algorithmic solutions to complex tasks. These models appear promising for applications such as language modeling and machine translation. However, they scale poorly in…

Machine Learning · Computer Science 2016-10-31 Jack W Rae, Jonathan J Hunt, Tim Harley, Ivo Danihelka, Andrew Senior, Greg Wayne, Alex Graves, Timothy P Lillicrap

Sparse Coding in a Dual Memory System for Lifelong Learning

Efficient continual learning in humans is enabled by a rich set of neurophysiological mechanisms and interactions between multiple memory systems. The brain efficiently encodes information in non-overlapping sparse codes, which facilitates…

Neural and Evolutionary Computing · Computer Science 2023-01-13 Fahad Sarfraz, Elahe Arani, Bahram Zonooz

Continual Learning Through Synaptic Intelligence

While deep learning has led to remarkable advances across diverse applications, it struggles in domains where the data distribution changes over the course of learning. In stark contrast, biological neural networks continually adapt to…

Machine Learning · Computer Science 2017-06-14 Friedemann Zenke, Ben Poole, Surya Ganguli

Sparse Training Theory for Scalable and Efficient Agents

A fundamental task for artificial intelligence is learning. Deep Neural Networks have proven to cope perfectly with all learning paradigms, i.e. supervised, unsupervised, and reinforcement learning. Nevertheless, traditional deep learning…

Artificial Intelligence · Computer Science 2021-03-03 Decebal Constantin Mocanu, Elena Mocanu, Tiago Pinto, Selima Curci, Phuong H. Nguyen, Madeleine Gibescu, Damien Ernst, Zita A. Vale

Memory Bounds for Continual Learning

Continual learning, or lifelong learning, is a formidable current challenge to machine learning. It requires the learner to solve a sequence of $k$ different learning tasks, one after the other, while retaining its aptitude for earlier…

Machine Learning · Computer Science 2022-04-25 Xi Chen, Christos Papadimitriou, Binghui Peng

Learning Bayesian Sparse Networks with Full Experience Replay for Continual Learning

Continual Learning (CL) methods aim to enable machine learning models to learn new tasks without catastrophic forgetting of those that have been previously mastered. Existing CL approaches often keep a buffer of previously-seen samples,…

Machine Learning · Computer Science 2022-02-22 Dong Gong, Qingsen Yan, Yuhang Liu, Anton van den Hengel, Javen Qinfeng Shi

Semiparametric Language Models Are Scalable Continual Learners

Semiparametric language models (LMs) have shown promise in continuously learning from new text data by combining a parameterized neural LM with a growable non-parametric memory for memorizing new content. However, conventional…

Computation and Language · Computer Science 2023-03-03 Guangyue Peng, Tao Ge, Si-Qing Chen, Furu Wei, Houfeng Wang

Separability is not the best goal for machine learning

Neural networks use their hidden layers to transform input data into linearly separable data clusters, with a linear or a perceptron type output layer making the final projection on the line perpendicular to the discriminating hyperplane.…

Machine Learning · Computer Science 2018-07-10 Wlodzislaw Duch

Efficient Distributed Learning with Sparsity

We propose a novel, efficient approach for distributed sparse learning in high-dimensions, where observations are randomly partitioned across machines. Computationally, at each round our method only requires the master machine to solve a…

Machine Learning · Statistics 2016-05-26 Jialei Wang, Mladen Kolar, Nathan Srebro, Tong Zhang

Adaptive Reorganization of Neural Pathways for Continual Learning with Spiking Neural Networks

The human brain can self-organize rich and diverse sparse neural pathways to incrementally master hundreds of cognitive tasks. However, most existing continual learning algorithms for deep artificial and spiking neural networks are unable…

Neural and Evolutionary Computing · Computer Science 2026-05-06 Bing Han, Feifei Zhao, Wenxuan Pan, Zhuoya Zhao, Xianqi Li, Qingqun Kong, Yi Zeng