Related papers: Conditional Computation for Continual Learning

Meta Continual Learning

Using neural networks in practical settings would benefit from the ability of the networks to learn new tasks throughout their lifetimes without forgetting the previous tasks. This ability is limited in the current deep neural networks by a…

Machine Learning · Computer Science 2018-06-20 Risto Vuorio, Dong-Yeon Cho, Daejoong Kim, Jiwon Kim

Neural Networks Remember More: The Power of Parameter Isolation and Combination

Catastrophic forgetting is a pervasive issue for pre-trained language models (PLMs) during continual learning, where models lose previously acquired knowledge when sequentially trained on a series of tasks. The model's ability to retain old…

Computation and Language · Computer Science 2025-02-18 Biqing Zeng, Zehan Li, Aladdin Ayesh

Learning to Learn with Generative Models of Neural Network Checkpoints

We explore a data-driven approach for learning to optimize neural networks. We construct a dataset of neural network checkpoints and train a generative model on the parameters. In particular, our model is a conditional diffusion transformer…

Machine Learning · Computer Science 2022-09-27 William Peebles, Ilija Radosavovic, Tim Brooks, Alexei A. Efros, Jitendra Malik

Towards guarantees for parameter isolation in continual learning

Deep learning has proved to be a successful paradigm for solving many challenges in machine learning. However, deep neural networks fail when trained sequentially on multiple tasks, a shortcoming known as catastrophic forgetting in the…

Machine Learning · Computer Science 2023-10-03 Giulia Lanzillotta, Sidak Pal Singh, Benjamin F. Grewe, Thomas Hofmann

Condensed Composite Memory Continual Learning

Deep Neural Networks (DNNs) suffer from a rapid decrease in performance when trained on a sequence of tasks where only data of the most recent task is available. This phenomenon, known as catastrophic forgetting, prevents DNNs from…

Machine Learning · Computer Science 2021-04-22 Felix Wiewel, Bin Yang

On Generalizing Beyond Domains in Cross-Domain Continual Learning

Humans have the ability to accumulate knowledge of new tasks in varying conditions, but deep neural networks often suffer from catastrophic forgetting of previously learned knowledge after learning a new task. Many recent methods focus on…

Machine Learning · Computer Science 2022-03-09 Christian Simon, Masoud Faraki, Yi-Hsuan Tsai, Xiang Yu, Samuel Schulter, Yumin Suh, Mehrtash Harandi, Manmohan Chandraker

Continual Learning Using Bayesian Neural Networks

Continual learning models allow to learn and adapt to new changes and tasks over time. However, in continual and sequential learning scenarios in which the models are trained using different data with various distributions, neural networks…

Machine Learning · Computer Science 2020-08-17 HongLin Li, Payam Barnaghi, Shirin Enshaeifar, Frieder Ganz

Revised Regularization for Efficient Continual Learning through Correlation-Based Parameter Update in Bayesian Neural Networks

We propose a Bayesian neural network-based continual learning algorithm using Variational Inference, aiming to overcome several drawbacks of existing methods. Specifically, in continual learning scenarios, storing network parameters at each…

Machine Learning · Computer Science 2024-11-22 Sanchar Palit, Biplab Banerjee, Subhasis Chaudhuri

Statistical Mechanical Analysis of Catastrophic Forgetting in Continual Learning with Teacher and Student Networks

When a computational system continuously learns from an ever-changing environment, it rapidly forgets its past experiences. This phenomenon is called catastrophic forgetting. While a line of studies has been proposed with respect to…

Machine Learning · Statistics 2021-09-15 Haruka Asanuma, Shiro Takagi, Yoshihiro Nagano, Yuki Yoshida, Yasuhiko Igarashi, Masato Okada

Order parameters and phase transitions of continual learning in deep neural networks

Continual learning (CL) enables animals to learn new tasks without erasing prior knowledge. CL in artificial neural networks (NNs) is challenging due to catastrophic forgetting, where new learning degrades performance on older tasks. While…

Machine Learning · Computer Science 2025-01-28 Haozhe Shan, Qianyi Li, Haim Sompolinsky

Graph-Based Continual Learning

Despite significant advances, continual learning models still suffer from catastrophic forgetting when exposed to incrementally available data from non-stationary distributions. Rehearsal approaches alleviate the problem by maintaining and…

Machine Learning · Statistics 2021-03-02 Binh Tang, David S. Matteson

Lifelong Neural Predictive Coding: Learning Cumulatively Online without Forgetting

In lifelong learning systems based on artificial neural networks, one of the biggest obstacles is the inability to retain old knowledge as new information is encountered. This phenomenon is known as catastrophic forgetting. In this paper,…

Machine Learning · Computer Science 2022-08-16 Alexander Ororbia, Ankur Mali, Daniel Kifer, C. Lee Giles

Investigating Catastrophic Forgetting During Continual Training for Neural Machine Translation

Neural machine translation (NMT) models usually suffer from catastrophic forgetting during continual training where the models tend to gradually forget previously learned knowledge and swing to fit the newly added data which may have a…

Computation and Language · Computer Science 2020-12-01 Shuhao Gu, Yang Feng

An Empirical Study of Example Forgetting during Deep Neural Network Learning

Inspired by the phenomenon of catastrophic forgetting, we investigate the learning dynamics of neural networks as they train on single classification tasks. Our goal is to understand whether a related phenomenon occurs when data does not…

Machine Learning · Computer Science 2019-11-18 Mariya Toneva, Alessandro Sordoni, Remi Tachet des Combes, Adam Trischler, Yoshua Bengio, Geoffrey J. Gordon

Probing Representation Forgetting in Supervised and Unsupervised Continual Learning

Continual Learning research typically focuses on tackling the phenomenon of catastrophic forgetting in neural networks. Catastrophic forgetting is associated with an abrupt loss of knowledge previously learned by a model when the task, or…

Machine Learning · Computer Science 2022-04-06 MohammadReza Davari, Nader Asadi, Sudhir Mudur, Rahaf Aljundi, Eugene Belilovsky

Negotiated Representations to Prevent Forgetting in Machine Learning Applications

Catastrophic forgetting is a significant challenge in the field of machine learning, particularly in neural networks. When a neural network learns to perform well on a new task, it often forgets its previously acquired knowledge or…

Machine Learning · Computer Science 2023-12-04 Nuri Korhan, Ceren Öner

Predicting the Susceptibility of Examples to Catastrophic Forgetting

Catastrophic forgetting - the tendency of neural networks to forget previously learned data when learning new information - remains a central challenge in continual learning. In this work, we adopt a behavioral approach, observing a…

Machine Learning · Computer Science 2025-07-08 Guy Hacohen, Tinne Tuytelaars

Learning to Continually Learn with the Bayesian Principle

In the present era of deep learning, continual learning research is mainly focused on mitigating forgetting when training a neural network with stochastic gradient descent on a non-stationary stream of data. On the other hand, in the more…

Machine Learning · Computer Science 2024-05-30 Soochan Lee, Hyeonseong Jeon, Jaehyeon Son, Gunhee Kim

Analysis of Overparameterization in Continual Learning under a Linear Model

Autonomous machine learning systems that learn many tasks in sequence are prone to the catastrophic forgetting problem. Mathematical theory is needed in order to understand the extent of forgetting during continual learning. As a…

Machine Learning · Computer Science 2025-02-18 Daniel Goldfarb, Paul Hand

Posterior Meta-Replay for Continual Learning

Learning a sequence of tasks without access to i.i.d. observations is a widely studied form of continual learning (CL) that remains challenging. In principle, Bayesian learning directly applies to this setting, since recursive and one-off…

Machine Learning · Computer Science 2021-10-22 Christian Henning, Maria R. Cervera, Francesco D'Angelo, Johannes von Oswald, Regina Traber, Benjamin Ehret, Seijin Kobayashi, Benjamin F. Grewe, João Sacramento