Related papers: Conditional Computation for Continual Learning

200 papers

Using neural networks in practical settings would benefit from the ability of the networks to learn new tasks throughout their lifetimes without forgetting the previous tasks. This ability is limited in the current deep neural networks by a…

Machine Learning · Computer Science 2018-06-20 Risto Vuorio, Dong-Yeon Cho, Daejoong Kim, Jiwon Kim

Catastrophic forgetting is a pervasive issue for pre-trained language models (PLMs) during continual learning, where models lose previously acquired knowledge when sequentially trained on a series of tasks. The model's ability to retain old…

Computation and Language · Computer Science 2025-02-18 Biqing Zeng, Zehan Li, Aladdin Ayesh

We explore a data-driven approach for learning to optimize neural networks. We construct a dataset of neural network checkpoints and train a generative model on the parameters. In particular, our model is a conditional diffusion transformer…

Machine Learning · Computer Science 2022-09-27 William Peebles, Ilija Radosavovic, Tim Brooks, Alexei A. Efros, Jitendra Malik

Deep learning has proved to be a successful paradigm for solving many challenges in machine learning. However, deep neural networks fail when trained sequentially on multiple tasks, a shortcoming known as catastrophic forgetting in the…

Machine Learning · Computer Science 2023-10-03 Giulia Lanzillotta, Sidak Pal Singh, Benjamin F. Grewe, Thomas Hofmann

Deep Neural Networks (DNNs) suffer from a rapid decrease in performance when trained on a sequence of tasks where only data of the most recent task is available. This phenomenon, known as catastrophic forgetting, prevents DNNs from…

Machine Learning · Computer Science 2021-04-22 Felix Wiewel, Bin Yang

Humans have the ability to accumulate knowledge of new tasks in varying conditions, but deep neural networks often suffer from catastrophic forgetting of previously learned knowledge after learning a new task. Many recent methods focus on…

Continual learning models allow to learn and adapt to new changes and tasks over time. However, in continual and sequential learning scenarios in which the models are trained using different data with various distributions, neural networks…

Machine Learning · Computer Science 2020-08-17 HongLin Li, Payam Barnaghi, Shirin Enshaeifar, Frieder Ganz

We propose a Bayesian neural network-based continual learning algorithm using Variational Inference, aiming to overcome several drawbacks of existing methods. Specifically, in continual learning scenarios, storing network parameters at each…

Machine Learning · Computer Science 2024-11-22 Sanchar Palit, Biplab Banerjee, Subhasis Chaudhuri

When a computational system continuously learns from an ever-changing environment, it rapidly forgets its past experiences. This phenomenon is called catastrophic forgetting. While a line of studies has been proposed with respect to…

Machine Learning · Statistics 2021-09-15 Haruka Asanuma, Shiro Takagi, Yoshihiro Nagano, Yuki Yoshida, Yasuhiko Igarashi, Masato Okada

Continual learning (CL) enables animals to learn new tasks without erasing prior knowledge. CL in artificial neural networks (NNs) is challenging due to catastrophic forgetting, where new learning degrades performance on older tasks. While…

Machine Learning · Computer Science 2025-01-28 Haozhe Shan, Qianyi Li, Haim Sompolinsky

Despite significant advances, continual learning models still suffer from catastrophic forgetting when exposed to incrementally available data from non-stationary distributions. Rehearsal approaches alleviate the problem by maintaining and…

Machine Learning · Statistics 2021-03-02 Binh Tang, David S. Matteson

In lifelong learning systems based on artificial neural networks, one of the biggest obstacles is the inability to retain old knowledge as new information is encountered. This phenomenon is known as catastrophic forgetting. In this paper,…

Machine Learning · Computer Science 2022-08-16 Alexander Ororbia, Ankur Mali, Daniel Kifer, C. Lee Giles

Neural machine translation (NMT) models usually suffer from catastrophic forgetting during continual training where the models tend to gradually forget previously learned knowledge and swing to fit the newly added data which may have a…

Computation and Language · Computer Science 2020-12-01 Shuhao Gu, Yang Feng

Inspired by the phenomenon of catastrophic forgetting, we investigate the learning dynamics of neural networks as they train on single classification tasks. Our goal is to understand whether a related phenomenon occurs when data does not…

Continual Learning research typically focuses on tackling the phenomenon of catastrophic forgetting in neural networks. Catastrophic forgetting is associated with an abrupt loss of knowledge previously learned by a model when the task, or…

Machine Learning · Computer Science 2022-04-06 MohammadReza Davari, Nader Asadi, Sudhir Mudur, Rahaf Aljundi, Eugene Belilovsky

Catastrophic forgetting is a significant challenge in the field of machine learning, particularly in neural networks. When a neural network learns to perform well on a new task, it often forgets its previously acquired knowledge or…

Machine Learning · Computer Science 2023-12-04 Nuri Korhan, Ceren Öner

Catastrophic forgetting - the tendency of neural networks to forget previously learned data when learning new information - remains a central challenge in continual learning. In this work, we adopt a behavioral approach, observing a…

Machine Learning · Computer Science 2025-07-08 Guy Hacohen, Tinne Tuytelaars

In the present era of deep learning, continual learning research is mainly focused on mitigating forgetting when training a neural network with stochastic gradient descent on a non-stationary stream of data. On the other hand, in the more…

Machine Learning · Computer Science 2024-05-30 Soochan Lee, Hyeonseong Jeon, Jaehyeon Son, Gunhee Kim

Autonomous machine learning systems that learn many tasks in sequence are prone to the catastrophic forgetting problem. Mathematical theory is needed in order to understand the extent of forgetting during continual learning. As a…

Machine Learning · Computer Science 2025-02-18 Daniel Goldfarb, Paul Hand

Learning a sequence of tasks without access to i.i.d. observations is a widely studied form of continual learning (CL) that remains challenging. In principle, Bayesian learning directly applies to this setting, since recursive and one-off…
