Related papers: Neural Network Retraining for Model Serving

Frosting Weights for Better Continual Training

Training a neural network model can be a lifelong learning process and is a computationally intensive one. A severe adverse effect that may occur in deep neural network models is that they can suffer from catastrophic forgetting during…

Machine Learning · Computer Science 2020-01-10 Xiaofeng Zhu , Feng Liu , Goce Trajcevski , Dingding Wang

Meta Continual Learning

Using neural networks in practical settings would benefit from the ability of the networks to learn new tasks throughout their lifetimes without forgetting the previous tasks. This ability is limited in the current deep neural networks by a…

Machine Learning · Computer Science 2018-06-20 Risto Vuorio , Dong-Yeon Cho , Daejoong Kim , Jiwon Kim

Catastrophic Forgetting in the Context of Model Updates

A large obstacle to deploying deep learning models in practice is the process of updating models post-deployment (ideally, frequently). Deep neural networks can cost many thousands of dollars to train. When new data comes in the pipeline,…

Machine Learning · Computer Science 2023-06-21 Rich Harang , Hillary Sanders

Towards Training Recurrent Neural Networks for Lifelong Learning

Catastrophic forgetting and capacity saturation are the central challenges of any parametric lifelong learning system. In this work, we study these challenges in the context of sequential supervised learning with an emphasis on recurrent…

Machine Learning · Computer Science 2019-09-10 Shagun Sodhani , Sarath Chandar , Yoshua Bengio

Efficient Rehearsal Free Zero Forgetting Continual Learning using Adaptive Weight Modulation

Artificial neural networks encounter a notable challenge known as continual learning, which involves acquiring knowledge of multiple tasks over an extended period. This challenge arises due to the tendency of previously learned weights to…

Computer Vision and Pattern Recognition · Computer Science 2023-11-28 Yonatan Sverdlov , Shimon Ullman

An Empirical Investigation of the Role of Pre-training in Lifelong Learning

The lifelong learning paradigm in machine learning is an attractive alternative to the more prominent isolated learning scheme not only due to its resemblance to biological learning but also its potential to reduce energy waste by obviating…

Machine Learning · Computer Science 2023-08-30 Sanket Vaibhav Mehta , Darshan Patil , Sarath Chandar , Emma Strubell

Continual Learning with Invertible Generative Models

Catastrophic forgetting (CF) happens whenever a neural network overwrites past knowledge while being trained on new tasks. Common techniques to handle CF include regularization of the weights (using, e.g., their importance on past tasks),…

Machine Learning · Computer Science 2022-12-29 Jary Pomponi , Simone Scardapane , Aurelio Uncini

Measuring Catastrophic Forgetting in Neural Networks

Deep neural networks are used in many state-of-the-art systems for machine perception. Once a network is trained to do a specific task, e.g., bird classification, it cannot easily be trained to do new tasks, e.g., incrementally learning to…

Artificial Intelligence · Computer Science 2017-11-10 Ronald Kemker , Marc McClure , Angelina Abitino , Tyler Hayes , Christopher Kanan

Memory Efficient Continual Learning with Transformers

In many real-world scenarios, data to train machine learning models becomes available over time. Unfortunately, these models struggle to continually learn new concepts without forgetting what has been learnt in the past. This phenomenon is…

Computation and Language · Computer Science 2023-01-16 Beyza Ermis , Giovanni Zappella , Martin Wistuba , Aditya Rawal , Cedric Archambeau

Pseudo-Rehearsal for Continual Learning with Normalizing Flows

Catastrophic forgetting (CF) happens whenever a neural network overwrites past knowledge while being trained on new tasks. Common techniques to handle CF include regularization of the weights (using, e.g., their importance on past tasks),…

Machine Learning · Statistics 2021-08-06 Jary Pomponi , Simone Scardapane , Aurelio Uncini

Deep Generative Dual Memory Network for Continual Learning

Despite advances in deep learning, neural networks can only learn multiple tasks when trained on them jointly. When tasks arrive sequentially, they lose performance on previously learnt tasks. This phenomenon called catastrophic forgetting…

Machine Learning · Computer Science 2018-05-29 Nitin Kamra , Umang Gupta , Yan Liu

Negotiated Representations to Prevent Forgetting in Machine Learning Applications

Catastrophic forgetting is a significant challenge in the field of machine learning, particularly in neural networks. When a neural network learns to perform well on a new task, it often forgets its previously acquired knowledge or…

Machine Learning · Computer Science 2023-12-04 Nuri Korhan , Ceren Öner

REMIND Your Neural Network to Prevent Catastrophic Forgetting

People learn throughout life. However, incrementally updating conventional neural networks leads to catastrophic forgetting. A common remedy is replay, which is inspired by how the brain consolidates memory. Replay involves fine-tuning a…

Machine Learning · Computer Science 2020-07-14 Tyler L. Hayes , Kushal Kafle , Robik Shrestha , Manoj Acharya , Christopher Kanan

Learning to Remember from a Multi-Task Teacher

Recent studies on catastrophic forgetting during sequential learning typically focus on fixing the accuracy of the predictions for a previously learned task. In this paper we argue that the outputs of neural networks are subject to rapid…

Machine Learning · Computer Science 2020-02-14 Yuwen Xiong , Mengye Ren , Raquel Urtasun

Continual Reinforcement Learning with Complex Synapses

Unlike humans, who are capable of continual learning over their lifetimes, artificial neural networks have long been known to suffer from a phenomenon known as catastrophic forgetting, whereby new learning can lead to abrupt erasure of…

Artificial Intelligence · Computer Science 2018-06-20 Christos Kaplanis , Murray Shanahan , Claudia Clopath

Continual Learning Using Bayesian Neural Networks

Continual learning models allow to learn and adapt to new changes and tasks over time. However, in continual and sequential learning scenarios in which the models are trained using different data with various distributions, neural networks…

Machine Learning · Computer Science 2020-08-17 HongLin Li , Payam Barnaghi , Shirin Enshaeifar , Frieder Ganz

A Comprehensive Study of Class Incremental Learning Algorithms for Visual Tasks

The ability of artificial agents to increment their capabilities when confronted with new data is an open challenge in artificial intelligence. The main challenge faced in such cases is catastrophic forgetting, i.e., the tendency of neural…

Machine Learning · Computer Science 2020-12-16 Eden Belouadah , Adrian Popescu , Ioannis Kanellos

Overcoming catastrophic forgetting in neural networks

The ability to learn tasks in a sequential fashion is crucial to the development of artificial intelligence. Neural networks are not, in general, capable of this and it has been widely thought that catastrophic forgetting is an inevitable…

Machine Learning · Computer Science 2022-06-08 James Kirkpatrick , Razvan Pascanu , Neil Rabinowitz , Joel Veness , Guillaume Desjardins , Andrei A. Rusu , Kieran Milan , John Quan , Tiago Ramalho , Agnieszka Grabska-Barwinska , Demis Hassabis , Claudia Clopath , Dharshan Kumaran , Raia Hadsell

Weight Factorization and Centralization for Continual Learning in Speech Recognition

Modern neural network based speech recognition models are required to continually absorb new data without re-training the whole system, especially in downstream applications using foundation models, having no access to the original training…

Computation and Language · Computer Science 2025-06-23 Enes Yavuz Ugan , Ngoc-Quan Pham , Alexander Waibel

Adaptive Compression-based Lifelong Learning

The problem of a deep learning model losing performance on a previously learned task when fine-tuned to a new one is a phenomenon known as Catastrophic forgetting. There are two major ways to mitigate this problem: either preserving…

Computer Vision and Pattern Recognition · Computer Science 2019-07-24 Shivangi Srivastava , Maxim Berman , Matthew B. Blaschko , Devis Tuia