Related papers: Gradient Projection Memory for Continual Learning

Continual Learning with Scaled Gradient Projection

In neural networks, continual learning results in gradient interference among sequential tasks, leading to catastrophic forgetting of old tasks while learning new ones. This issue is addressed in recent methods by storing the important…

Machine Learning · Computer Science 2023-02-06 Gobinda Saha , Kaushik Roy

Orthogonal Gradient Descent for Continual Learning

Neural networks are achieving state of the art and sometimes super-human performance on learning tasks across a variety of domains. Whenever these problems require learning in a continual or sequential manner, however, neural networks…

Machine Learning · Computer Science 2019-10-17 Mehrdad Farajtabar , Navid Azizan , Alex Mott , Ang Li

Layerwise Optimization by Gradient Decomposition for Continual Learning

Deep neural networks achieve state-of-the-art and sometimes super-human performance across various domains. However, when learning tasks sequentially, the networks easily forget the knowledge of previous tasks, known as "catastrophic…

Computer Vision and Pattern Recognition · Computer Science 2021-05-18 Shixiang Tang , Dapeng Chen , Jinguo Zhu , Shijie Yu , Wanli Ouyang

On the Theory of Continual Learning with Gradient Descent for Neural Networks

Continual learning, the ability of a model to adapt to an ongoing sequence of tasks without forgetting earlier ones, is a central goal of artificial intelligence. To better understand its underlying mechanisms, we study the limitations of…

Machine Learning · Statistics 2026-04-21 Hossein Taheri , Avishek Ghosh , Arya Mazumdar

TRGP: Trust Region Gradient Projection for Continual Learning

Catastrophic forgetting is one of the major challenges in continual learning. To address this issue, some existing methods put restrictive constraints on the optimization space of the new task for minimizing the interference to old tasks.…

Machine Learning · Computer Science 2022-02-08 Sen Lin , Li Yang , Deliang Fan , Junshan Zhang

Class Gradient Projection For Continual Learning

Catastrophic forgetting is one of the most critical challenges in Continual Learning (CL). Recent approaches tackle this problem by projecting the gradient update orthogonal to the gradient subspace of existing tasks. While the results are…

Computer Vision and Pattern Recognition · Computer Science 2023-11-28 Cheng Chen , Ji Zhang , Jingkuan Song , Lianli Gao

Sparsity and Heterogeneous Dropout for Continual Learning in the Null Space of Neural Activations

Continual/lifelong learning from a non-stationary input data stream is a cornerstone of intelligence. Despite their phenomenal performance in a wide variety of applications, deep neural networks are prone to forgetting their previously…

Machine Learning · Computer Science 2022-07-11 Ali Abbasi , Parsa Nooralinejad , Vladimir Braverman , Hamed Pirsiavash , Soheil Kolouri

Learn to Unlearn for Deep Neural Networks: Minimizing Unlearning Interference with Gradient Projection

Recent data-privacy laws have sparked interest in machine unlearning, which involves removing the effect of specific training samples from a learnt model as if they were never present in the original training dataset. The challenge of…

Machine Learning · Computer Science 2023-12-08 Tuan Hoang , Santu Rana , Sunil Gupta , Svetha Venkatesh

Gradient Monitored Reinforcement Learning

This paper presents a novel neural network training approach for faster convergence and better generalization abilities in deep reinforcement learning. Particularly, we focus on the enhancement of training and evaluation performance in…

Machine Learning · Computer Science 2020-05-26 Mohammed Sharafath Abdul Hameed , Gavneet Singh Chadha , Andreas Schwung , Steven X. Ding

A Meta-Learned Neuron model for Continual Learning

Continual learning is the ability to acquire new knowledge without forgetting the previously learned one, assuming no further access to past training data. Neural network approximators trained with gradient descent are known to fail in this…

Machine Learning · Computer Science 2021-11-05 Rodrigue Siry

On the Implicit Adversariality of Catastrophic Forgetting in Deep Continual Learning

Continual learning seeks the human-like ability to accumulate new skills in machine intelligence. Its central challenge is catastrophic forgetting, whose underlying cause has not been fully understood for deep networks. In this paper, we…

Machine Learning · Computer Science 2025-10-13 Ze Peng , Jian Zhang , Jintao Guo , Lei Qi , Yang Gao , Yinghuan Shi

Mitigating Forgetting in Continual Learning with Selective Gradient Projection

As neural networks are increasingly deployed in dynamic environments, they face the challenge of catastrophic forgetting, the tendency to overwrite previously learned knowledge when adapting to new tasks, resulting in severe performance…

Machine Learning · Computer Science 2026-03-31 Anika Singh , Aayush Dhaulakhandi , Varun Chopade , Likhith Malipati , David Martinez , Kevin Zhu

Facilitating Bayesian Continual Learning by Natural Gradients and Stein Gradients

Continual learning aims to enable machine learning models to learn a general solution space for past and future tasks in a sequential manner. Conventional models tend to forget the knowledge of previous tasks while learning a new task, a…

Machine Learning · Computer Science 2019-04-25 Yu Chen , Tom Diethe , Neil Lawrence

Flattening Sharpness for Dynamic Gradient Projection Memory Benefits Continual Learning

The backpropagation networks are notably susceptible to catastrophic forgetting, where networks tend to forget previously learned skills upon learning new ones. To address such the 'sensitivity-stability' dilemma, most previous efforts have…

Machine Learning · Computer Science 2021-10-12 Danruo Deng , Guangyong Chen , Jianye Hao , Qiong Wang , Pheng-Ann Heng

Gradient Methods with Memory for Minimizing Composite Functions

The recently introduced Gradient Methods with Memory use a subset of the past oracle information to create an accurate model of the objective function that enables them to surpass the Gradient Method in practical performance. The model…

Optimization and Control · Mathematics 2024-01-30 Mihai I. Florea

Gradient Descent based Optimization Algorithms for Deep Learning Models Training

In this paper, we aim at providing an introduction to the gradient descent based optimization algorithms for learning deep neural network models. Deep learning models involving multiple nonlinear projection layers are very challenging to…

Machine Learning · Computer Science 2019-03-12 Jiawei Zhang

Gradient-free Continual Learning

Continual learning (CL) presents a fundamental challenge in training neural networks on sequential tasks without experiencing catastrophic forgetting. Traditionally, the dominant approach in CL has been gradient-based optimization, where…

Machine Learning · Computer Science 2025-04-03 Grzegorz Rypeść

Class-Prototype Conditional Diffusion Model with Gradient Projection for Continual Learning

Mitigating catastrophic forgetting is a key hurdle in continual learning. Deep Generative Replay (GR) provides techniques focused on generating samples from prior tasks to enhance the model's memory capabilities using generative AI models…

Machine Learning · Computer Science 2024-03-25 Khanh Doan , Quyen Tran , Tung Lam Tran , Tuan Nguyen , Dinh Phung , Trung Le

Learning Longer Memory in Recurrent Neural Networks

Recurrent neural network is a powerful model that learns temporal patterns in sequential data. For a long time, it was believed that recurrent networks are difficult to train using simple optimizers, such as stochastic gradient descent, due…

Neural and Evolutionary Computing · Computer Science 2015-04-20 Tomas Mikolov , Armand Joulin , Sumit Chopra , Michael Mathieu , Marc'Aurelio Ranzato

Saliency-Augmented Memory Completion for Continual Learning

Continual Learning is considered a key step toward next-generation Artificial Intelligence. Among various methods, replay-based approaches that maintain and replay a small episodic memory of previous samples are one of the most successful…

Machine Learning · Computer Science 2022-12-27 Guangji Bai , Chen Ling , Yuyang Gao , Liang Zhao