Related papers: Mitigating Forgetting in Continual Learning with S…

Continual Learning with Scaled Gradient Projection

In neural networks, continual learning results in gradient interference among sequential tasks, leading to catastrophic forgetting of old tasks while learning new ones. This issue is addressed in recent methods by storing the important…

Machine Learning · Computer Science 2023-02-06 Gobinda Saha , Kaushik Roy

Continual Learning with Recursive Gradient Optimization

Learning multiple tasks sequentially without forgetting previous knowledge, called Continual Learning(CL), remains a long-standing challenge for neural networks. Most existing methods rely on additional network capacity or data replay. In…

Machine Learning · Computer Science 2022-02-01 Hao Liu , Huaping Liu

Continuous Subspace Optimization for Continual Learning

Continual learning aims to learn multiple tasks sequentially while preserving prior knowledge, but faces the challenge of catastrophic forgetting when adapting to new tasks. Recently, approaches leveraging pre-trained models have gained…

Computer Vision and Pattern Recognition · Computer Science 2025-11-12 Quan Cheng , Yuanyu Wan , Lingyu Wu , Chenping Hou , Lijun Zhang

Gradient-free Continual Learning

Continual learning (CL) presents a fundamental challenge in training neural networks on sequential tasks without experiencing catastrophic forgetting. Traditionally, the dominant approach in CL has been gradient-based optimization, where…

Machine Learning · Computer Science 2025-04-03 Grzegorz Rypeść

Forward-Only Continual Learning

Catastrophic forgetting remains a central challenge in continual learning (CL) with pre-trained models. While existing approaches typically freeze the backbone and fine-tune a small number of parameters to mitigate forgetting, they still…

Machine Learning · Computer Science 2025-09-03 Jiao Chen , Jiayi He , Fangfang Chen , Zuohong Lv , Jianhua Tang

Gradient Projection Memory for Continual Learning

The ability to learn continually without forgetting the past tasks is a desired attribute for artificial learning systems. Existing approaches to enable such learning in artificial neural networks usually rely on network growth, importance…

Machine Learning · Computer Science 2021-03-18 Gobinda Saha , Isha Garg , Kaushik Roy

Orthogonal Gradient Descent for Continual Learning

Neural networks are achieving state of the art and sometimes super-human performance on learning tasks across a variety of domains. Whenever these problems require learning in a continual or sequential manner, however, neural networks…

Machine Learning · Computer Science 2019-10-17 Mehrdad Farajtabar , Navid Azizan , Alex Mott , Ang Li

Soup to go: mitigating forgetting during continual learning with model averaging

In continual learning, where task data arrives in a sequence, fine-tuning on later tasks will often lead to performance degradation on earlier tasks. This is especially pronounced when these tasks come from diverse domains. In this setting,…

Machine Learning · Computer Science 2025-01-13 Anat Kleiman , Gintare Karolina Dziugaite , Jonathan Frankle , Sham Kakade , Mansheej Paul

FAMO: Fast Adaptive Multitask Optimization

One of the grand enduring goals of AI is to create generalist agents that can learn multiple different tasks from diverse data via multitask learning (MTL). However, in practice, applying gradient descent (GD) on the average loss across all…

Machine Learning · Computer Science 2023-10-31 Bo Liu , Yihao Feng , Peter Stone , Qiang Liu

Learning to optimize by multi-gradient for multi-objective optimization

The development of artificial intelligence (AI) for science has led to the emergence of learning-based research paradigms, necessitating a compelling reevaluation of the design of multi-objective optimization (MOO) methods. The new…

Machine Learning · Computer Science 2023-11-02 Linxi Yang , Xinmin Yang , Liping Tang

An Effective Dynamic Gradient Calibration Method for Continual Learning

Continual learning (CL) is a fundamental topic in machine learning, where the goal is to train a model with continuously incoming data and tasks. Due to the memory limit, we cannot store all the historical data, and therefore confront the…

Machine Learning · Computer Science 2024-07-31 Weichen Lin , Jiaxiang Chen , Ruomin Huang , Hu Ding

Restricted Orthogonal Gradient Projection for Continual Learning

Continual learning aims to avoid catastrophic forgetting and effectively leverage learned experiences to master new knowledge. Existing gradient projection approaches impose hard constraints on the optimization space for new tasks to…

Machine Learning · Computer Science 2023-01-31 Zeyuan Yang , Zonghan Yang , Peng Li , Yang Liu

Sequencing to Mitigate Catastrophic Forgetting in Continual Learning

To cope with real-world dynamics, an intelligent system needs to incrementally acquire, update, and exploit knowledge throughout its lifetime. This ability, known as Continual learning, provides a foundation for AI systems to develop…

Machine Learning · Computer Science 2025-12-19 Hesham G. Moussa , Aroosa Hameed , Arashmid Akhavain

Enhancing Fractional Gradient Descent with Learned Optimizers

Fractional Gradient Descent (FGD) offers a novel and promising way to accelerate optimization by incorporating fractional calculus into machine learning. Although FGD has shown encouraging initial results across various optimization tasks,…

Machine Learning · Computer Science 2025-10-22 Jan Sobotka , Petr Šimánek , Pavel Kordík

A Contrastive Symmetric Forward-Forward Algorithm (SFFA) for Continual Learning Tasks

The so-called Forward-Forward Algorithm (FFA) has recently gained momentum as an alternative to the conventional back-propagation algorithm for neural network learning, yielding competitive performance across various modeling tasks. By…

Machine Learning · Computer Science 2025-01-10 Erik B. Terres-Escudero , Javier Del Ser , Pablo Garcia Bringas

Sparsity and Heterogeneous Dropout for Continual Learning in the Null Space of Neural Activations

Continual/lifelong learning from a non-stationary input data stream is a cornerstone of intelligence. Despite their phenomenal performance in a wide variety of applications, deep neural networks are prone to forgetting their previously…

Machine Learning · Computer Science 2022-07-11 Ali Abbasi , Parsa Nooralinejad , Vladimir Braverman , Hamed Pirsiavash , Soheil Kolouri

On the Convergence of Continual Learning with Adaptive Methods

One of the objectives of continual learning is to prevent catastrophic forgetting in learning multiple tasks sequentially, and the existing solutions have been driven by the conceptualization of the plasticity-stability dilemma. However,…

Machine Learning · Computer Science 2024-04-16 Seungyub Han , Yeongmo Kim , Taehyun Cho , Jungwoo Lee

Multimodal Continual Instruction Tuning with Dynamic Gradient Guidance

Multimodal continual instruction tuning enables multimodal large language models to sequentially adapt to new tasks while building upon previously acquired knowledge. However, this continual learning paradigm faces the significant challenge…

Computer Vision and Pattern Recognition · Computer Science 2026-03-23 Songze Li , Mingyu Gao , Tonghua Su , Xu-Yao Zhang , Zhongjie Wang

Class Gradient Projection For Continual Learning

Catastrophic forgetting is one of the most critical challenges in Continual Learning (CL). Recent approaches tackle this problem by projecting the gradient update orthogonal to the gradient subspace of existing tasks. While the results are…

Computer Vision and Pattern Recognition · Computer Science 2023-11-28 Cheng Chen , Ji Zhang , Jingkuan Song , Lianli Gao

ZeroFlow: Overcoming Catastrophic Forgetting is Easier than You Think

Backpropagation provides a generalized configuration for overcoming catastrophic forgetting. Optimizers such as SGD and Adam are commonly used for weight updates in continual learning and continual pre-training. However, access to gradient…

Computer Vision and Pattern Recognition · Computer Science 2025-06-09 Tao Feng , Wei Li , Didi Zhu , Hangjie Yuan , Wendi Zheng , Dan Zhang , Jie Tang