Related papers: Collaborative Parameter Learning: Mitigating Forge…

An Effective Dynamic Gradient Calibration Method for Continual Learning

Continual learning (CL) is a fundamental topic in machine learning, where the goal is to train a model with continuously incoming data and tasks. Due to the memory limit, we cannot store all the historical data, and therefore confront the…

Machine Learning · Computer Science 2024-07-31 Weichen Lin, Jiaxiang Chen, Ruomin Huang, Hu Ding

Neural Networks Remember More: The Power of Parameter Isolation and Combination

Catastrophic forgetting is a pervasive issue for pre-trained language models (PLMs) during continual learning, where models lose previously acquired knowledge when sequentially trained on a series of tasks. The model's ability to retain old…

Computation and Language · Computer Science 2025-02-18 Biqing Zeng, Zehan Li, Aladdin Ayesh

Exploring the Impact of Parameter Update Magnitude on Forgetting and Generalization of Continual Learning

The magnitude of parameter updates is considered a key factor in continual learning. However, most existing studies focus on designing diverse update strategies, while a theoretical understanding of the underlying mechanisms remains…

Machine Learning · Computer Science 2026-02-25 JinLi He, Liang Bai, Xian Yang

Slowing Down Forgetting in Continual Learning

A common challenge in continual learning (CL) is catastrophic forgetting, where the performance on old tasks drops after new, additional tasks are learned. In this paper, we propose a novel framework called ReCL to slow down forgetting in…

Machine Learning · Computer Science 2025-03-04 Pascal Janetzky, Tobias Schlagenhauf, Stefan Feuerriegel

Attribution-Guided Continual Learning for Large Language Models

Large language models (LLMs) often suffer from catastrophic forgetting in continual learning: after learning new tasks sequentially, they perform worse on earlier tasks. Existing methods mitigate catastrophic forgetting by data replay,…

Machine Learning · Computer Science 2026-05-08 Yazheng Liu, Yuxuan Wan, Rui Xu, Xi Zhang, Sihong Xie, Hui Xiong

Gradient-free Continual Learning

Continual learning (CL) presents a fundamental challenge in training neural networks on sequential tasks without experiencing catastrophic forgetting. Traditionally, the dominant approach in CL has been gradient-based optimization, where…

Machine Learning · Computer Science 2025-04-03 Grzegorz Rypeść

Order parameters and phase transitions of continual learning in deep neural networks

Continual learning (CL) enables animals to learn new tasks without erasing prior knowledge. CL in artificial neural networks (NNs) is challenging due to catastrophic forgetting, where new learning degrades performance on older tasks. While…

Machine Learning · Computer Science 2025-01-28 Haozhe Shan, Qianyi Li, Haim Sompolinsky

Class Gradient Projection For Continual Learning

Catastrophic forgetting is one of the most critical challenges in Continual Learning (CL). Recent approaches tackle this problem by projecting the gradient update orthogonal to the gradient subspace of existing tasks. While the results are…

Computer Vision and Pattern Recognition · Computer Science 2023-11-28 Cheng Chen, Ji Zhang, Jingkuan Song, Lianli Gao

Overcoming Growth-Induced Forgetting in Task-Agnostic Continual Learning

In continual learning (CL), model growth enhances adaptability to new data. However, when model growth is applied improperly, especially in task-agnostic CL, where the entire grown model is used for inference, it can lead to severe…

Machine Learning · Computer Science 2025-12-23 Yuqing Zhao, Jiannong Cao, Divya Saxena, Xiaoyun Liu, Changlin Song, Bo Yuan, Julie McCann

Train with Perturbation, Infer after Merging: A Two-Stage Framework for Continual Learning

Continual Learning (CL) aims to enable models to continuously acquire new knowledge from a sequence of tasks while avoiding the forgetting of learned information. However, existing CL methods only rely on the parameters of the most recent…

Machine Learning · Computer Science 2025-10-24 Haomiao Qiu, Miao Zhang, Ziyue Qiao, Liqiang Nie

Exploring Forgetting in Large Language Model Pre-Training

Catastrophic forgetting remains a formidable obstacle to building an omniscient model in large language models (LLMs). Despite the pioneering research on task-level forgetting in LLM fine-tuning, there is scant focus on forgetting during…

Computation and Language · Computer Science 2024-10-23 Chonghua Liao, Ruobing Xie, Xingwu Sun, Haowen Sun, Zhanhui Kang

AdaCL: Adaptive Continual Learning

Class-Incremental Learning aims to update a deep classifier to learn new categories while maintaining or improving its accuracy on previously observed classes. Common methods to prevent forgetting previously learned classes include…

Machine Learning · Computer Science 2024-07-02 Elif Ceren Gok Yildirim, Murat Onur Yildirim, Mert Kilickaya, Joaquin Vanschoren

Continual Learning via Sparse Memory Finetuning

Modern language models are powerful, but typically static after deployment. A major obstacle to building models that continually learn over time is catastrophic forgetting, where updating on new data erases previously acquired capabilities.…

Computation and Language · Computer Science 2025-10-20 Jessy Lin, Luke Zettlemoyer, Gargi Ghosh, Wen-Tau Yih, Aram Markosyan, Vincent-Pierre Berges, Barlas Oğuz

Is Parameter Isolation Better for Prompt-Based Continual Learning?

Prompt-based continual learning methods effectively mitigate catastrophic forgetting. However, most existing methods assign a fixed set of prompts to each task, completely isolating knowledge across tasks and resulting in suboptimal…

Machine Learning · Computer Science 2026-01-30 Jiangyang Li, Chenhao Ding, Songlin Dong, Qiang Wang, Jianchao Zhao, Yuhang He, Yihong Gong

Continual Learning via Bit-Level Information Preserving

Continual learning tackles the setting of learning different tasks sequentially. Despite the many previous solutions, most of them still suffer from significant forgetting or expensive memory cost. In this work, targeted at these problems, we…

Machine Learning · Computer Science 2022-03-17 Yujun Shi, Li Yuan, Yunpeng Chen, Jiashi Feng

On the Convergence of Continual Federated Learning Using Incrementally Aggregated Gradients

The holy grail of machine learning is to enable Continual Federated Learning (CFL) to enhance the efficiency, privacy, and scalability of AI systems while learning from streaming data. The primary challenge of a CFL system is to overcome…

Machine Learning · Computer Science 2025-11-11 Satish Kumar Keshri, Nazreen Shah, Ranjitha Prasad

Continual Learning with Scaled Gradient Projection

In neural networks, continual learning results in gradient interference among sequential tasks, leading to catastrophic forgetting of old tasks while learning new ones. This issue is addressed in recent methods by storing the important…

Machine Learning · Computer Science 2023-02-06 Gobinda Saha, Kaushik Roy

Revised Regularization for Efficient Continual Learning through Correlation-Based Parameter Update in Bayesian Neural Networks

We propose a Bayesian neural network-based continual learning algorithm using Variational Inference, aiming to overcome several drawbacks of existing methods. Specifically, in continual learning scenarios, storing network parameters at each…

Machine Learning · Computer Science 2024-11-22 Sanchar Palit, Biplab Banerjee, Subhasis Chaudhuri

RanPAC: Random Projections and Pre-trained Models for Continual Learning

Continual learning (CL) aims to incrementally learn different tasks (such as classification) in a non-stationary data stream without forgetting old ones. Most CL works focus on tackling catastrophic forgetting under a learning-from-scratch…

Machine Learning · Computer Science 2024-01-17 Mark D. McDonnell, Dong Gong, Amin Parvaneh, Ehsan Abbasnejad, Anton van den Hengel

CLoRA: Parameter-Efficient Continual Learning with Low-Rank Adaptation

In the past, continual learning (CL) was mostly concerned with the problem of catastrophic forgetting in neural networks, which arises when incrementally learning a sequence of tasks. Current CL methods function within the confines of…

Machine Learning · Computer Science 2025-07-29 Shishir Muralidhara, Didier Stricker, René Schuster