Related papers: Regularization-Based Efficient Continual Learning …

Exemplar-Free Continual Learning for State Space Models

State-Space Models (SSMs) excel at capturing long-range dependencies with structured recurrence, making them well-suited for sequence modeling. However, their evolving internal states pose challenges in adapting them under Continual…

Machine Learning · Computer Science 2026-05-14 Isaac Ning Lee , Leila Mahmoodi , Trung Le , Mehrtash Harandi

A Unified and General Framework for Continual Learning

Continual Learning (CL) focuses on learning from dynamic and changing data distributions while retaining previously acquired knowledge. Various methods have been developed to address the challenge of catastrophic forgetting, including…

Machine Learning · Computer Science 2024-03-21 Zhenyi Wang , Yan Li , Li Shen , Heng Huang

Bio-Inspired, Task-Free Continual Learning through Activity Regularization

The ability to sequentially learn multiple tasks without forgetting is a key skill of biological brains, whereas it represents a major challenge to the field of deep learning. To avoid catastrophic forgetting, various continual learning…

Neural and Evolutionary Computing · Computer Science 2022-12-09 Francesco Lässig , Pau Vilimelis Aceituno , Martino Sorbaro , Benjamin F. Grewe

Efficient Continual Learning in Neural Networks with Embedding Regularization

Continual learning of deep neural networks is a key requirement for scaling them up to more complex applicative scenarios and for achieving real lifelong learning of these architectures. Previous approaches to the problem have considered…

Machine Learning · Computer Science 2020-06-25 Jary Pomponi , Simone Scardapane , Vincenzo Lomonaco , Aurelio Uncini

Continual Semi-Supervised Learning through Contrastive Interpolation Consistency

Continual Learning (CL) investigates how to train Deep Networks on a stream of tasks without incurring forgetting. CL settings proposed in literature assume that every incoming example is paired with ground-truth annotations. However, this…

Machine Learning · Statistics 2022-08-30 Matteo Boschini , Pietro Buzzega , Lorenzo Bonicelli , Angelo Porrello , Simone Calderara

Deep Learning-based Approaches for State Space Models: A Selective Review

State-space models (SSMs) offer a powerful framework for dynamical system analysis, wherein the temporal dynamics of the system are assumed to be captured through the evolution of the latent states, which govern the values of the…

Machine Learning · Statistics 2024-12-17 Jiahe Lin , George Michailidis

Continual Learning in Large Language Models: Methods, Challenges, and Opportunities

Continual learning (CL) has emerged as a pivotal paradigm to enable large language models (LLMs) to dynamically adapt to evolving knowledge and sequential tasks while mitigating catastrophic forgetting-a critical limitation of the static…

Computation and Language · Computer Science 2026-03-16 Hongyang Chen , Zhongwu Sun , Hongfei Ye , Kunchi Li , Xuemin Lin

Meta Continual Learning via Dynamic Programming

Meta continual learning algorithms seek to train a model when faced with similar tasks observed in a sequential manner. Despite promising methodological advancements, there is a lack of theoretical frameworks that enable analysis of…

Machine Learning · Computer Science 2020-10-12 R. Krishnan , Prasanna Balaprakash

SpaceNet: Make Free Space For Continual Learning

The continual learning (CL) paradigm aims to enable neural networks to learn tasks continually in a sequential fashion. The fundamental challenge in this learning paradigm is catastrophic forgetting previously learned tasks when the model…

Machine Learning · Computer Science 2021-04-15 Ghada Sokar , Decebal Constantin Mocanu , Mykola Pechenizkiy

MS-SSM: A Multi-Scale State Space Model for Efficient Sequence Modeling

State-space models (SSMs) have recently attention as an efficient alternative to computationally expensive attention-based models for sequence modeling. They rely on linear recurrences to integrate information over time, enabling fast…

Machine Learning · Computer Science 2026-01-01 Mahdi Karami , Ali Behrouz , Peilin Zhong , Razvan Pascanu , Vahab Mirrokni

Fine-Tuning Regimes Define Distinct Continual Learning Problems

Continual learning (CL) studies how models acquire tasks sequentially while retaining previously learned knowledge. Despite substantial progress in benchmarking CL methods, comparative evaluations typically keep the fine-tuning regime…

Machine Learning · Computer Science 2026-04-28 Paul-Tiberiu Iordache , Elena Burceanu

Recent Advances of Foundation Language Models-based Continual Learning: A Survey

Recently, foundation language models (LMs) have marked significant achievements in the domains of natural language processing (NLP) and computer vision (CV). Unlike traditional neural network models, foundation LMs obtain a great ability…

Computation and Language · Computer Science 2024-12-02 Yutao Yang , Jie Zhou , Xuanwen Ding , Tianyu Huai , Shunyu Liu , Qin Chen , Yuan Xie , Liang He

Probabilistic Recurrent State-Space Models

State-space models (SSMs) are a highly expressive model class for learning patterns in time series data and for system identification. Deterministic versions of SSMs (e.g. LSTMs) proved extremely successful in modeling complex time series…

Machine Learning · Statistics 2018-02-13 Andreas Doerr , Christian Daniel , Martin Schiegg , Duy Nguyen-Tuong , Stefan Schaal , Marc Toussaint , Sebastian Trimpe

Mamba-CL: Optimizing Selective State Space Model in Null Space for Continual Learning

Continual Learning (CL) aims to equip AI models with the ability to learn a sequence of tasks over time, without forgetting previously learned knowledge. Recently, State Space Models (SSMs), particularly the Mamba model, have achieved…

Computer Vision and Pattern Recognition · Computer Science 2025-07-11 De Cheng , Yue Lu , Lingfeng He , Shizhou Zhang , Xi Yang , Nannan Wang , Xinbo Gao

Exploring Continual Learning of Diffusion Models

Diffusion models have achieved remarkable success in generating high-quality images thanks to their novel training procedures applied to unprecedented amounts of data. However, training a diffusion model from scratch is computationally…

Machine Learning · Computer Science 2023-03-28 Michał Zając , Kamil Deja , Anna Kuzina , Jakub M. Tomczak , Tomasz Trzciński , Florian Shkurti , Piotr Miłoś

Continual Learning Using Bayesian Neural Networks

Continual learning models allow to learn and adapt to new changes and tasks over time. However, in continual and sequential learning scenarios in which the models are trained using different data with various distributions, neural networks…

Machine Learning · Computer Science 2020-08-17 HongLin Li , Payam Barnaghi , Shirin Enshaeifar , Frieder Ganz

LSEBMCL: A Latent Space Energy-Based Model for Continual Learning

Continual learning has become essential in many practical applications such as online news summaries and product classification. The primary challenge is known as catastrophic forgetting, a phenomenon where a model inadvertently discards…

Machine Learning · Computer Science 2025-01-13 Xiaodi Li , Dingcheng Li , Rujun Gao , Mahmoud Zamani , Latifur Khan

Unlocking the Power of Rehearsal in Continual Learning: A Theoretical Perspective

Rehearsal-based methods have shown superior performance in addressing catastrophic forgetting in continual learning (CL) by storing and training on a subset of past data alongside new data in current task. While such a concurrent rehearsal…

Machine Learning · Computer Science 2025-06-03 Junze Deng , Qinhang Wu , Peizhong Ju , Sen Lin , Yingbin Liang , Ness Shroff

An Effective Dynamic Gradient Calibration Method for Continual Learning

Continual learning (CL) is a fundamental topic in machine learning, where the goal is to train a model with continuously incoming data and tasks. Due to the memory limit, we cannot store all the historical data, and therefore confront the…

Machine Learning · Computer Science 2024-07-31 Weichen Lin , Jiaxiang Chen , Ruomin Huang , Hu Ding

Task-Core Memory Management and Consolidation for Long-term Continual Learning

In this paper, we focus on a long-term continual learning (CL) task, where a model learns sequentially from a stream of vast tasks over time, acquiring new knowledge while retaining previously learned information in a manner akin to human…

Machine Learning · Computer Science 2025-05-16 Tianyu Huai , Jie Zhou , Yuxuan Cai , Qin Chen , Wen Wu , Xingjiao Wu , Xipeng Qiu , Liang He