Related papers: Task Difficulty Aware Parameter Allocation & Regul…

Task Switching Without Forgetting via Proximal Decoupling

In continual learning, the primary challenge is to learn new information without forgetting old knowledge. A common solution addresses this trade-off through regularization, penalizing changes to parameters critical for previous tasks. In…

Machine Learning · Computer Science 2026-04-22 Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger, William A. P. Smith, Yue Lu

Revised Regularization for Efficient Continual Learning through Correlation-Based Parameter Update in Bayesian Neural Networks

We propose a Bayesian neural network-based continual learning algorithm using Variational Inference, aiming to overcome several drawbacks of existing methods. Specifically, in continual learning scenarios, storing network parameters at each…

Machine Learning · Computer Science 2024-11-22 Sanchar Palit, Biplab Banerjee, Subhasis Chaudhuri

Scalable and Order-robust Continual Learning with Additive Parameter Decomposition

While recent continual learning methods largely alleviate the catastrophic problem on toy-sized datasets, some issues remain to be tackled to apply them to real-world problem domains. First, a continual learning model should effectively…

Machine Learning · Computer Science 2020-02-18 Jaehong Yoon, Saehoon Kim, Eunho Yang, Sung Ju Hwang

Unifying Regularisation Methods for Continual Learning

Continual Learning addresses the challenge of learning a number of different tasks sequentially. The goal of maintaining knowledge of earlier tasks without re-accessing them starkly conflicts with standard SGD training for artificial neural…

Machine Learning · Computer Science 2021-02-05 Frederik Benzing

Meta Continual Learning

Using neural networks in practical settings would benefit from the ability of the networks to learn new tasks throughout their lifetimes without forgetting the previous tasks. This ability is limited in the current deep neural networks by a…

Machine Learning · Computer Science 2018-06-20 Risto Vuorio, Dong-Yeon Cho, Daejoong Kim, Jiwon Kim

Small Towers Make Big Differences

Multi-task learning aims at solving multiple machine learning tasks at the same time. A good solution to a multi-task learning problem should be generalizable in addition to being Pareto optimal. In this paper, we provide some insights on…

Machine Learning · Computer Science 2020-08-14 Yuyan Wang, Zhe Zhao, Bo Dai, Christopher Fifty, Dong Lin, Lichan Hong, Ed H. Chi

Neural Networks Remember More: The Power of Parameter Isolation and Combination

Catastrophic forgetting is a pervasive issue for pre-trained language models (PLMs) during continual learning, where models lose previously acquired knowledge when sequentially trained on a series of tasks. The model's ability to retain old…

Computation and Language · Computer Science 2025-02-18 Biqing Zeng, Zehan Li, Aladdin Ayesh

Quantization through Piecewise-Affine Regularization: Optimization and Statistical Guarantees

Optimization problems over discrete or quantized variables are very challenging in general due to the combinatorial nature of their search space. Piecewise-affine regularization (PAR) provides a flexible modeling and computational framework…

Machine Learning · Computer Science 2025-08-18 Jianhao Ma, Lin Xiao

Overcoming Catastrophic Forgetting by Soft Parameter Pruning

Catastrophic forgetting is a challenging issue in continual learning when a deep neural network forgets the knowledge acquired from the former task after learning on subsequent tasks. However, existing methods try to find the joint…

Machine Learning · Computer Science 2018-12-06 Jian Peng, Jiang Hao, Zhuo Li, Enqiang Guo, Xiaohong Wan, Deng Min, Qing Zhu, Haifeng Li

Parameter Efficient Continual Learning with Dynamic Low-Rank Adaptation

Catastrophic forgetting has remained a critical challenge for deep neural networks in Continual Learning (CL) as it undermines consolidated knowledge when learning new tasks. Parameter efficient fine tuning CL techniques are gaining…

Machine Learning · Computer Science 2026-01-27 Prashant Shivaram Bhat, Shakib Yazdani, Elahe Arani, Bahram Zonooz

Generalization Performance of Transfer Learning: Overparameterized and Underparameterized Regimes

Transfer learning is a useful technique for achieving improved performance and reducing training costs by leveraging the knowledge gained from source tasks and applying it to target tasks. Assessing the effectiveness of transfer learning…

Machine Learning · Computer Science 2023-06-12 Peizhong Ju, Sen Lin, Mark S. Squillante, Yingbin Liang, Ness B. Shroff

Optimization and Generalization of Regularization-Based Continual Learning: a Loss Approximation Viewpoint

Neural networks have achieved remarkable success in many cognitive tasks. However, when they are trained sequentially on multiple tasks without access to old data, their performance on early tasks tends to drop significantly. This problem is…

Machine Learning · Computer Science 2021-02-10 Dong Yin, Mehrdad Farajtabar, Ang Li, Nir Levine, Alex Mott

Hessian Aware Low-Rank Perturbation for Order-Robust Continual Learning

Continual learning aims to learn a series of tasks sequentially without forgetting the knowledge acquired from the previous ones. In this work, we propose the Hessian Aware Low-Rank Perturbation algorithm for continual learning. By modeling…

Machine Learning · Computer Science 2024-09-24 Jiaqi Li, Yuanhao Lai, Rui Wang, Changjian Shui, Sabyasachi Sahoo, Charles X. Ling, Shichun Yang, Boyu Wang, Christian Gagné, Fan Zhou

Multiple Modes for Continual Learning

Adapting model parameters to incoming streams of data is a crucial factor to deep learning scalability. Interestingly, prior continual learning strategies in online settings inadvertently anchor their updated parameters to a local parameter…

Machine Learning · Computer Science 2022-09-30 Siddhartha Datta, Nigel Shadbolt

Catastrophic Forgetting Mitigation Through Plateau Phase Activity Profiling

Catastrophic forgetting in deep neural networks occurs when learning new tasks degrades performance on previously learned tasks due to knowledge overwriting. Among the approaches to mitigate this issue, regularization techniques aim to…

Machine Learning · Computer Science 2025-07-14 Idan Mashiach, Oren Glickman, Tom Tirer

Shared LoRA Subspaces for almost Strict Continual Learning

Adapting large pretrained models to new tasks efficiently and continually is crucial for real-world deployment but remains challenging due to catastrophic forgetting and the high cost of retraining. While parameter-efficient tuning methods…

Machine Learning · Computer Science 2026-02-06 Prakhar Kaushik, Ankit Vaidya, Shravan Chaudhari, Rama Chellappa, Alan Yuille

FAAR: Efficient Frequency-Aware Multi-Task Fine-Tuning via Automatic Rank Selection

Adapting models pre-trained on large-scale datasets is a proven way to reach strong performance quickly for downstream tasks. However, the growth of state-of-the-art models makes traditional full fine-tuning unsuitable and difficult,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Maxime Fontana, Michael Spratling, Miaojing Shi

Efficient Task Adaptation in Large Language Models via Selective Parameter Optimization

Large Language Models (LLMs) have demonstrated excellent performance in general language understanding, generation and other tasks. However, when fine-tuning for specific domain tasks, the general knowledge accumulated in the pre-training…

Computation and Language · Computer Science 2026-04-21 Weijie Wan, Jiangjiang Zhao

Rotate your Networks: Better Weight Consolidation and Less Catastrophic Forgetting

In this paper we propose an approach to avoiding catastrophic forgetting in sequential task learning scenarios. Our technique is based on a network reparameterization that approximately diagonalizes the Fisher Information Matrix of the…

Computer Vision and Pattern Recognition · Computer Science 2018-12-13 Xialei Liu, Marc Masana, Luis Herranz, Joost Van de Weijer, Antonio M. Lopez, Andrew D. Bagdanov

Analysis of Overparameterization in Continual Learning under a Linear Model

Autonomous machine learning systems that learn many tasks in sequence are prone to the catastrophic forgetting problem. Mathematical theory is needed in order to understand the extent of forgetting during continual learning. As a…

Machine Learning · Computer Science 2025-02-18 Daniel Goldfarb, Paul Hand