Related papers: Task Difficulty Aware Parameter Allocation & Regul…

200 papers

In continual learning, the primary challenge is to learn new information without forgetting old knowledge. A common solution addresses this trade-off through regularization, penalizing changes to parameters critical for previous tasks. In…

Machine Learning · Computer Science 2026-04-22 Pourya Shamsolmoali, Masoumeh Zareapoor, Eric Granger, William A. P. Smith, Yue Lu

We propose a Bayesian neural network-based continual learning algorithm using Variational Inference, aiming to overcome several drawbacks of existing methods. Specifically, in continual learning scenarios, storing network parameters at each…

Machine Learning · Computer Science 2024-11-22 Sanchar Palit, Biplab Banerjee, Subhasis Chaudhuri

While recent continual learning methods largely alleviate the catastrophic problem on toy-sized datasets, some issues remain to be tackled to apply them to real-world problem domains. First, a continual learning model should effectively…

Machine Learning · Computer Science 2020-02-18 Jaehong Yoon, Saehoon Kim, Eunho Yang, Sung Ju Hwang

Continual Learning addresses the challenge of learning a number of different tasks sequentially. The goal of maintaining knowledge of earlier tasks without re-accessing them starkly conflicts with standard SGD training for artificial neural…

Machine Learning · Computer Science 2021-02-05 Frederik Benzing

Using neural networks in practical settings would benefit from the ability of the networks to learn new tasks throughout their lifetimes without forgetting the previous tasks. This ability is limited in the current deep neural networks by a…

Machine Learning · Computer Science 2018-06-20 Risto Vuorio, Dong-Yeon Cho, Daejoong Kim, Jiwon Kim

Multi-task learning aims at solving multiple machine learning tasks at the same time. A good solution to a multi-task learning problem should be generalizable in addition to being Pareto optimal. In this paper, we provide some insights on…

Machine Learning · Computer Science 2020-08-14 Yuyan Wang, Zhe Zhao, Bo Dai, Christopher Fifty, Dong Lin, Lichan Hong, Ed H. Chi

Catastrophic forgetting is a pervasive issue for pre-trained language models (PLMs) during continual learning, where models lose previously acquired knowledge when sequentially trained on a series of tasks. The model's ability to retain old…

Computation and Language · Computer Science 2025-02-18 Biqing Zeng, Zehan Li, Aladdin Ayesh

Optimization problems over discrete or quantized variables are very challenging in general due to the combinatorial nature of their search space. Piecewise-affine regularization (PAR) provides a flexible modeling and computational framework…

Machine Learning · Computer Science 2025-08-18 Jianhao Ma, Lin Xiao

Catastrophic forgetting is a challenging issue in continual learning, in which a deep neural network forgets the knowledge acquired from the former task after learning on subsequent tasks. However, existing methods try to find the joint…

Machine Learning · Computer Science 2018-12-06 Jian Peng, Jiang Hao, Zhuo Li, Enqiang Guo, Xiaohong Wan, Deng Min, Qing Zhu, Haifeng Li

Catastrophic forgetting has remained a critical challenge for deep neural networks in Continual Learning (CL) as it undermines consolidated knowledge when learning new tasks. Parameter-efficient fine-tuning CL techniques are gaining…

Machine Learning · Computer Science 2026-01-27 Prashant Shivaram Bhat, Shakib Yazdani, Elahe Arani, Bahram Zonooz

Transfer learning is a useful technique for achieving improved performance and reducing training costs by leveraging the knowledge gained from source tasks and applying it to target tasks. Assessing the effectiveness of transfer learning…

Machine Learning · Computer Science 2023-06-12 Peizhong Ju, Sen Lin, Mark S. Squillante, Yingbin Liang, Ness B. Shroff

Neural networks have achieved remarkable success in many cognitive tasks. However, when they are trained sequentially on multiple tasks without access to old data, their performance on early tasks tends to drop significantly. This problem is…

Machine Learning · Computer Science 2021-02-10 Dong Yin, Mehrdad Farajtabar, Ang Li, Nir Levine, Alex Mott

Continual learning aims to learn a series of tasks sequentially without forgetting the knowledge acquired from the previous ones. In this work, we propose the Hessian Aware Low-Rank Perturbation algorithm for continual learning. By modeling…

Adapting model parameters to incoming streams of data is crucial to the scalability of deep learning. Interestingly, prior continual learning strategies in online settings inadvertently anchor their updated parameters to a local parameter…

Machine Learning · Computer Science 2022-09-30 Siddhartha Datta, Nigel Shadbolt

Catastrophic forgetting in deep neural networks occurs when learning new tasks degrades performance on previously learned tasks due to knowledge overwriting. Among the approaches to mitigate this issue, regularization techniques aim to…

Machine Learning · Computer Science 2025-07-14 Idan Mashiach, Oren Glickman, Tom Tirer

Adapting large pretrained models to new tasks efficiently and continually is crucial for real-world deployment but remains challenging due to catastrophic forgetting and the high cost of retraining. While parameter-efficient tuning methods…

Machine Learning · Computer Science 2026-02-06 Prakhar Kaushik, Ankit Vaidya, Shravan Chaudhari, Rama Chellappa, Alan Yuille

Adapting models pre-trained on large-scale datasets is a proven way to reach strong performance quickly for downstream tasks. However, the growth of state-of-the-art models makes traditional full fine-tuning unsuitable and difficult,…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Maxime Fontana, Michael Spratling, Miaojing Shi

Large Language Models (LLMs) have demonstrated excellent performance in general language understanding, generation and other tasks. However, when fine-tuning for specific domain tasks, the general knowledge accumulated in the pre-training…

Computation and Language · Computer Science 2026-04-21 Weijie Wan, Jiangjiang Zhao

In this paper we propose an approach to avoiding catastrophic forgetting in sequential task learning scenarios. Our technique is based on a network reparameterization that approximately diagonalizes the Fisher Information Matrix of the…

Computer Vision and Pattern Recognition · Computer Science 2018-12-13 Xialei Liu, Marc Masana, Luis Herranz, Joost Van de Weijer, Antonio M. Lopez, Andrew D. Bagdanov

Autonomous machine learning systems that learn many tasks in sequence are prone to the catastrophic forgetting problem. Mathematical theory is needed in order to understand the extent of forgetting during continual learning. As a…

Machine Learning · Computer Science 2025-02-18 Daniel Goldfarb, Paul Hand