English
Related papers

Related papers: Partial Hypernetworks for Continual Learning

200 papers

Humans learn continually throughout their lifespan by accumulating diverse knowledge and fine-tuning it for future tasks. When presented with a similar goal, neural networks suffer from catastrophic forgetting if data distributions across…

Machine Learning · Computer Science 2022-09-19 Dupati Srikar Chandra , Sakshi Varshney , P. K. Srijith , Sunil Gupta

Artificial neural networks suffer from catastrophic forgetting when they are sequentially trained on multiple tasks. To overcome this problem, we present a novel approach based on task-conditioned hypernetworks, i.e., networks that generate…

Machine Learning · Computer Science 2022-04-12 Johannes von Oswald , Christian Henning , Benjamin F. Grewe , João Sacramento

We introduce a new training paradigm that enforces interval constraints on neural network parameter space to control forgetting. Contemporary Continual Learning (CL) methods focus on training neural networks efficiently from a stream of…

Artificial neural networks suffer from catastrophic forgetting when they are sequentially trained on multiple tasks. Many continual learning (CL) strategies are trying to overcome this problem. One of the most effective is the…

Machine Learning · Computer Science 2024-05-27 Kamil Książek , Przemysław Spurek

While a diverse collection of continual learning (CL) methods has been proposed to prevent catastrophic forgetting, a thorough investigation of their effectiveness for processing sequential data with recurrent neural networks (RNNs) is…

Continual learning (CL) aims to learn a sequence of tasks without forgetting prior knowledge, but gradient updates for a new task often overwrite the weights learned earlier, causing catastrophic forgetting (CF). We propose…

Machine Learning · Computer Science 2025-05-19 Neil De La Fuente , Maria Pilligua , Daniel Vidal , Albin Soutiff , Cecilia Curreli , Daniel Cremers , Andrey Barsky

Continual Learning (CL) methods aim to enable machine learning models to learn new tasks without catastrophic forgetting of those that have been previously mastered. Existing CL approaches often keep a buffer of previously-seen samples,…

Machine Learning · Computer Science 2022-02-22 Dong Gong , Qingsen Yan , Yuhang Liu , Anton van den Hengel , Javen Qinfeng Shi

Learning a set of tasks over time, also known as continual learning (CL), is one of the most challenging problems in artificial intelligence. While recent approaches achieve some degree of CL in deep neural networks, they either (1) grow…

Machine Learning · Computer Science 2019-07-15 Blake Camp , Jaya Krishna Mandivarapu , Rolando Estrada

This work explores hypernetworks: an approach of using a one network, also known as a hypernetwork, to generate the weights for another network. Hypernetworks provide an abstraction that is similar to what is found in nature: the…

Machine Learning · Computer Science 2016-12-02 David Ha , Andrew Dai , Quoc V. Le

Hypernetworks, or hypernets for short, are neural networks that generate weights for another neural network, known as the target network. They have emerged as a powerful deep learning technique that allows for greater flexibility,…

Machine Learning · Computer Science 2025-01-03 Vinod Kumar Chauhan , Jiandong Zhou , Ping Lu , Soheila Molaei , David A. Clifton

Modern neural network based speech recognition models are required to continually absorb new data without re-training the whole system, especially in downstream applications using foundation models, having no access to the original training…

Computation and Language · Computer Science 2025-06-23 Enes Yavuz Ugan , Ngoc-Quan Pham , Alexander Waibel

The ability of intelligent agents to learn and remember multiple tasks sequentially is crucial to achieving artificial general intelligence. Many continual learning (CL) methods have been proposed to overcome catastrophic forgetting which…

Machine Learning · Computer Science 2020-09-15 Gehui Shen , Song Zhang , Xiang Chen , Zhi-Hong Deng

Training deep neural networks at the edge on light computational devices, embedded systems and robotic platforms is nowadays very challenging. Continual learning techniques, where complex models are incrementally trained on small batches of…

Machine Learning · Computer Science 2020-03-05 Lorenzo Pellegrini , Gabriele Graffieti , Vincenzo Lomonaco , Davide Maltoni

Recent studies on catastrophic forgetting during sequential learning typically focus on fixing the accuracy of the predictions for a previously learned task. In this paper we argue that the outputs of neural networks are subject to rapid…

Machine Learning · Computer Science 2020-02-14 Yuwen Xiong , Mengye Ren , Raquel Urtasun

A common challenge in continual learning (CL) is catastrophic forgetting, where the performance on old tasks drops after new, additional tasks are learned. In this paper, we propose a novel framework called ReCL to slow down forgetting in…

Machine Learning · Computer Science 2025-03-04 Pascal Janetzky , Tobias Schlagenhauf , Stefan Feuerriegel

We focus on the problem of learning without forgetting from multiple tasks arriving sequentially, where each task is defined using a few-shot episode of novel or already seen classes. We approach this problem using the recently published…

Machine Learning · Computer Science 2024-08-20 Max Vladymyrov , Andrey Zhmoginov , Mark Sandler

Humans learn all their life long. They accumulate knowledge from a sequence of learning experiences and remember the essential concepts without forgetting what they have learned previously. Artificial neural networks struggle to learn…

Machine Learning · Computer Science 2020-12-09 Timothée Lesort

Existing Continual Learning (CL) approaches have focused on addressing catastrophic forgetting by leveraging regularization methods, replay buffers, and task-specific components. However, realistic CL solutions must be shaped not only by…

Machine Learning · Computer Science 2023-10-11 Jinyung Hong , Theodore P. Pavlic

The continual learning (CL) paradigm aims to enable neural networks to learn tasks continually in a sequential fashion. The fundamental challenge in this learning paradigm is catastrophic forgetting previously learned tasks when the model…

Machine Learning · Computer Science 2021-04-15 Ghada Sokar , Decebal Constantin Mocanu , Mykola Pechenizkiy

Most Continual Learning (CL) methods maintain performance on earlier tasks by storing exemplars in a replay buffer, introducing memory overhead that scales with the number of tasks and raising privacy concerns in regulated domains. We…

Machine Learning · Computer Science 2026-05-11 Mohammad Ali Vahedifar , Qi Zhang
‹ Prev 1 2 3 10 Next ›