Related papers: Partial Hypernetworks for Continual Learning

Continual Learning with Dependency Preserving Hypernetworks

Humans learn continually throughout their lifespan by accumulating diverse knowledge and fine-tuning it for future tasks. When presented with a similar goal, neural networks suffer from catastrophic forgetting if data distributions across…

Machine Learning · Computer Science 2022-09-19 Dupati Srikar Chandra , Sakshi Varshney , P. K. Srijith , Sunil Gupta

Continual learning with hypernetworks

Artificial neural networks suffer from catastrophic forgetting when they are sequentially trained on multiple tasks. To overcome this problem, we present a novel approach based on task-conditioned hypernetworks, i.e., networks that generate…

Machine Learning · Computer Science 2022-04-12 Johannes von Oswald , Christian Henning , Benjamin F. Grewe , João Sacramento

Continual Learning with Guarantees via Weight Interval Constraints

We introduce a new training paradigm that enforces interval constraints on neural network parameter space to control forgetting. Contemporary Continual Learning (CL) methods focus on training neural networks efficiently from a stream of…

Machine Learning · Computer Science 2022-06-17 Maciej Wołczyk , Karol J. Piczak , Bartosz Wójcik , Łukasz Pustelnik , Paweł Morawiecki , Jacek Tabor , Tomasz Trzciński , Przemysław Spurek

HyperMask: Adaptive Hypernetwork-based Masks for Continual Learning

Artificial neural networks suffer from catastrophic forgetting when they are sequentially trained on multiple tasks. Many continual learning (CL) strategies are trying to overcome this problem. One of the most effective is the…

Machine Learning · Computer Science 2024-05-27 Kamil Książek , Przemysław Spurek

Continual Learning in Recurrent Neural Networks

While a diverse collection of continual learning (CL) methods has been proposed to prevent catastrophic forgetting, a thorough investigation of their effectiveness for processing sequential data with recurrent neural networks (RNNs) is…

Machine Learning · Computer Science 2021-03-11 Benjamin Ehret , Christian Henning , Maria R. Cervera , Alexander Meulemans , Johannes von Oswald , Benjamin F. Grewe

Prototype Augmented Hypernetworks for Continual Learning

Continual learning (CL) aims to learn a sequence of tasks without forgetting prior knowledge, but gradient updates for a new task often overwrite the weights learned earlier, causing catastrophic forgetting (CF). We propose…

Machine Learning · Computer Science 2025-05-19 Neil De La Fuente , Maria Pilligua , Daniel Vidal , Albin Soutiff , Cecilia Curreli , Daniel Cremers , Andrey Barsky

Learning Bayesian Sparse Networks with Full Experience Replay for Continual Learning

Continual Learning (CL) methods aim to enable machine learning models to learn new tasks without catastrophic forgetting of those that have been previously mastered. Existing CL approaches often keep a buffer of previously-seen samples,…

Machine Learning · Computer Science 2022-02-22 Dong Gong , Qingsen Yan , Yuhang Liu , Anton van den Hengel , Javen Qinfeng Shi

Self-Net: Lifelong Learning via Continual Self-Modeling

Learning a set of tasks over time, also known as continual learning (CL), is one of the most challenging problems in artificial intelligence. While recent approaches achieve some degree of CL in deep neural networks, they either (1) grow…

Machine Learning · Computer Science 2019-07-15 Blake Camp , Jaya Krishna Mandivarapu , Rolando Estrada

HyperNetworks

This work explores hypernetworks: an approach of using a one network, also known as a hypernetwork, to generate the weights for another network. Hypernetworks provide an abstraction that is similar to what is found in nature: the…

Machine Learning · Computer Science 2016-12-02 David Ha , Andrew Dai , Quoc V. Le

A Brief Review of Hypernetworks in Deep Learning

Hypernetworks, or hypernets for short, are neural networks that generate weights for another neural network, known as the target network. They have emerged as a powerful deep learning technique that allows for greater flexibility,…

Machine Learning · Computer Science 2025-01-03 Vinod Kumar Chauhan , Jiandong Zhou , Ping Lu , Soheila Molaei , David A. Clifton

Weight Factorization and Centralization for Continual Learning in Speech Recognition

Modern neural network based speech recognition models are required to continually absorb new data without re-training the whole system, especially in downstream applications using foundation models, having no access to the original training…

Computation and Language · Computer Science 2025-06-23 Enes Yavuz Ugan , Ngoc-Quan Pham , Alexander Waibel

Generative Feature Replay with Orthogonal Weight Modification for Continual Learning

The ability of intelligent agents to learn and remember multiple tasks sequentially is crucial to achieving artificial general intelligence. Many continual learning (CL) methods have been proposed to overcome catastrophic forgetting which…

Machine Learning · Computer Science 2020-09-15 Gehui Shen , Song Zhang , Xiang Chen , Zhi-Hong Deng

Latent Replay for Real-Time Continual Learning

Training deep neural networks at the edge on light computational devices, embedded systems and robotic platforms is nowadays very challenging. Continual learning techniques, where complex models are incrementally trained on small batches of…

Machine Learning · Computer Science 2020-03-05 Lorenzo Pellegrini , Gabriele Graffieti , Vincenzo Lomonaco , Davide Maltoni

Learning to Remember from a Multi-Task Teacher

Recent studies on catastrophic forgetting during sequential learning typically focus on fixing the accuracy of the predictions for a previously learned task. In this paper we argue that the outputs of neural networks are subject to rapid…

Machine Learning · Computer Science 2020-02-14 Yuwen Xiong , Mengye Ren , Raquel Urtasun

Slowing Down Forgetting in Continual Learning

A common challenge in continual learning (CL) is catastrophic forgetting, where the performance on old tasks drops after new, additional tasks are learned. In this paper, we propose a novel framework called ReCL to slow down forgetting in…

Machine Learning · Computer Science 2025-03-04 Pascal Janetzky , Tobias Schlagenhauf , Stefan Feuerriegel

Continual HyperTransformer: A Meta-Learner for Continual Few-Shot Learning

We focus on the problem of learning without forgetting from multiple tasks arriving sequentially, where each task is defined using a few-shot episode of novel or already seen classes. We approach this problem using the recently published…

Machine Learning · Computer Science 2024-08-20 Max Vladymyrov , Andrey Zhmoginov , Mark Sandler

Continual Learning: Tackling Catastrophic Forgetting in Deep Neural Networks with Replay Processes

Humans learn all their life long. They accumulate knowledge from a sequence of learning experiences and remember the essential concepts without forgetting what they have learned previously. Artificial neural networks struggle to learn…

Machine Learning · Computer Science 2020-12-09 Timothée Lesort

Learning to Modulate Random Weights: Neuromodulation-inspired Neural Networks For Efficient Continual Learning

Existing Continual Learning (CL) approaches have focused on addressing catastrophic forgetting by leveraging regularization methods, replay buffers, and task-specific components. However, realistic CL solutions must be shaped not only by…

Machine Learning · Computer Science 2023-10-11 Jinyung Hong , Theodore P. Pavlic

SpaceNet: Make Free Space For Continual Learning

The continual learning (CL) paradigm aims to enable neural networks to learn tasks continually in a sequential fashion. The fundamental challenge in this learning paradigm is catastrophic forgetting previously learned tasks when the model…

Machine Learning · Computer Science 2021-04-15 Ghada Sokar , Decebal Constantin Mocanu , Mykola Pechenizkiy

No Forgetting Learning: Buffer-free Continual Learning Classification

Most Continual Learning (CL) methods maintain performance on earlier tasks by storing exemplars in a replay buffer, introducing memory overhead that scales with the number of tasks and raising privacy concerns in regulated domains. We…

Machine Learning · Computer Science 2026-05-11 Mohammad Ali Vahedifar , Qi Zhang