Related papers: Premonition: Using Generative Models to Preempt Fu…

Continual Pre-Training Mitigates Forgetting in Language and Vision

Pre-trained models are nowadays a fundamental component of machine learning research. In continual learning, they are commonly used to initialize the model before training on the stream of non-stationary data. However, pre-training is…

Machine Learning · Computer Science 2022-05-20 Andrea Cossu , Tinne Tuytelaars , Antonio Carta , Lucia Passaro , Vincenzo Lomonaco , Davide Bacciu

Learn to Predict Sets Using Feed-Forward Neural Networks

This paper addresses the task of set prediction using deep feed-forward neural networks. A set is a collection of elements which is invariant under permutation and the size of a set is not fixed in advance. Many real-world problems, such as…

Computer Vision and Pattern Recognition · Computer Science 2021-10-26 Hamid Rezatofighi , Tianyu Zhu , Roman Kaskman , Farbod T. Motlagh , Qinfeng Shi , Anton Milan , Daniel Cremers , Laura Leal-Taixé , Ian Reid

Continual Learning for Generative AI: From LLMs to MLLMs and Beyond

The rapid advancement of generative models has empowered modern AI systems to comprehend and produce highly sophisticated content, even achieving human-level performance in specific domains. However, these models are fundamentally…

Machine Learning · Computer Science 2025-08-26 Haiyang Guo , Fanhu Zeng , Fei Zhu , Jiayi Wang , Xukai Wang , Jingang Zhou , Hongbo Zhao , Wenzhuo Liu , Shijie Ma , Da-Han Wang , Xu-Yao Zhang , Cheng-Lin Liu

Investigating Continual Pretraining in Large Language Models: Insights and Implications

Continual learning (CL) in large language models (LLMs) is an evolving domain that focuses on developing efficient and sustainable training strategies to adapt models to emerging knowledge and achieve robustness in dynamic environments. Our…

Computation and Language · Computer Science 2025-02-13 Çağatay Yıldız , Nishaanth Kanna Ravichandran , Nitin Sharma , Matthias Bethge , Beyza Ermis

PrePrompt: Predictive prompting for class incremental learning

Class Incremental Learning (CIL) based on pre-trained models offers a promising direction for open-world continual learning. Existing methods typically rely on correlation-based strategies, where an image's classification feature is used as…

Computer Vision and Pattern Recognition · Computer Science 2025-05-20 Libo Huang , Zhulin An , Chuanguang Yang , Boyu Diao , Fei Wang , Yan Zeng , Zhifeng Hao , Yongjun Xu

Continual Learning for Predictive Maintenance: Overview and Challenges

Deep learning techniques have become one of the main propellers for solving engineering problems effectively and efficiently. For instance, Predictive Maintenance methods have been used to improve predictions of when maintenance is needed…

Machine Learning · Computer Science 2023-06-30 Julio Hurtado , Dario Salvati , Rudy Semola , Mattia Bosio , Vincenzo Lomonaco

Adaptive Explainable Continual Learning Framework for Regression Problems with Focus on Power Forecasts

Compared with traditional deep learning techniques, continual learning enables deep neural networks to learn continually and adaptively. Deep neural networks have to learn new tasks and overcome forgetting the knowledge obtained from the…

Machine Learning · Computer Science 2022-02-08 Yujiang He

Continual Classification Learning Using Generative Models

Continual learning is the ability to sequentially learn over time by accommodating knowledge while retaining previously learned experiences. Neural networks can learn multiple tasks when trained on them jointly, but cannot maintain…

Machine Learning · Computer Science 2018-10-26 Frantzeska Lavda , Jason Ramapuram , Magda Gregorova , Alexandros Kalousis

Antagonising explanation and revealing bias directly through sequencing and multimodal inference

Deep generative models produce data according to a learned representation, e.g. diffusion models, through a process of approximation computing possible samples. Approximation can be understood as reconstruction and the large datasets used…

Human-Computer Interaction · Computer Science 2023-09-25 Luís Arandas , Mick Grierson , Miguel Carvalhais

Convolutional Prompting meets Language Models for Continual Learning

Continual Learning (CL) enables machine learning models to learn from continuously shifting new training data in absence of data from old tasks. Recently, pretrained vision transformers combined with prompt tuning have shown promise for…

Computer Vision and Pattern Recognition · Computer Science 2024-04-01 Anurag Roy , Riddhiman Moulick , Vinay K. Verma , Saptarshi Ghosh , Abir Das

Personalized Prompt Learning for Explainable Recommendation

Providing user-understandable explanations to justify recommendations could help users better understand the recommended items, increase the system's ease of use, and gain users' trust. A typical approach to realize it is natural language…

Information Retrieval · Computer Science 2023-01-16 Lei Li , Yongfeng Zhang , Li Chen

Continual Learning with Pretrained Backbones by Tuning in the Input Space

The intrinsic difficulty in adapting deep learning models to non-stationary environments limits the applicability of neural networks to real-world tasks. This issue is critical in practical supervised learning settings, such as the ones in…

Machine Learning · Computer Science 2023-06-09 Simone Marullo , Matteo Tiezzi , Marco Gori , Stefano Melacci , Tinne Tuytelaars

Continual Predictive Learning from Videos

Predictive learning ideally builds the world model of physical processes in one or more given environments. Typical setups assume that we can collect data from all environments at all times. In practice, however, different prediction tasks…

Computer Vision and Pattern Recognition · Computer Science 2022-04-13 Geng Chen , Wendong Zhang , Han Lu , Siyu Gao , Yunbo Wang , Mingsheng Long , Xiaokang Yang

Lifelong GAN: Continual Learning for Conditional Image Generation

Lifelong learning is challenging for deep neural networks due to their susceptibility to catastrophic forgetting. Catastrophic forgetting occurs when a trained network is not able to maintain its ability to accomplish previously learned…

Computer Vision and Pattern Recognition · Computer Science 2019-08-23 Mengyao Zhai , Lei Chen , Fred Tung , Jiawei He , Megha Nawhal , Greg Mori

Continual Learning with Pre-Trained Models: A Survey

Nowadays, real-world applications often face streaming data, which requires the learning system to absorb new knowledge as data evolves. Continual Learning (CL) aims to achieve this goal and meanwhile overcome the catastrophic forgetting of…

Machine Learning · Computer Science 2024-04-24 Da-Wei Zhou , Hai-Long Sun , Jingyi Ning , Han-Jia Ye , De-Chuan Zhan

Predicting Through Generation: Why Generation Is Better for Prediction

This paper argues that generating output tokens is more effective than using pooled representations for prediction tasks because token-level generation retains more mutual information. Since LLMs are trained on massive text corpora using…

Computation and Language · Computer Science 2025-05-28 Md Kowsher , Nusrat Jahan Prottasha , Prakash Bhat , Chun-Nam Yu , Mojtaba Soltanalian , Ivan Garibay , Ozlem Garibay , Chen Chen , Niloofar Yousefi

GPT4Image: Large Pre-trained Models Help Vision Models Learn Better on Perception Task

The upsurge in pre-trained large models started by ChatGPT has swept across the entire deep learning community. Such powerful models demonstrate advanced generative ability and multimodal understanding capability, which quickly set new…

Computer Vision and Pattern Recognition · Computer Science 2025-02-28 Ning Ding , Yehui Tang , Zhongqian Fu , Chao Xu , Kai Han , Yunhe Wang

CLIP with Generative Latent Replay: a Strong Baseline for Incremental Learning

With the emergence of Transformers and Vision-Language Models (VLMs) such as CLIP, fine-tuning large pre-trained models has recently become a prevalent strategy in Continual Learning. This has led to the development of numerous prompting…

Computer Vision and Pattern Recognition · Computer Science 2024-10-29 Emanuele Frascaroli , Aniello Panariello , Pietro Buzzega , Lorenzo Bonicelli , Angelo Porrello , Simone Calderara

Improving In-context Learning of Multilingual Generative Language Models with Cross-lingual Alignment

Multilingual generative models obtain remarkable cross-lingual in-context learning capabilities through pre-training on large-scale corpora. However, they still exhibit a performance bias toward high-resource languages and learn isolated…

Computation and Language · Computer Science 2024-06-13 Chong Li , Shaonan Wang , Jiajun Zhang , Chengqing Zong

Conditioning Predictive Models: Risks and Strategies

Our intention is to provide a definitive reference on what it would take to safely make use of generative/predictive models in the absence of a solution to the Eliciting Latent Knowledge problem. Furthermore, we believe that large language…

Artificial Intelligence · Computer Science 2023-02-07 Evan Hubinger , Adam Jermyn , Johannes Treutlein , Rubi Hudson , Kate Woolverton