Related papers: Exploring Continual Learning for Code Generation M…

200 papers

Continual Learning (CL) enables machine learning models to learn from continuously shifting new training data in the absence of data from old tasks. Recently, pretrained vision transformers combined with prompt tuning have shown promise for…

Computer Vision and Pattern Recognition · Computer Science 2024-04-01 Anurag Roy, Riddhiman Moulick, Vinay K. Verma, Saptarshi Ghosh, Abir Das

User modeling in large e-commerce platforms aims to optimize user experiences by incorporating various customer activities. Traditional models targeting a single task often focus on specific business metrics, neglecting the comprehensive…

Information Retrieval · Computer Science 2025-02-28 Mingdai Yang, Fan Yang, Yanhui Guo, Shaoyuan Xu, Tianchen Zhou, Yetian Chen, Simone Shao, Jia Liu, Yan Gao

Learning-based techniques, especially advanced pre-trained models for code, have demonstrated capabilities in code understanding and generation, solving diverse software engineering (SE) tasks. Despite the promising results, current training…

Software Engineering · Computer Science 2025-02-07 Kyi Shin Khant, Hong Yi Lin, Patanamon Thongtanunam

Large Language Models (LLMs) have achieved remarkable performance across various reasoning tasks, yet post-training is constrained by inefficient sample utilization and inflexible difficulty samples processing. To address these limitations,…

Prompt-tuning methods for Continual Learning (CL) freeze a large pre-trained model and train a few parameter vectors termed prompts. Most of these methods organize these vectors in a pool of key-value pairs and use the input image as query…

Continual Learning aims to learn a single model on a sequence of tasks without having access to data from previous tasks. The biggest challenge in the domain still remains catastrophic forgetting: a loss in performance on seen classes of…

Computer Vision and Pattern Recognition · Computer Science 2023-08-31 Muhammad Gul Zain Ali Khan, Muhammad Ferjad Naeem, Luc Van Gool, Didier Stricker, Federico Tombari, Muhammad Zeshan Afzal

Continual learning (CL) enables deep networks to acquire new knowledge while avoiding catastrophic forgetting. The powerful generalization ability of pre-trained models (PTMs), such as the Contrastive Language-Image Pre-training (CLIP)…

Computer Vision and Pattern Recognition · Computer Science 2025-12-22 Haodong Lu, Xinyu Zhang, Kristen Moore, Jason Xue, Lina Yao, Anton van den Hengel, Dong Gong

Continual learning (CL) has emerged as a pivotal paradigm to enable large language models (LLMs) to dynamically adapt to evolving knowledge and sequential tasks while mitigating catastrophic forgetting, a critical limitation of the static…

Computation and Language · Computer Science 2026-03-16 Hongyang Chen, Zhongwu Sun, Hongfei Ye, Kunchi Li, Xuemin Lin

Continual learning (CL) refers to a machine learning paradigm that learns continuously without forgetting previously acquired knowledge. Thereby, a major difficulty in CL is catastrophic forgetting of preceding tasks, caused by shifts in data…

Machine Learning · Computer Science 2023-03-08 Stella Ho, Ming Liu, Lan Du, Longxiang Gao, Yong Xiang

Continual learning (CL) is a major challenge of machine learning (ML) and describes the ability to learn several tasks sequentially without catastrophic forgetting (CF). Recent works indicate that CL is a complex topic, even more so when…

Machine Learning · Computer Science 2022-06-09 Benedikt Bagus, Alexander Gepperth

This paper explores the problem of continual learning (CL) of vision-language models (VLMs) in open domains, where the models need to perform continual updating and inference on a streaming of datasets from diverse seen and unseen domains…

Computer Vision and Pattern Recognition · Computer Science 2024-03-18 Yukun Li, Guansong Pang, Wei Suo, Chenchen Jing, Yuling Xi, Lingqiao Liu, Hao Chen, Guoqiang Liang, Peng Wang

Continual learning (CL) empowers pre-trained vision-language models to adapt effectively to novel or previously underrepresented data distributions without comprehensive retraining, enhancing their adaptability and efficiency. While…

Artificial Intelligence · Computer Science 2025-09-04 Zhiyuan Wang, Bokui Chen

Comprehensively understanding and accurately predicting the performance of large language models across diverse downstream tasks has emerged as a pivotal challenge in NLP research. The pioneering scaling law on downstream works demonstrated…

Computation and Language · Computer Science 2024-10-04 Qiyuan Zhang, Fuyuan Lyu, Xue Liu, Chen Ma

Continual learning (CL) is a learning paradigm that emulates the human capability of learning and accumulating knowledge continually without forgetting the previously learned knowledge and also transferring the learned knowledge to help…

Computation and Language · Computer Science 2023-05-12 Zixuan Ke, Bing Liu

Continual learning (CL) in large language models (LLMs) is an evolving domain that focuses on developing efficient and sustainable training strategies to adapt models to emerging knowledge and achieve robustness in dynamic environments. Our…

Computation and Language · Computer Science 2025-02-13 Çağatay Yıldız, Nishaanth Kanna Ravichandran, Nitin Sharma, Matthias Bethge, Beyza Ermis

Recent Prompt-based Continual learning (PCL) has achieved remarkable performance with pre-trained models. These approaches expand a prompt pool by adding a new set of prompts while learning and select the correct set during inference.…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Qian Feng, Da-wei Zhou, Hanbin Zhao, Chao Zhang, Jiahua Dong, Dengxin Dai, Hui Qian

Nowadays, real-world applications often face streaming data, which requires the learning system to absorb new knowledge as data evolves. Continual Learning (CL) aims to achieve this goal and meanwhile overcome the catastrophic forgetting of…

Machine Learning · Computer Science 2024-04-24 Da-Wei Zhou, Hai-Long Sun, Jingyi Ning, Han-Jia Ye, De-Chuan Zhan

Continual learning (CL) enables models to adapt to evolving data streams without catastrophic forgetting, a fundamental requirement for real-world AI systems. However, the current methods often depend on large replay buffers or heavily…

Machine Learning · Computer Science 2025-11-14 Indu Solomon, Aye Phyu Phyu Aung, Uttam Kumar, Senthilnath Jayavelu

Continual learning (CL) learns a sequence of tasks incrementally with the goal of achieving two main objectives: overcoming catastrophic forgetting (CF) and encouraging knowledge transfer (KT) across tasks. However, most existing techniques…

Computation and Language · Computer Science 2021-12-21 Zixuan Ke, Bing Liu, Nianzu Ma, Hu Xu, Lei Shu

Large Language Models (LMs) are known to encode world knowledge in their parameters as they pretrain on a vast amount of web corpus, which is often utilized for performing knowledge-dependent downstream tasks such as question answering,…

Computation and Language · Computer Science 2022-05-25 Joel Jang, Seonghyeon Ye, Sohee Yang, Joongbo Shin, Janghoon Han, Gyeonghun Kim, Stanley Jungkyu Choi, Minjoon Seo