Related papers: Audio-Visual Class-Incremental Learning

200 papers

Continual learning is a challenging problem in which models need to be trained on non-stationary data across sequential tasks for class-incremental learning. While previous methods have focused on using either regularization or…

Computer Vision and Pattern Recognition · Computer Science 2023-09-12 Shentong Mo, Weiguo Pian, Yapeng Tian

Deep models, e.g., CNNs and Vision Transformers, have achieved impressive achievements in many vision tasks in the closed world. However, novel classes emerge from time to time in our ever-changing world, requiring a learning system to…

Computer Vision and Pattern Recognition · Computer Science 2024-07-16 Da-Wei Zhou, Qi-Wei Wang, Zhi-Hong Qi, Han-Jia Ye, De-Chuan Zhan, Ziwei Liu

Deep learning, with its robust automatic feature extraction capabilities, has demonstrated significant success in audio signal processing. Typically, these methods rely on static, pre-collected large-scale datasets for training, performing…

Sound · Computer Science 2024-12-19 Qisheng Xu, Yulin Sun, Yi Su, Qian Zhu, Xiaoyi Tan, Hongyu Wen, Zijian Gao, Kele Xu, Yong Dou, Dawei Feng

Incremental Learning (IL) aims to accumulate knowledge from sequential input tasks while overcoming catastrophic forgetting. Existing IL methods typically assume that an incoming task has only increments of classes or domains, referred to…

Computer Vision and Pattern Recognition · Computer Science 2024-09-18 Min-Yeong Park, Jae-Ho Lee, Gyeong-Moon Park

Continual learning aims to acquire new knowledge while retaining past information. Class-incremental learning (CIL) presents a challenging scenario where classes are introduced sequentially. For video data, the task becomes more complex…

Computer Vision and Pattern Recognition · Computer Science 2025-01-14 Tieyuan Chen, Huabin Liu, Chern Hong Lim, John See, Xing Gao, Junhui Hou, Weiyao Lin

In this paper, we propose a method for incremental learning of two distinct tasks over time: acoustic scene classification (ASC) and audio tagging (AT). We use a simple convolutional neural network (CNN) model as an incremental learner to…

Audio and Speech Processing · Electrical Eng. & Systems 2023-08-25 Manjunath Mulimani, Annamaria Mesaros

Multi-view learning (MVL) has gained great success in integrating information from multiple perspectives of a dataset to improve downstream task performance. To make MVL methods more practical in an open-ended environment, this paper…

Machine Learning · Computer Science 2023-10-16 Depeng Li, Tianqi Wang, Junwei Chen, Kenji Kawaguchi, Cheng Lian, Zhigang Zeng

This work explores class-incremental learning (CIL) for sound event detection (SED), advancing adaptability towards real-world scenarios. CIL's success in domains like computer vision inspired our SED-tailored method, addressing the unique…

Audio and Speech Processing · Electrical Eng. & Systems 2025-01-14 Yang Xiao, Rohan Kumar Das

This paper investigates the feasibility of class-incremental learning (CIL) for Sound Event Localization and Detection (SELD) tasks. The method features an incremental learner that can learn new sound classes independently while preserving…

Audio and Speech Processing · Electrical Eng. & Systems 2024-11-21 Ruchi Pandey, Manjunath Mulimani, Archontis Politis, Annamaria Mesaros

Class-incremental learning (CIL) with pre-trained models (PTMs) faces a critical trade-off between efficient adaptation and long-term stability. While analytic learning enables rapid, recursive closed-form updates, its efficacy is often…

Machine Learning · Computer Science 2026-05-08 Binyu Zhao, Wei Zhang, Xingrui Yu, Zhaonian Zou, Ivor Tsang

Deep learning has achieved notable success in 3D object detection with the advent of large-scale point cloud datasets. However, severe performance degradation in the past trained classes, i.e., catastrophic forgetting, still remains a…

Computer Vision and Pattern Recognition · Computer Science 2022-12-06 Ziyuan Zhao, Mingxi Xu, Peisheng Qian, Ramanpreet Singh Pahwa, Richard Chang

Class-incremental learning (CIL) enables models to continuously learn new categories from sequential tasks without forgetting previously acquired knowledge. While recent advances in vision-language models such as CLIP have demonstrated…

Computer Vision and Pattern Recognition · Computer Science 2025-11-17 Haoran Chen, Houze Xu, Micah Goldblum, Daoguo Dong, Zuxuan Wu

Class-incremental learning (CIL) learns a classification model with training data of different classes arising progressively. Existing CIL either suffers from serious accuracy loss due to catastrophic forgetting, or invades data privacy by…

Machine Learning · Computer Science 2022-12-13 Huiping Zhuang, Zhenyu Weng, Hongxin Wei, Renchunzi Xie, Kar-Ann Toh, Zhiping Lin

Unlike traditional Multimodal Class-Incremental Learning (MCIL) methods that focus only on vision and text, this paper explores MCIL across vision, audio and text modalities, addressing challenges in integrating complementary information…

Machine Learning · Computer Science 2025-06-13 Yukun Chen, Zihuan Qiu, Fanman Meng, Hongliang Li, Linfeng Xu, Qingbo Wu

Class-incremental Learning (CIL) enables the model to incrementally absorb knowledge from new classes and build a generic classifier across all previously encountered classes. When the model optimizes with new classes, the knowledge of…

Computer Vision and Pattern Recognition · Computer Science 2025-03-31 Juncen Guo, Xiaoguang Zhu, Liangyu Teng, Hao Yang, Jing Liu, Yang Liu, Liang Song

For future learning systems, incremental learning is desirable because it allows for: efficient resource usage by eliminating the need to retrain from scratch at the arrival of new data; reduced memory usage by preventing or limiting the…

Machine Learning · Computer Science 2022-10-12 Marc Masana, Xialei Liu, Bartlomiej Twardowski, Mikel Menta, Andrew D. Bagdanov, Joost van de Weijer

Audio-visual correlation learning aims to capture and understand natural phenomena between audio and visual data. The rapid growth of Deep Learning propelled the development of proposals that process audio-visual data and can be observed in…

Multimedia · Computer Science 2024-12-03 Luis Vilaca, Yi Yu, Paula Vinan

Real-world environments are inherently non-stationary, frequently introducing new classes over time. This is especially common in time series classification, such as the emergence of new disease classification in healthcare or the addition…

Machine Learning · Computer Science 2024-08-06 Zhongzheng Qiao, Quang Pham, Zhen Cao, Hoang H Le, P. N. Suganthan, Xudong Jiang, Ramasamy Savitha

In contrast to the incremental classification task, the incremental detection task is characterized by the presence of data ambiguity, as an image may have differently labeled bounding boxes across multiple continuous learning stages. This…

Computer Vision and Pattern Recognition · Computer Science 2024-07-10 Ziyue Huang, Yupeng He, Qingjie Liu, Yunhong Wang

Continual learning (or class incremental learning) is a realistic learning scenario for computer vision systems, where deep neural networks are trained on episodic data, and the data from previous episodes are generally inaccessible to the…

Computer Vision and Pattern Recognition · Computer Science 2026-02-05 Aditya R. Bhattacharya, Debanjan Goswami, Shayok Chakraborty