English
Related papers

Related papers: DiffusePast: Diffusion-based Generative Replay for…

200 papers

Recent advancements in 3D content generation from text or a single image struggle with limited high-quality 3D datasets and inconsistency from 2D multi-view generation. We introduce DiffSplat, a novel 3D generative framework that natively…

Computer Vision and Pattern Recognition · Computer Science 2025-01-29 Chenguo Lin , Panwang Pan , Bangbang Yang , Zeming Li , Yadong Mu

Semantic segmentation is a computer vision task where classification is performed at a pixel level. Due to this, the process of labeling images for semantic segmentation is time-consuming and expensive. To mitigate this cost there has been…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Javier Montalvo , Álvaro García-Martín , Pablo Carballeira , Juan C. SanMiguel

Language-driven image segmentation is a fundamental task in vision-language understanding, requiring models to segment regions of an image corresponding to natural language expressions. Traditional methods approach this as a discriminative…

Computer Vision and Pattern Recognition · Computer Science 2025-08-28 Yuhao Chen , Shubin Chen , Liang Lin , Guangrun Wang

This paper considers the problem of utilizing a large-scale text-to-image diffusion model to tackle the challenging Inexact Segmentation (IS) task. Unlike traditional approaches that rely heavily on discriminative-model-based paradigms or…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Tianjiao Zhang , Fei Zhang , Jiangchao Yao , Ya Zhang , Yanfeng Wang

Class-incremental learning aims to learn new classes in an incremental fashion without forgetting the previously learned ones. Several research works have shown how additional data can be used by incremental models to help mitigate…

Machine Learning · Computer Science 2023-10-11 Quentin Jodelet , Xin Liu , Yin Jun Phua , Tsuyoshi Murata

Diffusion models are powerful generative models that achieve state-of-the-art performance in image synthesis. However, training them demands substantial amounts of data and computational resources. Continual learning would allow for…

Machine Learning · Computer Science 2025-03-05 Sergi Masip , Pau Rodriguez , Tinne Tuytelaars , Gido M. van de Ven

Weakly Supervised Semantic Segmentation (WSSS) with image-level labels typically leverages Class Activation Maps (CAMs) to achieve pixel-level predictions. Recently, Contrastive Language-Image Pre-training (CLIP) has been introduced to…

Computer Vision and Pattern Recognition · Computer Science 2026-05-07 Zhiwei Yang , Pengfei Song , Yucong Meng , Kexue Fu , Shuo Wang , Zhijian Song

Recently, large-scale diffusion models, e.g., Stable diffusion and DallE2, have shown remarkable results on image synthesis. On the other hand, large-scale cross-modal pre-trained models (e.g., CLIP, ALIGN, and FILIP) are competent for…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Runhui Huang , Jianhua Han , Guansong Lu , Xiaodan Liang , Yihan Zeng , Wei Zhang , Hang Xu

Diffusion models has emerged as a powerful framework for tasks like image controllable generation and dense prediction. However, existing models often struggle to capture underlying semantics (e.g., edges, textures, shapes) and effectively…

Computer Vision and Pattern Recognition · Computer Science 2025-03-07 Zhong Ji , Weilong Cao , Yan Zhang , Yanwei Pang , Jungong Han , Xuelong Li

Class incremental semantic segmentation (CISS) aims to segment new classes during continual steps while preventing the forgetting of old knowledge. Existing methods alleviate catastrophic forgetting by replaying distributions of previously…

Computer Vision and Pattern Recognition · Computer Science 2024-12-18 Guilin Zhu , Dongyue Wu , Changxin Gao , Runmin Wang , Weidong Yang , Nong Sang

Preparing training data for deep vision models is a labor-intensive task. To address this, generative models have emerged as an effective solution for generating synthetic data. While current generative models produce image-level category…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Quang Nguyen , Truong Vu , Anh Tran , Khoi Nguyen

Accurate medical image segmentation is crucial for precise anatomical delineation. Deep learning models like U-Net have shown great success but depend heavily on large datasets and struggle with domain shifts, complex structures, and…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Hantao Zhang , Yuhe Liu , Jiancheng Yang , Weidong Guo , Xinyuan Wang , Pascal Fua

In the field of class incremental learning (CIL), generative replay has become increasingly prominent as a method to mitigate the catastrophic forgetting, alongside the continuous improvements in generative models. However, its application…

Computer Vision and Pattern Recognition · Computer Science 2024-05-08 Junsu Kim , Hoseong Cho , Jihyeon Kim , Yihalem Yimolal Tiruneh , Seungryul Baek

While learned image compression (LIC) focuses on efficient data transmission, generative image compression (GIC) extends this framework by integrating generative modeling to produce photo-realistic reconstructed images. In this paper, we…

Image and Video Processing · Electrical Eng. & Systems 2025-05-28 Minghao Han , Weiyi You , Jinhua Zhang , Leheng Zhang , Ce Zhu , Shuhang Gu

Diffusion models have demonstrated excellent performance in image generation. Although various few-shot semantic segmentation (FSS) models with different network structures have been proposed, performance improvement has reached a…

Computer Vision and Pattern Recognition · Computer Science 2023-10-12 Weimin Tan , Siyuan Chen , Bo Yan

Dataset expansion can effectively alleviate the problem of data scarcity for medical image segmentation, due to privacy concerns and labeling difficulties. However, existing expansion algorithms still face great challenges due to their…

Image and Video Processing · Electrical Eng. & Systems 2023-06-07 Shitong Shao , Xiaohan Yuan , Zhen Huang , Ziming Qiu , Shuai Wang , Kevin Zhou

We propose DAVIS, a Diffusion-based Audio-VIsual Separation framework that solves the audio-visual sound source separation task through generative learning. Existing methods typically frame sound separation as a mask-based regression…

Computer Vision and Pattern Recognition · Computer Science 2025-09-29 Chao Huang , Susan Liang , Yapeng Tian , Anurag Kumar , Chenliang Xu

Image synthesis approaches, e.g., generative adversarial networks, have been popular as a form of data augmentation in medical image analysis tasks. It is primarily beneficial to overcome the shortage of publicly accessible data and…

Computer Vision and Pattern Recognition · Computer Science 2023-10-05 Shiyi Du , Xiaosong Wang , Yongyi Lu , Yuyin Zhou , Shaoting Zhang , Alan Yuille , Kang Li , Zongwei Zhou

Semantic communication is expected to be one of the cores of next-generation AI-based communications. One of the possibilities offered by semantic communication is the capability to regenerate, at the destination side, images or videos…

Artificial Intelligence · Computer Science 2026-05-18 Eleonora Grassucci , Sergio Barbarossa , Danilo Comminiello

Learning from a large corpus of data, pre-trained models have achieved impressive progress nowadays. As popular generative pre-training, diffusion models capture both low-level visual knowledge and high-level semantic relations. In this…

Computer Vision and Pattern Recognition · Computer Science 2023-03-20 Chaofan Ma , Yuhuan Yang , Chen Ju , Fei Zhang , Jinxiang Liu , Yu Wang , Ya Zhang , Yanfeng Wang
‹ Prev 1 2 3 10 Next ›