Related papers: DiffusePast: Diffusion-based Generative Replay for…

DiffSplat: Repurposing Image Diffusion Models for Scalable Gaussian Splat Generation

Recent advancements in 3D content generation from text or a single image struggle with limited high-quality 3D datasets and inconsistency from 2D multi-view generation. We introduce DiffSplat, a novel 3D generative framework that natively…

Computer Vision and Pattern Recognition · Computer Science 2025-01-29 Chenguo Lin , Panwang Pan , Bangbang Yang , Zeming Li , Yadong Mu

Unsupervised Class Generation to Expand Semantic Segmentation Datasets

Semantic segmentation is a computer vision task where classification is performed at a pixel level. Due to this, the process of labeling images for semantic segmentation is time-consuming and expensive. To mitigate this cost there has been…

Computer Vision and Pattern Recognition · Computer Science 2025-01-07 Javier Montalvo , Álvaro García-Martín , Pablo Carballeira , Juan C. SanMiguel

GS: Generative Segmentation via Label Diffusion

Language-driven image segmentation is a fundamental task in vision-language understanding, requiring models to segment regions of an image corresponding to natural language expressions. Traditional methods approach this as a discriminative…

Computer Vision and Pattern Recognition · Computer Science 2025-08-28 Yuhao Chen , Shubin Chen , Liang Lin , Guangrun Wang

G4Seg: Generation for Inexact Segmentation Refinement with Diffusion Models

This paper considers the problem of utilizing a large-scale text-to-image diffusion model to tackle the challenging Inexact Segmentation (IS) task. Unlike traditional approaches that rely heavily on discriminative-model-based paradigms or…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Tianjiao Zhang , Fei Zhang , Jiangchao Yao , Ya Zhang , Yanfeng Wang

Class-Incremental Learning using Diffusion Model for Distillation and Replay

Class-incremental learning aims to learn new classes in an incremental fashion without forgetting the previously learned ones. Several research works have shown how additional data can be used by incremental models to help mitigate…

Machine Learning · Computer Science 2023-10-11 Quentin Jodelet , Xin Liu , Yin Jun Phua , Tsuyoshi Murata

Continual Learning of Diffusion Models with Generative Distillation

Diffusion models are powerful generative models that achieve state-of-the-art performance in image synthesis. However, training them demands substantial amounts of data and computational resources. Continual learning would allow for…

Machine Learning · Computer Science 2025-03-05 Sergi Masip , Pau Rodriguez , Tinne Tuytelaars , Gido M. van de Ven

DiCLIP: Diffusion Model Enhances CLIP's Dense Knowledge for Weakly Supervised Semantic Segmentation

Weakly Supervised Semantic Segmentation (WSSS) with image-level labels typically leverages Class Activation Maps (CAMs) to achieve pixel-level predictions. Recently, Contrastive Language-Image Pre-training (CLIP) has been introduced to…

Computer Vision and Pattern Recognition · Computer Science 2026-05-07 Zhiwei Yang , Pengfei Song , Yucong Meng , Kexue Fu , Shuo Wang , Zhijian Song

DiffDis: Empowering Generative Diffusion Model with Cross-Modal Discrimination Capability

Recently, large-scale diffusion models, e.g., Stable diffusion and DallE2, have shown remarkable results on image synthesis. On the other hand, large-scale cross-modal pre-trained models (e.g., CLIP, ALIGN, and FILIP) are competent for…

Computer Vision and Pattern Recognition · Computer Science 2023-08-21 Runhui Huang , Jianhua Han , Guansong Lu , Xiaodan Liang , Yihan Zeng , Wei Zhang , Hang Xu

Underlying Semantic Diffusion for Effective and Efficient In-Context Learning

Diffusion models has emerged as a powerful framework for tasks like image controllable generation and dense prediction. However, existing models often struggle to capture underlying semantics (e.g., edges, textures, shapes) and effectively…

Computer Vision and Pattern Recognition · Computer Science 2025-03-07 Zhong Ji , Weilong Cao , Yan Zhang , Yanwei Pang , Jungong Han , Xuelong Li

Adaptive Prototype Replay for Class Incremental Semantic Segmentation

Class incremental semantic segmentation (CISS) aims to segment new classes during continual steps while preventing the forgetting of old knowledge. Existing methods alleviate catastrophic forgetting by replaying distributions of previously…

Computer Vision and Pattern Recognition · Computer Science 2024-12-18 Guilin Zhu , Dongyue Wu , Changxin Gao , Runmin Wang , Weidong Yang , Nong Sang

Dataset Diffusion: Diffusion-based Synthetic Dataset Generation for Pixel-Level Semantic Segmentation

Preparing training data for deep vision models is a labor-intensive task. To address this, generative models have emerged as an effective solution for generating synthetic data. While current generative models produce image-level category…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Quang Nguyen , Truong Vu , Anh Tran , Khoi Nguyen

DiffAtlas: GenAI-fying Atlas Segmentation via Image-Mask Diffusion

Accurate medical image segmentation is crucial for precise anatomical delineation. Deep learning models like U-Net have shown great success but depend heavily on large datasets and struggle with domain shifts, complex structures, and…

Computer Vision and Pattern Recognition · Computer Science 2025-03-11 Hantao Zhang , Yuhe Liu , Jiancheng Yang , Weidong Guo , Xinyuan Wang , Pascal Fua

SDDGR: Stable Diffusion-based Deep Generative Replay for Class Incremental Object Detection

In the field of class incremental learning (CIL), generative replay has become increasingly prominent as a method to mitigate the catastrophic forgetting, alongside the continuous improvements in generative models. However, its application…

Computer Vision and Pattern Recognition · Computer Science 2024-05-08 Junsu Kim , Hoseong Cho , Jihyeon Kim , Yihalem Yimolal Tiruneh , Seungryul Baek

Generative Image Compression by Estimating Gradients of the Rate-variable Feature Distribution

While learned image compression (LIC) focuses on efficient data transmission, generative image compression (GIC) extends this framework by integrating generative modeling to produce photo-realistic reconstructed images. In this paper, we…

Image and Video Processing · Electrical Eng. & Systems 2025-05-28 Minghao Han , Weiyi You , Jinhua Zhang , Leheng Zhang , Ce Zhu , Shuhang Gu

DifFSS: Diffusion Model for Few-Shot Semantic Segmentation

Diffusion models have demonstrated excellent performance in image generation. Although various few-shot semantic segmentation (FSS) models with different network structures have been proposed, performance improvement has reached a…

Computer Vision and Pattern Recognition · Computer Science 2023-10-12 Weimin Tan , Siyuan Chen , Bo Yan

DiffuseExpand: Expanding dataset for 2D medical image segmentation using diffusion models

Dataset expansion can effectively alleviate the problem of data scarcity for medical image segmentation, due to privacy concerns and labeling difficulties. However, existing expansion algorithms still face great challenges due to their…

Image and Video Processing · Electrical Eng. & Systems 2023-06-07 Shitong Shao , Xiaohan Yuan , Zhen Huang , Ziming Qiu , Shuai Wang , Kevin Zhou

High-Quality Sound Separation Across Diverse Categories via Visually-Guided Generative Modeling

We propose DAVIS, a Diffusion-based Audio-VIsual Separation framework that solves the audio-visual sound source separation task through generative learning. Existing methods typically frame sound separation as a mask-based regression…

Computer Vision and Pattern Recognition · Computer Science 2025-09-29 Chao Huang , Susan Liang , Yapeng Tian , Anurag Kumar , Chenliang Xu

Boosting Dermatoscopic Lesion Segmentation via Diffusion Models with Visual and Textual Prompts

Image synthesis approaches, e.g., generative adversarial networks, have been popular as a form of data augmentation in medical image analysis tasks. It is primarily beneficial to overcome the shortage of publicly accessible data and…

Computer Vision and Pattern Recognition · Computer Science 2023-10-05 Shiyi Du , Xiaosong Wang , Yongyi Lu , Yuyin Zhou , Shaoting Zhang , Alan Yuille , Kang Li , Zongwei Zhou

Generative Semantic Communication: Diffusion Models Beyond Bit Recovery

Semantic communication is expected to be one of the cores of next-generation AI-based communications. One of the possibilities offered by semantic communication is the capability to regenerate, at the destination side, images or videos…

Artificial Intelligence · Computer Science 2026-05-18 Eleonora Grassucci , Sergio Barbarossa , Danilo Comminiello

DiffusionSeg: Adapting Diffusion Towards Unsupervised Object Discovery

Learning from a large corpus of data, pre-trained models have achieved impressive progress nowadays. As popular generative pre-training, diffusion models capture both low-level visual knowledge and high-level semantic relations. In this…

Computer Vision and Pattern Recognition · Computer Science 2023-03-20 Chaofan Ma , Yuhuan Yang , Chen Ju , Fei Zhang , Jinxiang Liu , Yu Wang , Ya Zhang , Yanfeng Wang