English
Related papers

Related papers: EmerDiff: Emerging Pixel-level Semantic Knowledge …

200 papers

Diffusion models have shown impressive performance for generative modelling of images. In this paper, we present a novel semantic segmentation method based on diffusion models. By modifying the training and sampling scheme, we show that…

Computer Vision and Pattern Recognition · Computer Science 2021-12-28 Julia Wolleb , Robin Sandkühler , Florentin Bieder , Philippe Valmaggia , Philippe C. Cattin

Denoising diffusion probabilistic models have recently received much research attention since they outperform alternative approaches, such as GANs, and currently provide state-of-the-art generative performance. The superior performance of…

Computer Vision and Pattern Recognition · Computer Science 2022-03-17 Dmitry Baranchuk , Ivan Rubachev , Andrey Voynov , Valentin Khrulkov , Artem Babenko

Preparing training data for deep vision models is a labor-intensive task. To address this, generative models have emerged as an effective solution for generating synthetic data. While current generative models produce image-level category…

Computer Vision and Pattern Recognition · Computer Science 2023-11-14 Quang Nguyen , Truong Vu , Anh Tran , Khoi Nguyen

Diffusion models are highly regarded for their controllability and the diversity of images they generate. However, class-conditional generation methods based on diffusion models often focus on more common categories. In large-scale…

Computer Vision and Pattern Recognition · Computer Science 2025-12-08 Kun Wang , Donglin Di , Tonghua Su , Lei Fan

Pre-trained diffusion models have demonstrated remarkable proficiency in synthesizing images across a wide range of scenarios with customizable prompts, indicating their effective capacity to capture universal features. Motivated by this,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-22 Yuxiang Ji , Boyong He , Chenyuan Qu , Zhuoyue Tan , Chuan Qin , Liaoni Wu

The pre-trained text-image discriminative models, such as CLIP, has been explored for open-vocabulary semantic segmentation with unsatisfactory results due to the loss of crucial localization information and awareness of object shapes.…

Computer Vision and Pattern Recognition · Computer Science 2024-01-23 Jinglong Wang , Xiawei Li , Jing Zhang , Qingyuan Xu , Qin Zhou , Qian Yu , Lu Sheng , Dong Xu

Diffusion Probabilistic Methods are employed for state-of-the-art image generation. In this work, we present a method for extending such models for performing image segmentation. The method learns end-to-end, without relying on a…

Computer Vision and Pattern Recognition · Computer Science 2022-09-08 Tomer Amit , Tal Shaharbany , Eliya Nachmani , Lior Wolf

Aside from offering state-of-the-art performance in medical image generation, denoising diffusion probabilistic models (DPM) can also serve as a representation learner to capture semantic information and potentially be used as an image…

Image and Video Processing · Electrical Eng. & Systems 2024-07-09 Chun-Mei Feng

Text-to-image diffusion models excel at translating language prompts into photorealistic images by implicitly grounding textual concepts through their cross-modal attention mechanisms. Recent multi-modal diffusion transformers extend this…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Chaehyun Kim , Heeseong Shin , Eunbeen Hong , Heeji Yoon , Anurag Arnab , Paul Hongsuck Seo , Sunghwan Hong , Seungryong Kim

Entrusted with the goal of pixel-level object classification, the semantic segmentation networks entail the laborious preparation of pixel-level annotation masks. To obtain pixel-level annotation masks for a given class without human…

Computer Vision and Pattern Recognition · Computer Science 2025-09-16 Joon Hyun Park , Kumju Jo , Sungyong Baik

Constructing high-definition (HD) maps is a crucial requirement for enabling autonomous driving. In recent years, several map segmentation algorithms have been developed to address this need, leveraging advancements in Bird's-Eye View (BEV)…

Computer Vision and Pattern Recognition · Computer Science 2024-09-04 Peijin Jia , Tuopu Wen , Ziang Luo , Mengmeng Yang , Kun Jiang , Zhiquan Lei , Xuewei Tang , Ziyuan Liu , Le Cui , Bo Zhang , Long Huang , Diange Yang

Text-to-image diffusion models are now capable of generating images that are often indistinguishable from real images. To generate such images, these models must understand the semantics of the objects they are asked to generate. In this…

Computer Vision and Pattern Recognition · Computer Science 2023-12-29 Eric Hedlin , Gopal Sharma , Shweta Mahajan , Hossam Isack , Abhishek Kar , Andrea Tagliasacchi , Kwang Moo Yi

The evolution of semantic segmentation has long been dominated by learning more discriminative image representations for classifying each pixel. Despite the prominent advancements, the priors of segmentation masks themselves, e.g.,…

Computer Vision and Pattern Recognition · Computer Science 2023-06-23 Zeqiang Lai , Yuchen Duan , Jifeng Dai , Ziheng Li , Ying Fu , Hongsheng Li , Yu Qiao , Wenhai Wang

Depth information provides valuable insights into the 3D structure especially the outline of objects, which can be utilized to improve the semantic segmentation tasks. However, a naive fusion of depth information can disrupt feature and…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Wei Sun , Yuan Li , Qixiang Ye , Jianbin Jiao , Yanzhao Zhou

Image segmentation is crucial in many computational pathology pipelines, including accurate disease diagnosis, subtyping, outcome, and survivability prediction. The common approach for training a segmentation model relies on a pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2025-04-10 Sachin Kumar Danisetty , Alexandros Graikos , Srikar Yellapragada , Dimitris Samaras

Medical image segmentation is crucial for accurate clinical diagnoses, yet it faces challenges such as low contrast between lesions and normal tissues, unclear boundaries, and high variability across patients. Deep learning has improved…

Image and Video Processing · Electrical Eng. & Systems 2024-12-09 Houze Liu , Tong Zhou , Yanlin Xiang , Aoran Shen , Jiacheng Hu , Junliang Du

Diffusion models, while trained for image generation, have emerged as powerful foundational feature extractors for downstream tasks. We find that off-the-shelf diffusion models, trained exclusively to generate natural RGB images, can…

Computer Vision and Pattern Recognition · Computer Science 2025-06-04 Nurislam Tursynbek , Hastings Greer , Basar Demir , Marc Niethammer

Diffusion models have demonstrated excellent performance in image generation. Although various few-shot semantic segmentation (FSS) models with different network structures have been proposed, performance improvement has reached a…

Computer Vision and Pattern Recognition · Computer Science 2023-10-12 Weimin Tan , Siyuan Chen , Bo Yan

Semantic segmentation demands dense pixel-level annotations, which can be prohibitively expensive - especially under extremely constrained labeling budgets. In this paper, we address the problem of low-budget active learning for semantic…

Computer Vision and Pattern Recognition · Computer Science 2025-10-28 Jeongin Kim , Wonho Bae , YouLee Han , Giyeong Oh , Youngjae Yu , Danica J. Sutherland , Junhyug Noh

The advance of generative models for images has inspired various training techniques for image recognition utilizing synthetic images. In semantic segmentation, one promising approach is extracting pseudo-masks from attention maps in…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Ryota Yoshihashi , Yuya Otsuka , Kenji Doi , Tomohiro Tanaka , Hirokatsu Kataoka
‹ Prev 1 2 3 10 Next ›