Related papers: Depth-guided Texture Diffusion for Image Semantic …

Language-Depth Navigated Thermal and Visible Image Fusion

Depth-guided multimodal fusion combines depth information from visible and infrared images, significantly enhancing the performance of 3D reconstruction and robotics applications. Existing thermal-visible image fusion mainly focuses on…

Computer Vision and Pattern Recognition · Computer Science 2025-04-15 Jinchang Zhang , Zijun Li , Guoyu Lu

Diffusion Features to Bridge Domain Gap for Semantic Segmentation

Pre-trained diffusion models have demonstrated remarkable proficiency in synthesizing images across a wide range of scenarios with customizable prompts, indicating their effective capacity to capture universal features. Motivated by this,…

Computer Vision and Pattern Recognition · Computer Science 2024-11-22 Yuxiang Ji , Boyong He , Chenyuan Qu , Zhuoyue Tan , Chuan Qin , Liaoni Wu

Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features

We present Diff3F as a simple, robust, and class-agnostic feature descriptor that can be computed for untextured input shapes (meshes or point clouds). Our method distills diffusion features from image foundational models onto input shapes.…

Computer Vision and Pattern Recognition · Computer Science 2024-04-04 Niladri Shekhar Dutt , Sanjeev Muralikrishnan , Niloy J. Mitra

Structure Matters: Tackling the Semantic Discrepancy in Diffusion Models for Image Inpainting

Denoising diffusion probabilistic models for image inpainting aim to add the noise to the texture of image during the forward process and recover masked regions with unmasked ones of the texture via the reverse denoising process. Despite…

Computer Vision and Pattern Recognition · Computer Science 2024-04-02 Haipeng Liu , Yang Wang , Biao Qian , Meng Wang , Yong Rui

Text-DiFuse: An Interactive Multi-Modal Image Fusion Framework based on Text-modulated Diffusion Model

Existing multi-modal image fusion methods fail to address the compound degradations presented in source images, resulting in fusion images plagued by noise, color bias, improper exposure, \textit{etc}. Additionally, these methods often…

Computer Vision and Pattern Recognition · Computer Science 2024-11-01 Hao Zhang , Lei Cao , Jiayi Ma

SegDiff: Image Segmentation with Diffusion Probabilistic Models

Diffusion Probabilistic Methods are employed for state-of-the-art image generation. In this work, we present a method for extending such models for performing image segmentation. The method learns end-to-end, without relying on a…

Computer Vision and Pattern Recognition · Computer Science 2022-09-08 Tomer Amit , Tal Shaharbany , Eliya Nachmani , Lior Wolf

Geometry Image Diffusion: Fast and Data-Efficient Text-to-3D with Image-Based Surface Representation

Generating high-quality 3D objects from textual descriptions remains a challenging problem due to computational cost, the scarcity of 3D data, and complex 3D representations. We introduce Geometry Image Diffusion (GIMDiffusion), a novel…

Computer Vision and Pattern Recognition · Computer Science 2024-09-06 Slava Elizarov , Ciara Rowles , Simon Donné

EmerDiff: Emerging Pixel-level Semantic Knowledge in Diffusion Models

Diffusion models have recently received increasing research attention for their remarkable transfer abilities in semantic segmentation tasks. However, generating fine-grained segmentation masks with diffusion models often requires…

Computer Vision and Pattern Recognition · Computer Science 2024-01-23 Koichi Namekata , Amirmojtaba Sabour , Sanja Fidler , Seung Wook Kim

Diffusion Models for Implicit Image Segmentation Ensembles

Diffusion models have shown impressive performance for generative modelling of images. In this paper, we present a novel semantic segmentation method based on diffusion models. By modifying the training and sampling scheme, we show that…

Computer Vision and Pattern Recognition · Computer Science 2021-12-28 Julia Wolleb , Robin Sandkühler , Florentin Bieder , Philippe Valmaggia , Philippe C. Cattin

Improving Semantic Segmentation via Decoupled Body and Edge Supervision

Existing semantic segmentation approaches either aim to improve the object's inner consistency by modeling the global context, or refine objects detail along their boundaries by multi-scale feature fusion. In this paper, a new paradigm for…

Computer Vision and Pattern Recognition · Computer Science 2020-08-19 Xiangtai Li , Xia Li , Li Zhang , Guangliang Cheng , Jianping Shi , Zhouchen Lin , Shaohua Tan , Yunhai Tong

TextDiffSeg: Text-guided Latent Diffusion Model for 3d Medical Images Segmentation

Diffusion Probabilistic Models (DPMs) have demonstrated significant potential in 3D medical image segmentation tasks. However, their high computational cost and inability to fully capture global 3D contextual information limit their…

Image and Video Processing · Electrical Eng. & Systems 2025-04-17 Kangbo Ma

Unsupervised Semantic Segmentation Through Depth-Guided Feature Correlation and Sampling

Traditionally, training neural networks to perform semantic segmentation required expensive human-made annotations. But more recently, advances in the field of unsupervised learning have made significant progress on this issue and towards…

Computer Vision and Pattern Recognition · Computer Science 2024-03-27 Leon Sick , Dominik Engel , Pedro Hermosilla , Timo Ropinski

Towards Deeper and Better Multi-view Feature Fusion for 3D Semantic Segmentation

3D point clouds are rich in geometric structure information, while 2D images contain important and continuous texture information. Combining 2D information to achieve better 3D semantic segmentation has become mainstream in 3D scene…

Computer Vision and Pattern Recognition · Computer Science 2022-12-14 Chaolong Yang , Yuyao Yan , Weiguang Zhao , Jianan Ye , Xi Yang , Amir Hussain , Kaizhu Huang

TEXTure: Text-Guided Texturing of 3D Shapes

In this paper, we present TEXTure, a novel method for text-guided generation, editing, and transfer of textures for 3D shapes. Leveraging a pretrained depth-to-image diffusion model, TEXTure applies an iterative scheme that paints a 3D…

Computer Vision and Pattern Recognition · Computer Science 2023-02-06 Elad Richardson , Gal Metzer , Yuval Alaluf , Raja Giryes , Daniel Cohen-Or

DiffEdit: Diffusion-based semantic image editing with mask guidance

Image generation has recently seen tremendous advances, with diffusion models allowing to synthesize convincing images for a large variety of text prompts. In this article, we propose DiffEdit, a method to take advantage of text-conditioned…

Computer Vision and Pattern Recognition · Computer Science 2022-10-21 Guillaume Couairon , Jakob Verbeek , Holger Schwenk , Matthieu Cord

LayerDiffusion: Layered Controlled Image Editing with Diffusion Models

Text-guided image editing has recently experienced rapid development. However, simultaneously performing multiple editing actions on a single image, such as background replacement and specific subject attribute changes, while maintaining…

Computer Vision and Pattern Recognition · Computer Science 2024-04-09 Pengzhi Li , QInxuan Huang , Yikang Ding , Zhiheng Li

Improving Semantic Image Segmentation via Label Fusion in Semantically Textured Meshes

Models for semantic segmentation require a large amount of hand-labeled training data which is costly and time-consuming to produce. For this purpose, we present a label fusion framework that is capable of improving semantic pixel labels of…

Computer Vision and Pattern Recognition · Computer Science 2022-02-25 Florian Fervers , Timo Breuer , Gregor Stachowiak , Sebastian Bullinger , Christoph Bodensteiner , Michael Arens

Open-Vocabulary 3D Semantic Segmentation with Text-to-Image Diffusion Models

In this paper, we investigate the use of diffusion models which are pre-trained on large-scale image-caption pairs for open-vocabulary 3D semantic understanding. We propose a novel method, namely Diff2Scene, which leverages frozen…

Computer Vision and Pattern Recognition · Computer Science 2024-07-19 Xiaoyu Zhu , Hao Zhou , Pengfei Xing , Long Zhao , Hao Xu , Junwei Liang , Alexander Hauptmann , Ting Liu , Andrew Gallagher

Dif-Fusion: Towards High Color Fidelity in Infrared and Visible Image Fusion with Diffusion Models

Color plays an important role in human visual perception, reflecting the spectrum of objects. However, the existing infrared and visible image fusion methods rarely explore how to handle multi-spectral/channel data directly and achieve high…

Computer Vision and Pattern Recognition · Computer Science 2024-10-28 Jun Yue , Leyuan Fang , Shaobo Xia , Yue Deng , Jiayi Ma

TexFusion: Synthesizing 3D Textures with Text-Guided Image Diffusion Models

We present TexFusion (Texture Diffusion), a new method to synthesize textures for given 3D geometries, using large-scale text-guided image diffusion models. In contrast to recent works that leverage 2D text-to-image diffusion models to…

Computer Vision and Pattern Recognition · Computer Science 2023-10-24 Tianshi Cao , Karsten Kreis , Sanja Fidler , Nicholas Sharp , Kangxue Yin