Related papers: TransDiff: Diffusion-Based Method for Manipulating…

Diffusion-Based Depth Inpainting for Transparent and Reflective Objects

Transparent and reflective objects, which are common in our everyday lives, present a significant challenge to 3D imaging techniques due to their unique visual and optical properties. Faced with these types of objects, RGB-D cameras fail to…

Computer Vision and Pattern Recognition · Computer Science 2025-09-22 Tianyu Sun, Dingchang Hu, Yixiang Dai, Guijin Wang

DidSee: Diffusion-Based Depth Completion for Material-Agnostic Robotic Perception and Manipulation

Commercial RGB-D cameras often produce noisy, incomplete depth maps for non-Lambertian objects. Traditional depth completion methods struggle to generalize due to the limited diversity and scale of training data. Recent advances exploit…

Computer Vision and Pattern Recognition · Computer Science 2025-06-30 Wenzhou Lyu, Jialing Lin, Wenqi Ren, Ruihao Xia, Feng Qian, Yang Tang

RGB-D Local Implicit Function for Depth Completion of Transparent Objects

Majority of the perception methods in robotics require depth information provided by RGB-D cameras. However, standard 3D sensors fail to capture depth of transparent objects due to refraction and absorption of light. In this paper, we…

Computer Vision and Pattern Recognition · Computer Science 2021-04-02 Luyang Zhu, Arsalan Mousavian, Yu Xiang, Hammad Mazhar, Jozef van Eenbergen, Shoubhik Debnath, Dieter Fox

6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation

Estimating the 6D object pose from a single RGB image often involves noise and indeterminacy due to challenges such as occlusions and cluttered backgrounds. Meanwhile, diffusion models have shown appealing performance in generating…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Li Xu, Haoxuan Qu, Yujun Cai, Jun Liu

Multi-Sensor Diffusion-Driven Optical Image Translation for Large-Scale Applications

Comparing images captured by disparate sensors is a common challenge in remote sensing. This requires image translation -- converting imagery from one sensor domain to another while preserving the original content. Denoising Diffusion…

Computer Vision and Pattern Recognition · Computer Science 2024-12-05 João Gabriel Vinholi, Marco Chini, Anis Amziane, Renato Machado, Danilo Silva, Patrick Matgen

Transparent Object Depth Completion

The perception of transparent objects for grasp and manipulation remains a major challenge, because existing robotic grasp methods which heavily rely on depth maps are not suitable for transparent objects due to their unique visual…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Yifan Zhou, Wanli Peng, Zhongyu Yang, He Liu, Yi Sun

D3RoMa: Disparity Diffusion-based Depth Sensing for Material-Agnostic Robotic Manipulation

Depth sensing is an important problem for 3D vision-based robotics. Yet, a real-world active stereo or ToF depth camera often produces noisy and incomplete depth which bottlenecks robot performances. In this work, we propose D3RoMa, a…

Robotics · Computer Science 2024-09-26 Songlin Wei, Haoran Geng, Jiayi Chen, Congyue Deng, Wenbo Cui, Chengyang Zhao, Xiaomeng Fang, Leonidas Guibas, He Wang

Diffusion-based RGB-D Semantic Segmentation with Deformable Attention Transformer

Vision-based perception and reasoning is essential for scene understanding in any autonomous system. RGB and depth images are commonly used to capture both the semantic and geometric features of the environment. Developing methods to…

Computer Vision and Pattern Recognition · Computer Science 2025-10-13 Minh Bui, Kostas Alexis

DCIRNet: Depth Completion with Iterative Refinement for Dexterous Grasping of Transparent and Reflective Objects

Transparent and reflective objects in everyday environments pose significant challenges for depth sensors due to their unique visual properties, such as specular reflections and light transmission. These characteristics often lead to…

Robotics · Computer Science 2025-06-12 Guanghu Xie, Zhiduo Jiang, Yonglong Zhang, Yang Liu, Zongwu Xie, Baoshi Cao, Hong Liu

Pyramid Diffusion Models For Low-light Image Enhancement

Recovering noise-covered details from low-light images is challenging, and the results given by previous methods leave room for improvement. Recent diffusion models show realistic and detailed image generation through a sequence of…

Computer Vision and Pattern Recognition · Computer Science 2023-05-18 Dewei Zhou, Zongxin Yang, Yi Yang

Depth-guided Texture Diffusion for Image Semantic Segmentation

Depth information provides valuable insights into the 3D structure especially the outline of objects, which can be utilized to improve the semantic segmentation tasks. However, a naive fusion of depth information can disrupt feature and…

Computer Vision and Pattern Recognition · Computer Science 2024-08-20 Wei Sun, Yuan Li, Qixiang Ye, Jianbin Jiao, Yanzhao Zhou

BlindDiff: Empowering Degradation Modelling in Diffusion Models for Blind Image Super-Resolution

Diffusion models (DM) have achieved remarkable promise in image super-resolution (SR). However, most of them are tailored to solving non-blind inverse problems with fixed known degradation settings, limiting their adaptability to real-world…

Computer Vision and Pattern Recognition · Computer Science 2024-03-18 Feng Li, Yixuan Wu, Zichao Liang, Runmin Cong, Huihui Bai, Yao Zhao, Meng Wang

PSDiffusion: Harmonized Multi-Layer Image Generation via Layout and Appearance Alignment

Transparent image layer generation plays a significant role in digital art and design workflows. Existing methods typically decompose transparent layers from a single RGB image using a set of tools or generate multiple transparent layers…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Dingbang Huang, Wenbo Li, Yifei Zhao, Xinyu Pan, Chun Wang, Yanhong Zeng, Bo Dai

DiffusionDepth: Diffusion Denoising Approach for Monocular Depth Estimation

Monocular depth estimation is a challenging task that predicts the pixel-wise depth from a single 2D image. Current methods typically model this problem as a regression or classification task. We propose DiffusionDepth, a new approach that…

Computer Vision and Pattern Recognition · Computer Science 2023-08-30 Yiqun Duan, Xianda Guo, Zheng Zhu

Relightify: Relightable 3D Faces from a Single Image via Diffusion Models

Following the remarkable success of diffusion models on image generation, recent works have also demonstrated their impressive ability to address a number of inverse problems in an unsupervised way, by properly constraining the sampling…

Computer Vision and Pattern Recognition · Computer Science 2023-08-23 Foivos Paraperas Papantoniou, Alexandros Lattas, Stylianos Moschoglou, Stefanos Zafeiriou

SinDDM: A Single Image Denoising Diffusion Model

Denoising diffusion models (DDMs) have led to staggering performance leaps in image generation, editing and restoration. However, existing DDMs use very large datasets for training. Here, we introduce a framework for training a DDM on a…

Computer Vision and Pattern Recognition · Computer Science 2023-06-08 Vladimir Kulikov, Shahar Yadin, Matan Kleiner, Tomer Michaeli

ClearGrasp: 3D Shape Estimation of Transparent Objects for Manipulation

Transparent objects are a common part of everyday life, yet they possess unique visual properties that make them incredibly difficult for standard 3D sensors to produce accurate depth estimates for. In many cases, they often appear as noisy…

Computer Vision and Pattern Recognition · Computer Science 2019-10-16 Shreeyak S. Sajjan, Matthew Moore, Mike Pan, Ganesh Nagaraja, Johnny Lee, Andy Zeng, Shuran Song

TranSplat: Surface Embedding-guided 3D Gaussian Splatting for Transparent Object Manipulation

Transparent object manipulation remains a significant challenge in robotics due to the difficulty of acquiring accurate and dense depth measurements. Conventional depth sensors often fail with transparent objects, resulting in incomplete or…

Computer Vision and Pattern Recognition · Computer Science 2025-02-19 Jeongyun Kim, Jeongho Noh, Dong-Guw Lee, Ayoung Kim

GeoDiff: Geometry-Guided Diffusion for Metric Depth Estimation

We introduce a novel framework for metric depth estimation that enhances pretrained diffusion-based monocular depth estimation (DB-MDE) models with stereo vision guidance. While existing DB-MDE methods excel at predicting relative depth,…

Computer Vision and Pattern Recognition · Computer Science 2025-10-22 Tuan Pham, Thanh-Tung Le, Xiaohui Xie, Stephan Mandt

AquaDiff: Diffusion-Based Underwater Image Enhancement for Addressing Color Distortion

Underwater images are severely degraded by wavelength-dependent light absorption and scattering, resulting in color distortion, low contrast, and loss of fine details that hinder vision-based underwater applications. To address these…

Computer Vision and Pattern Recognition · Computer Science 2025-12-18 Afrah Shaahid, Muzammil Behzad