Related papers: AlignDiff: Learning Physically-Grounded Camera Ali…

CaLDiff: Camera Localization in NeRF via Pose Diffusion

With the widespread use of NeRF-based implicit 3D representation, the need for camera localization in the same representation becomes manifestly apparent. Doing so not only simplifies the localization process -- by avoiding an…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Rashik Shrestha , Bishad Koju , Abhigyan Bhusal , Danda Pani Paudel , François Rameau

TAlignDiff: Automatic Tooth Alignment assisted by Diffusion-based Transformation Learning

Orthodontic treatment hinges on tooth alignment, which significantly affects occlusal function, facial aesthetics, and patients' quality of life. Current deep learning approaches predominantly concentrate on predicting transformation…

Computer Vision and Pattern Recognition · Computer Science 2025-08-07 Yunbi Liu , Enqi Tang , Shiyu Li , Lei Ma , Juncheng Li , Shu Lou , Yongchu Pan , Qingshan Liu

Cameras as Rays: Pose Estimation via Ray Diffusion

Estimating camera poses is a fundamental task for 3D reconstruction and remains challenging given sparsely sampled views (<10). In contrast to existing approaches that pursue top-down prediction of global parametrizations of camera…

Computer Vision and Pattern Recognition · Computer Science 2024-04-05 Jason Y. Zhang , Amy Lin , Moneish Kumar , Tzu-Hsuan Yang , Deva Ramanan , Shubham Tulsiani

AquaDiff: Diffusion-Based Underwater Image Enhancement for Addressing Color Distortion

Underwater images are severely degraded by wavelength-dependent light absorption and scattering, resulting in color distortion, low contrast, and loss of fine details that hinder vision-based underwater applications. To address these…

Computer Vision and Pattern Recognition · Computer Science 2025-12-18 Afrah Shaahid , Muzammil Behzad

Distortion Estimation Through Explicit Modeling of the Refractive Surface

Precise calibration is a must for high reliance 3D computer vision algorithms. A challenging case is when the camera is behind a protective glass or transparent object: due to refraction, the image is heavily distorted; the pinhole camera…

Computer Vision and Pattern Recognition · Computer Science 2019-09-25 Szabolcs Pável , Csanád Sándor , Lehel Csató

ReorientDiff: Diffusion Model based Reorientation for Object Manipulation

The ability to manipulate objects in a desired configurations is a fundamental requirement for robots to complete various practical applications. While certain goals can be achieved by picking and placing the objects of interest directly,…

Robotics · Computer Science 2023-09-18 Utkarsh A. Mishra , Yongxin Chen

BADGR: Bundle Adjustment Diffusion Conditioned by GRadients for Wide-Baseline Floor Plan Reconstruction

Reconstructing precise camera poses and floor plan layouts from wide-baseline RGB panoramas is a difficult and unsolved problem. We introduce BADGR, a novel diffusion model that jointly performs reconstruction and bundle adjustment (BA) to…

Computer Vision and Pattern Recognition · Computer Science 2025-03-26 Yuguang Li , Ivaylo Boyadzhiev , Zixuan Liu , Linda Shapiro , Alex Colburn

DiffCalib: Reformulating Monocular Camera Calibration as Diffusion-Based Dense Incident Map Generation

Monocular camera calibration is a key precondition for numerous 3D vision applications. Despite considerable advancements, existing methods often hinge on specific assumptions and struggle to generalize across varied real-world scenarios,…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Xiankang He , Guangkai Xu , Bo Zhang , Hao Chen , Ying Cui , Dongyan Guo

Boost 3D Reconstruction using Diffusion-based Monocular Camera Calibration

In this paper, we present DM-Calib, a diffusion-based approach for estimating pinhole camera intrinsic parameters from a single input image. Monocular camera calibration is essential for many 3D vision tasks. However, most existing methods…

Computer Vision and Pattern Recognition · Computer Science 2025-09-19 Junyuan Deng , Wei Yin , Xiaoyang Guo , Qian Zhang , Xiaotao Hu , Weiqiang Ren , Xiao-Xiao Long , Ping Tan

D3RoMa: Disparity Diffusion-based Depth Sensing for Material-Agnostic Robotic Manipulation

Depth sensing is an important problem for 3D vision-based robotics. Yet, a real-world active stereo or ToF depth camera often produces noisy and incomplete depth which bottlenecks robot performances. In this work, we propose D3RoMa, a…

Robotics · Computer Science 2024-09-26 Songlin Wei , Haoran Geng , Jiayi Chen , Congyue Deng , Wenbo Cui , Chengyang Zhao , Xiaomeng Fang , Leonidas Guibas , He Wang

PRaDA: Projective Radial Distortion Averaging

We tackle the problem of automatic calibration of radially distorted cameras in challenging conditions. Accurately determining distortion parameters typically requires either 1) solving the full Structure from Motion (SfM) problem involving…

Computer Vision and Pattern Recognition · Computer Science 2025-09-16 Daniil Sinitsyn , Linus Härenstam-Nielsen , Daniel Cremers

DMAligner: Enhancing Image Alignment via Diffusion Model Based View Synthesis

Image alignment is a fundamental task in computer vision with broad applications. Existing methods predominantly employ optical flow-based image warping. However, this technique is susceptible to common challenges such as occlusions and…

Computer Vision and Pattern Recognition · Computer Science 2026-03-27 Xinglong Luo , Ao Luo , Zhengning Wang , Yueqi Yang , Chaoyu Feng , Lei Lei , Bing Zeng , Shuaicheng Liu

A Deep Perceptual Measure for Lens and Camera Calibration

Image editing and compositing have become ubiquitous in entertainment, from digital art to AR and VR experiences. To produce beautiful composites, the camera needs to be geometrically calibrated, which can be tedious and requires a physical…

Computer Vision and Pattern Recognition · Computer Science 2023-07-28 Yannick Hold-Geoffroy , Dominique Piché-Meunier , Kalyan Sunkavalli , Jean-Charles Bazin , François Rameau , Jean-François Lalonde

PoseDiff: A Unified Diffusion Model Bridging Robot Pose Estimation and Video-to-Action Control

We present PoseDiff, a conditional diffusion model that unifies robot state estimation and control within a single framework. At its core, PoseDiff maps raw visual observations into structured robot states-such as 3D keypoints or joint…

Robotics · Computer Science 2025-11-03 Haozhuo Zhang , Michele Caprio , Jing Shao , Qiang Zhang , Jian Tang , Shanghang Zhang , Wei Pan

LiDAR-Camera Calibration under Arbitrary Configurations: Observability and Methods

LiDAR-camera calibration is a precondition for many heterogeneous systems that fuse data from LiDAR and camera. However, the constraint from common field of view and the requirement for strict time synchronization make the calibration a…

Robotics · Computer Science 2019-07-31 Bo Fu , Yue Wang , Xiaqing Ding , Yanmei Jiao , Li Tang , Rong Xiong

Deep Learning for Camera Calibration and Beyond: A Survey

Camera calibration involves estimating camera parameters to infer geometric features from captured sequences, which is crucial for computer vision and robotics. However, conventional calibration is laborious and requires dedicated…

Computer Vision and Pattern Recognition · Computer Science 2025-02-25 Kang Liao , Lang Nie , Shujuan Huang , Chunyu Lin , Jing Zhang , Yao Zhao , Moncef Gabbouj , Dacheng Tao

PoseDiffusion: Solving Pose Estimation via Diffusion-aided Bundle Adjustment

Camera pose estimation is a long-standing computer vision problem that to date often relies on classical methods, such as handcrafted keypoint matching, RANSAC and bundle adjustment. In this paper, we propose to formulate the Structure from…

Computer Vision and Pattern Recognition · Computer Science 2024-01-26 Jianyuan Wang , Christian Rupprecht , David Novotny

Cascaded Robust Rectification for Arbitrary Document Images

Document rectification in real-world scenarios poses significant challenges due to extreme variations in camera perspectives and physical distortions. Driven by the insight that complex transformations can be decomposed and resolved…

Computer Vision and Pattern Recognition · Computer Science 2025-12-01 Chaoyun Wang , Quanxin Huang , I-Chao Shen , Takeo Igarashi , Nanning Zheng , Caigui Jiang

AGLLDiff: Guiding Diffusion Models Towards Unsupervised Training-free Real-world Low-light Image Enhancement

Existing low-light image enhancement (LIE) methods have achieved noteworthy success in solving synthetic distortions, yet they often fall short in practical applications. The limitations arise from two inherent challenges in real-world LIE:…

Computer Vision and Pattern Recognition · Computer Science 2024-07-24 Yunlong Lin , Tian Ye , Sixiang Chen , Zhenqi Fu , Yingying Wang , Wenhao Chai , Zhaohu Xing , Lei Zhu , Xinghao Ding

DIVD: Deblurring with Improved Video Diffusion Model

Video deblurring presents a considerable challenge owing to the complexity of blur, which frequently results from a combination of camera shakes, and object motions. In the field of video deblurring, many previous works have primarily…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Haoyang Long , Yan Wang , Wendong Wang