Related papers: DiffDreamer: Towards Consistent Unsupervised Singl…

Rethinking Real-world Image Deraining via An Unpaired Degradation-Conditioned Diffusion Model

Recent diffusion models have exhibited great potential in generative modeling tasks. Part of their success can be attributed to the ability of training stable on huge sets of paired synthetic data. However, adapting these models to…

Computer Vision and Pattern Recognition · Computer Science 2024-05-02 Yiyang Shen , Mingqiang Wei , Yongzhen Wang , Xueyang Fu , Jing Qin

Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis

Synthesizing extrapolated views remains a difficult task, especially in urban driving scenes, where the only reliable sources of data are limited RGB captures and sparse LiDAR points. To address this problem, we present PointmapDiff, a…

Computer Vision and Pattern Recognition · Computer Science 2025-12-25 Thang-Anh-Quan Nguyen , Nathan Piasco , Luis Roldão , Moussab Bennehar , Dzmitry Tsishkou , Laurent Caraffa , Jean-Philippe Tarel , Roland Brémond

Uncertainty-Aware Diffusion Guided Refinement of 3D Scenes

Reconstructing 3D scenes from a single image is a fundamentally ill-posed task due to the severely under-constrained nature of the problem. Consequently, when the scene is rendered from novel camera views, existing single image to 3D…

Computer Vision and Pattern Recognition · Computer Science 2025-10-10 Sarosij Bose , Arindam Dutta , Sayak Nag , Junge Zhang , Jiachen Li , Konstantinos Karydis , Amit K. Roy Chowdhury

ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing

This paper proposes ConsistDreamer - a novel framework that lifts 2D diffusion models with 3D awareness and 3D consistency, thus enabling high-fidelity instruction-guided scene editing. To overcome the fundamental limitation of missing 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 Jun-Kun Chen , Samuel Rota Bulò , Norman Müller , Lorenzo Porzi , Peter Kontschieder , Yu-Xiong Wang

Long-Term Photometric Consistent Novel View Synthesis with Diffusion Models

Novel view synthesis from a single input image is a challenging task, where the goal is to generate a new view of a scene from a desired camera pose that may be separated by a large motion. The highly uncertain nature of this synthesis task…

Computer Vision and Pattern Recognition · Computer Science 2023-08-23 Jason J. Yu , Fereshteh Forghani , Konstantinos G. Derpanis , Marcus A. Brubaker

Diffusion-based Generation, Optimization, and Planning in 3D Scenes

We introduce SceneDiffuser, a conditional generative model for 3D scene understanding. SceneDiffuser provides a unified model for solving scene-conditioned generation, optimization, and planning. In contrast to prior works, SceneDiffuser is…

Computer Vision and Pattern Recognition · Computer Science 2023-01-18 Siyuan Huang , Zan Wang , Puhao Li , Baoxiong Jia , Tengyu Liu , Yixin Zhu , Wei Liang , Song-Chun Zhu

DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis

We present DiffPortrait3D, a conditional diffusion model that is capable of synthesizing 3D-consistent photo-realistic novel views from as few as a single in-the-wild portrait. Specifically, given a single RGB input, we aim to synthesize…

Computer Vision and Pattern Recognition · Computer Science 2025-03-21 Yuming Gu , You Xie , Hongyi Xu , Guoxian Song , Yichun Shi , Di Chang , Jing Yang , Linjie Luo

Consistent View Synthesis with Pose-Guided Diffusion Models

Novel view synthesis from a single image has been a cornerstone problem for many Virtual Reality applications that provide immersive experiences. However, most existing techniques can only synthesize novel views within a limited range of…

Computer Vision and Pattern Recognition · Computer Science 2023-03-31 Hung-Yu Tseng , Qinbo Li , Changil Kim , Suhib Alsisan , Jia-Bin Huang , Johannes Kopf

DiffCamera: Arbitrary Refocusing on Images

The depth-of-field (DoF) effect, which introduces aesthetically pleasing blur, enhances photographic quality but is fixed and difficult to modify once the image has been created. This becomes problematic when the applied blur is…

Computer Vision and Pattern Recognition · Computer Science 2025-10-01 Yiyang Wang , Xi Chen , Xiaogang Xu , Yu Liu , Hengshuang Zhao

DiffPano: Scalable and Consistent Text to Panorama Generation with Spherical Epipolar-Aware Diffusion

Diffusion-based methods have achieved remarkable achievements in 2D image or 3D object generation, however, the generation of 3D scenes and even $360^{\circ}$ images remains constrained, due to the limited number of scene datasets, the…

Computer Vision and Pattern Recognition · Computer Science 2024-11-01 Weicai Ye , Chenhao Ji , Zheng Chen , Junyao Gao , Xiaoshui Huang , Song-Hai Zhang , Wanli Ouyang , Tong He , Cairong Zhao , Guofeng Zhang

RealmDreamer: Text-Driven 3D Scene Generation with Inpainting and Depth Diffusion

We introduce RealmDreamer, a technique for generating forward-facing 3D scenes from text descriptions. Our method optimizes a 3D Gaussian Splatting representation to match complex text prompts using pretrained diffusion models. Our key…

Computer Vision and Pattern Recognition · Computer Science 2025-03-12 Jaidev Shriram , Alex Trevithick , Lingjie Liu , Ravi Ramamoorthi

Enhancing Monocular 3D Scene Completion with Diffusion Model

3D scene reconstruction is essential for applications in virtual reality, robotics, and autonomous driving, enabling machines to understand and interact with complex environments. Traditional 3D Gaussian Splatting techniques rely on images…

Graphics · Computer Science 2025-03-04 Changlin Song , Jiaqi Wang , Liyun Zhu , He Weng

ViewFusion: Towards Multi-View Consistency via Interpolated Denoising

Novel-view synthesis through diffusion models has demonstrated remarkable potential for generating diverse and high-quality images. Yet, the independent process of image generation in these prevailing methods leads to challenges in…

Computer Vision and Pattern Recognition · Computer Science 2024-03-01 Xianghui Yang , Yan Zuo , Sameera Ramasinghe , Loris Bazzani , Gil Avraham , Anton van den Hengel

Diffusion Models for Monocular Depth Estimation: Overcoming Challenging Conditions

We present a novel approach designed to address the complexities posed by challenging, out-of-distribution data in the single-image depth estimation task. Starting with images that facilitate depth prediction due to the absence of…

Computer Vision and Pattern Recognition · Computer Science 2024-07-24 Fabio Tosi , Pierluigi Zama Ramirez , Matteo Poggi

DreamComposer: Controllable 3D Object Generation via Multi-View Conditions

Utilizing pre-trained 2D large-scale generative models, recent works are capable of generating high-quality novel views from a single in-the-wild image. However, due to the lack of information from multiple views, these works encounter…

Computer Vision and Pattern Recognition · Computer Science 2024-03-27 Yunhan Yang , Yukun Huang , Xiaoyang Wu , Yuan-Chen Guo , Song-Hai Zhang , Hengshuang Zhao , Tong He , Xihui Liu

SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

In this paper, we present a novel diffusion model called that generates multiview-consistent images from a single-view image. Using pretrained large-scale 2D diffusion models, recent work Zero123 demonstrates the ability to generate…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Yuan Liu , Cheng Lin , Zijiao Zeng , Xiaoxiao Long , Lingjie Liu , Taku Komura , Wenping Wang

VistaDream: Sampling multiview consistent images for single-view scene reconstruction

In this paper, we propose VistaDream a novel framework to reconstruct a 3D scene from a single-view image. Recent diffusion models enable generating high-quality novel-view images from a single-view input image. Most existing methods only…

Computer Vision and Pattern Recognition · Computer Science 2024-10-23 Haiping Wang , Yuan Liu , Ziwei Liu , Wenping Wang , Zhen Dong , Bisheng Yang

ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models

Generating novel views of an object from a single image is a challenging task. It requires an understanding of the underlying 3D structure of the object from an image and rendering high-quality, spatially consistent new views. While recent…

Computer Vision and Pattern Recognition · Computer Science 2023-12-05 Jeong-gi Kwak , Erqun Dong , Yuhe Jin , Hanseok Ko , Shweta Mahajan , Kwang Moo Yi

ConsistentDreamer: View-Consistent Meshes Through Balanced Multi-View Gaussian Optimization

Recent advances in diffusion models have significantly improved 3D generation, enabling the use of assets generated from an image for embodied AI simulations. However, the one-to-many nature of the image-to-3D problem limits their use due…

Computer Vision and Pattern Recognition · Computer Science 2025-02-26 Onat Şahin , Mohammad Altillawi , George Eskandar , Carlos Carbone , Ziyuan Liu

Denoising Diffusion via Image-Based Rendering

Generating 3D scenes is a challenging open problem, which requires synthesizing plausible content that is fully consistent in 3D space. While recent methods such as neural radiance fields excel at view synthesis and 3D reconstruction, they…

Computer Vision and Pattern Recognition · Computer Science 2024-02-22 Titas Anciukevičius , Fabian Manhardt , Federico Tombari , Paul Henderson