Related papers: Consistent View Synthesis with Pose-Guided Diffusi…

ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models

Generating novel views of an object from a single image is a challenging task. It requires an understanding of the underlying 3D structure of the object from an image and rendering high-quality, spatially consistent new views. While recent…

Computer Vision and Pattern Recognition · Computer Science 2023-12-05 Jeong-gi Kwak , Erqun Dong , Yuhe Jin , Hanseok Ko , Shweta Mahajan , Kwang Moo Yi

Long-Term Photometric Consistent Novel View Synthesis with Diffusion Models

Novel view synthesis from a single input image is a challenging task, where the goal is to generate a new view of a scene from a desired camera pose that may be separated by a large motion. The highly uncertain nature of this synthesis task…

Computer Vision and Pattern Recognition · Computer Science 2023-08-23 Jason J. Yu , Fereshteh Forghani , Konstantinos G. Derpanis , Marcus A. Brubaker

WAVE: Warp-Based View Guidance for Consistent Novel View Synthesis Using a Single Image

Generating high-quality novel views of a scene from a single image requires maintaining structural coherence across different views, referred to as view consistency. While diffusion models have driven advancements in novel view synthesis,…

Computer Vision and Pattern Recognition · Computer Science 2025-08-07 Jiwoo Park , Tae Eun Choi , Youngjun Jun , Seong Jae Hwang

Diffusion Priors for Dynamic View Synthesis from Monocular Videos

Dynamic novel view synthesis aims to capture the temporal evolution of visual content within videos. Existing methods struggle to distinguishing between motion and structure, particularly in scenarios where camera poses are either unknown…

Computer Vision and Pattern Recognition · Computer Science 2024-01-12 Chaoyang Wang , Peiye Zhuang , Aliaksandr Siarohin , Junli Cao , Guocheng Qian , Hsin-Ying Lee , Sergey Tulyakov

Generative Novel View Synthesis with 3D-Aware Diffusion Models

We present a diffusion-based model for 3D-aware generative novel view synthesis from as few as a single input image. Our model samples from the distribution of possible renderings consistent with the input and, even in the presence of…

Computer Vision and Pattern Recognition · Computer Science 2023-04-06 Eric R. Chan , Koki Nagano , Matthew A. Chan , Alexander W. Bergman , Jeong Joon Park , Axel Levy , Miika Aittala , Shalini De Mello , Tero Karras , Gordon Wetzstein

Novel View Synthesis with Pixel-Space Diffusion Models

Synthesizing a novel view from a single input image is a challenging task. Traditionally, this task was approached by estimating scene depth, warping, and inpainting, with machine learning models enabling parts of the pipeline. More…

Computer Vision and Pattern Recognition · Computer Science 2024-11-13 Noam Elata , Bahjat Kawar , Yaron Ostrovsky-Berman , Miriam Farber , Ron Sokolovsky

Geometric Consistency Refinement for Single Image Novel View Synthesis via Test-Time Adaptation of Diffusion Models

Diffusion models for single image novel view synthesis (NVS) can generate highly realistic and plausible images, but they are limited in the geometric consistency to the given relative poses. The generated images often show significant…

Computer Vision and Pattern Recognition · Computer Science 2025-04-14 Josef Bengtson , David Nilsson , Fredrik Kahl

Synthesizing Consistent Novel Views via 3D Epipolar Attention without Re-Training

Large diffusion models demonstrate remarkable zero-shot capabilities in novel view synthesis from a single image. However, these models often face challenges in maintaining consistency across novel and reference views. A crucial factor…

Computer Vision and Pattern Recognition · Computer Science 2025-02-26 Botao Ye , Sifei Liu , Xueting Li , Marc Pollefeys , Ming-Hsuan Yang

Look Beyond: Two-Stage Scene View Generation via Panorama and Video Diffusion

Novel view synthesis (NVS) from a single image is highly ill-posed due to large unobserved regions, especially for views that deviate significantly from the input. While existing methods focus on consistency between the source and generated…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Xueyang Kang , Zhengkang Xiang , Zezheng Zhang , Kourosh Khoshelham

Stable Virtual Camera: Generative View Synthesis with Diffusion Models

We present Stable Virtual Camera (Seva), a generalist diffusion model that creates novel views of a scene, given any number of input views and target cameras. Existing works struggle to generate either large viewpoint changes or temporally…

Computer Vision and Pattern Recognition · Computer Science 2025-04-03 Jensen Zhou , Hang Gao , Vikram Voleti , Aaryaman Vasishta , Chun-Han Yao , Mark Boss , Philip Torr , Christian Rupprecht , Varun Jampani

ViewFusion: Towards Multi-View Consistency via Interpolated Denoising

Novel-view synthesis through diffusion models has demonstrated remarkable potential for generating diverse and high-quality images. Yet, the independent process of image generation in these prevailing methods leads to challenges in…

Computer Vision and Pattern Recognition · Computer Science 2024-03-01 Xianghui Yang , Yan Zuo , Sameera Ramasinghe , Loris Bazzani , Gil Avraham , Anton van den Hengel

Pointmap-Conditioned Diffusion for Consistent Novel View Synthesis

Synthesizing extrapolated views remains a difficult task, especially in urban driving scenes, where the only reliable sources of data are limited RGB captures and sparse LiDAR points. To address this problem, we present PointmapDiff, a…

Computer Vision and Pattern Recognition · Computer Science 2025-12-25 Thang-Anh-Quan Nguyen , Nathan Piasco , Luis Roldão , Moussab Bennehar , Dzmitry Tsishkou , Laurent Caraffa , Jean-Philippe Tarel , Roland Brémond

3D-free meets 3D priors: Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance

Recent 3D novel view synthesis (NVS) methods often require extensive 3D data for training, and also typically lack generalization beyond the training distribution. Moreover, they tend to be object centric and struggle with complex and…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Taewon Kang , Divya Kothandaraman , Dinesh Manocha , Ming C. Lin

ViewCrafter: Taming Video Diffusion Models for High-fidelity Novel View Synthesis

Despite recent advancements in neural 3D reconstruction, the dependence on dense multi-view captures restricts their broader applicability. In this work, we propose \textbf{ViewCrafter}, a novel method for synthesizing high-fidelity novel…

Computer Vision and Pattern Recognition · Computer Science 2024-09-04 Wangbo Yu , Jinbo Xing , Li Yuan , Wenbo Hu , Xiaoyu Li , Zhipeng Huang , Xiangjun Gao , Tien-Tsin Wong , Ying Shan , Yonghong Tian

Projected Representation Conditioning for High-fidelity Novel View Synthesis

We propose a novel framework for diffusion-based novel view synthesis in which we leverage external representations as conditions, harnessing their geometric and semantic correspondence properties for enhanced geometric consistency in…

Computer Vision and Pattern Recognition · Computer Science 2026-02-13 Min-Seop Kwak , Minkyung Kwon , Jinhyeok Choi , Jiho Park , Seungryong Kim

Aligned Novel View Image and Geometry Synthesis via Cross-modal Attention Instillation

We introduce a diffusion-based framework that performs aligned novel view image and geometry generation via a warping-and-inpainting methodology. Unlike prior methods that require dense posed images or pose-embedded generative models…

Computer Vision and Pattern Recognition · Computer Science 2026-02-09 Min-Seop Kwak , Junho Kim , Sangdoo Yun , Dongyoon Han , Taekyung Kim , Seungryong Kim , Jin-Hwa Kim

CloseUpShot: Close-up Novel View Synthesis from Sparse-views via Point-conditioned Diffusion Model

Reconstructing 3D scenes and synthesizing novel views from sparse input views is a highly challenging task. Recent advances in video diffusion models have demonstrated strong temporal reasoning capabilities, making them a promising tool for…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Yuqi Zhang , Guanying Chen , Jiaxing Chen , Chuanyu Fu , Chuan Huang , Shuguang Cui

Consistent Human Image and Video Generation with Spatially Conditioned Diffusion

Consistent human-centric image and video synthesis aims to generate images or videos with new poses while preserving appearance consistency with a given reference image, which is crucial for low-cost visual content creation. Recent advances…

Computer Vision and Pattern Recognition · Computer Science 2024-12-20 Mingdeng Cao , Chong Mou , Ziyang Yuan , Xintao Wang , Zhaoyang Zhang , Ying Shan , Yinqiang Zheng

DiffPortrait360: Consistent Portrait Diffusion for 360 View Synthesis

Generating high-quality 360-degree views of human heads from single-view images is essential for enabling accessible immersive telepresence applications and scalable personalized content creation. While cutting-edge methods for full head…

Computer Vision and Pattern Recognition · Computer Science 2025-03-21 Yuming Gu , Phong Tran , Yujian Zheng , Hongyi Xu , Heyuan Li , Adilbek Karmanov , Hao Li

DT-NVS: Diffusion Transformers for Novel View Synthesis

Generating novel views of a natural scene, e.g., every-day scenes both indoors and outdoors, from a single view is an under-explored problem, even though it is an organic extension to the object-centric novel view synthesis. Existing…

Computer Vision and Pattern Recognition · Computer Science 2025-11-13 Wonbong Jang , Jonathan Tremblay , Lourdes Agapito