Related papers: VaLID: Variable-Length Input Diffusion for Novel V…

DT-NVS: Diffusion Transformers for Novel View Synthesis

Generating novel views of a natural scene, e.g., every-day scenes both indoors and outdoors, from a single view is an under-explored problem, even though it is an organic extension to the object-centric novel view synthesis. Existing…

Computer Vision and Pattern Recognition · Computer Science 2025-11-13 Wonbong Jang , Jonathan Tremblay , Lourdes Agapito

Look Beyond: Two-Stage Scene View Generation via Panorama and Video Diffusion

Novel view synthesis (NVS) from a single image is highly ill-posed due to large unobserved regions, especially for views that deviate significantly from the input. While existing methods focus on consistency between the source and generated…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Xueyang Kang , Zhengkang Xiang , Zezheng Zhang , Kourosh Khoshelham

Novel View Synthesis as Video Completion

We tackle the problem of sparse novel view synthesis (NVS) using video diffusion models; given $K$ ($\approx 5$) multi-view images of a scene and their camera poses, we predict the view from a target camera pose. Many prior approaches…

Computer Vision and Pattern Recognition · Computer Science 2026-04-10 Qi Wu , Khiem Vuong , Minsik Jeon , Srinivasa Narasimhan , Deva Ramanan

One-Shot Refiner: Boosting Feed-forward Novel View Synthesis via One-Step Diffusion

We present a novel framework for high-fidelity novel view synthesis (NVS) from sparse images, addressing key limitations in recent feed-forward 3D Gaussian Splatting (3DGS) methods built on Vision Transformer (ViT) backbones. While…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Yitong Dong , Qi Zhang , Minchao Jiang , Zhiqiang Wu , Qingnan Fan , Ying Feng , Huaqi Zhang , Hujun Bao , Guofeng Zhang

3D-free meets 3D priors: Novel View Synthesis from a Single Image with Pretrained Diffusion Guidance

Recent 3D novel view synthesis (NVS) methods often require extensive 3D data for training, and also typically lack generalization beyond the training distribution. Moreover, they tend to be object centric and struggle with complex and…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Taewon Kang , Divya Kothandaraman , Dinesh Manocha , Ming C. Lin

Novel View Synthesis with Pixel-Space Diffusion Models

Synthesizing a novel view from a single input image is a challenging task. Traditionally, this task was approached by estimating scene depth, warping, and inpainting, with machine learning models enabling parts of the pipeline. More…

Computer Vision and Pattern Recognition · Computer Science 2024-11-13 Noam Elata , Bahjat Kawar , Yaron Ostrovsky-Berman , Miriam Farber , Ron Sokolovsky

ViVid-1-to-3: Novel View Synthesis with Video Diffusion Models

Generating novel views of an object from a single image is a challenging task. It requires an understanding of the underlying 3D structure of the object from an image and rendering high-quality, spatially consistent new views. While recent…

Computer Vision and Pattern Recognition · Computer Science 2023-12-05 Jeong-gi Kwak , Erqun Dong , Yuhe Jin , Hanseok Ko , Shweta Mahajan , Kwang Moo Yi

SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion

We present Stable Video 3D (SV3D) -- a latent video diffusion model for high-resolution, image-to-multi-view generation of orbital videos around a 3D object. Recent work on 3D generation propose techniques to adapt 2D generative models for…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Vikram Voleti , Chun-Han Yao , Mark Boss , Adam Letts , David Pankratz , Dmitry Tochilkin , Christian Laforte , Robin Rombach , Varun Jampani

SGD: Street View Synthesis with Gaussian Splatting and Diffusion Prior

Novel View Synthesis (NVS) for street scenes play a critical role in the autonomous driving simulation. The current mainstream technique to achieve it is neural rendering, such as Neural Radiance Fields (NeRF) and 3D Gaussian Splatting…

Computer Vision and Pattern Recognition · Computer Science 2024-04-01 Zhongrui Yu , Haoran Wang , Jinze Yang , Hanzhang Wang , Zeke Xie , Yunfeng Cai , Jiale Cao , Zhong Ji , Mingming Sun

NVS-Solver: Video Diffusion Model as Zero-Shot Novel View Synthesizer

By harnessing the potent generative capabilities of pre-trained large video diffusion models, we propose NVS-Solver, a new novel view synthesis (NVS) paradigm that operates \textit{without} the need for training. NVS-Solver adaptively…

Computer Vision and Pattern Recognition · Computer Science 2025-04-03 Meng You , Zhiyu Zhu , Hui Liu , Junhui Hou

OrbitNVS: Harnessing Video Diffusion Priors for Novel View Synthesis

Novel View Synthesis (NVS) aims to generate unseen views of a 3D object given a limited number of known views. Existing methods often struggle to synthesize plausible views for unobserved regions, particularly under single-view input, and…

Computer Vision and Pattern Recognition · Computer Science 2026-03-23 Jinglin Liang , Zijian Zhou , Rui Huang , Shuangping Huang , Yichen Gong

Light Field Diffusion for Single-View Novel View Synthesis

Single-view novel view synthesis (NVS), the task of generating images from new viewpoints based on a single reference image, is important but challenging in computer vision. Recent advancements in NVS have leveraged Denoising Diffusion…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Yifeng Xiong , Haoyu Ma , Shanlin Sun , Kun Han , Hao Tang , Xiaohui Xie

Consistent-1-to-3: Consistent Image to 3D View Synthesis via Geometry-aware Diffusion Models

Zero-shot novel view synthesis (NVS) from a single image is an essential problem in 3D object understanding. While recent approaches that leverage pre-trained generative models can synthesize high-quality novel views from in-the-wild…

Computer Vision and Pattern Recognition · Computer Science 2024-03-18 Jianglong Ye , Peng Wang , Kejie Li , Yichun Shi , Heng Wang

UMAMI: Unifying Masked Autoregressive Models and Deterministic Rendering for View Synthesis

Novel view synthesis (NVS) seeks to render photorealistic, 3D-consistent images of a scene from unseen camera poses given only a sparse set of posed views. Existing deterministic networks render observed regions quickly but blur unobserved…

Computer Vision and Pattern Recognition · Computer Science 2025-12-24 Thanh-Tung Le , Tuan Pham , Tung Nguyen , Deying Kong , Xiaohui Xie , Stephan Mandt

VisionNVS: Self-Supervised Inpainting for Novel View Synthesis under the Virtual-Shift Paradigm

A fundamental bottleneck in Novel View Synthesis (NVS) for autonomous driving is the inherent supervision gap on novel trajectories: models are tasked with synthesizing unseen views during inference, yet lack ground truth images for these…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Hongbo Lu , Liang Yao , Chenghao He , Fan Liu , Wenlong Liao , Tao He , Pai Peng

Novel View Synthesis from a Single RGBD Image for Indoor Scenes

In this paper, we propose an approach for synthesizing novel view images from a single RGBD (Red Green Blue-Depth) input. Novel view synthesis (NVS) is an interesting computer vision task with extensive applications. Methods using multiple…

Computer Vision and Pattern Recognition · Computer Science 2023-11-03 Congrui Hetang , Yuping Wang

MOVIS: Enhancing Multi-Object Novel View Synthesis for Indoor Scenes

Repurposing pre-trained diffusion models has been proven to be effective for NVS. However, these methods are mostly limited to a single object; directly applying such methods to compositional multi-object scenarios yields inferior results,…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Ruijie Lu , Yixin Chen , Junfeng Ni , Baoxiong Jia , Yu Liu , Diwen Wan , Gang Zeng , Siyuan Huang

EpipolarNVS: leveraging on Epipolar geometry for single-image Novel View Synthesis

Novel-view synthesis (NVS) can be tackled through different approaches, depending on the general setting: a single source image to a short video sequence, exact or noisy camera pose information, 3D-based information such as point clouds…

Computer Vision and Pattern Recognition · Computer Science 2022-10-25 Gaétan Landreau , Mohamed Tamaazousti

iNVS: Repurposing Diffusion Inpainters for Novel View Synthesis

We present a method for generating consistent novel views from a single source image. Our approach focuses on maximizing the reuse of visible pixels from the source image. To achieve this, we use a monocular depth estimator that transfers…

Computer Vision and Pattern Recognition · Computer Science 2023-10-26 Yash Kant , Aliaksandr Siarohin , Michael Vasilkovsky , Riza Alp Guler , Jian Ren , Sergey Tulyakov , Igor Gilitschenski

WildCAT3D: Appearance-Aware Multi-View Diffusion in the Wild

Despite recent advances in sparse novel view synthesis (NVS) applied to object-centric scenes, scene-level NVS remains a challenge. A central issue is the lack of available clean multi-view training data, beyond manually curated datasets…

Computer Vision and Pattern Recognition · Computer Science 2025-11-04 Morris Alper , David Novotny , Filippos Kokkinos , Hadar Averbuch-Elor , Tom Monnier