Related papers: MVDD: Multi-View Depth Diffusion Models

MVD$^2$: Efficient Multiview 3D Reconstruction for Multiview Diffusion

As a promising 3D generation technique, multiview diffusion (MVD) has received a lot of attention due to its advantages in terms of generalizability, quality, and efficiency. By finetuning pretrained large image diffusion models with 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-02-23 Xin-Yang Zheng , Hao Pan , Yu-Xiao Guo , Xin Tong , Yang Liu

MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object Reconstruction from Single-View

Generating consistent multiple views for 3D reconstruction tasks is still a challenge to existing image-to-3D diffusion models. Generally, incorporating 3D representations into diffusion model decrease the model's speed as well as…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 Emmanuelle Bourigault , Pauline Bourigault

MVDream: Multi-view Diffusion for 3D Generation

We introduce MVDream, a diffusion model that is able to generate consistent multi-view images from a given text prompt. Learning from both 2D and 3D data, a multi-view diffusion model can achieve the generalizability of 2D diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-04-19 Yichun Shi , Peng Wang , Jianglong Ye , Mai Long , Kejie Li , Xiao Yang

MVD-Fusion: Single-view 3D via Depth-consistent Multi-view Generation

We present MVD-Fusion: a method for single-view 3D inference via generative modeling of multi-view-consistent RGB-D images. While recent methods pursuing 3D inference advocate learning novel-view generative models, these generations are not…

Computer Vision and Pattern Recognition · Computer Science 2024-04-05 Hanzhe Hu , Zhizhuo Zhou , Varun Jampani , Shubham Tulsiani

Diffusion Models in 3D Vision: A Survey

In recent years, 3D vision has become a crucial field within computer vision, powering a wide range of applications such as autonomous driving, robotics, augmented reality, and medical imaging. This field relies on accurate perception,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Zhen Wang , Dongyuan Li , Yaozu Wu , Tianyu He , Jiang Bian , Renhe Jiang

3D Shape Generation and Completion through Point-Voxel Diffusion

We propose a novel approach for probabilistic generative modeling of 3D shapes. Unlike most existing models that learn to deterministically translate a latent vector to a shape, our model, Point-Voxel Diffusion (PVD), is a unified,…

Computer Vision and Pattern Recognition · Computer Science 2021-08-31 Linqi Zhou , Yilun Du , Jiajun Wu

DMV3D: Denoising Multi-View Diffusion using 3D Large Reconstruction Model

We propose \textbf{DMV3D}, a novel 3D generation approach that uses a transformer-based 3D large reconstruction model to denoise multi-view diffusion. Our reconstruction model incorporates a triplane NeRF representation and can denoise…

Computer Vision and Pattern Recognition · Computer Science 2023-11-16 Yinghao Xu , Hao Tan , Fujun Luan , Sai Bi , Peng Wang , Jiahao Li , Zifan Shi , Kalyan Sunkavalli , Gordon Wetzstein , Zexiang Xu , Kai Zhang

3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement

Despite advances in neural rendering, due to the scarcity of high-quality 3D datasets and the inherent limitations of multi-view diffusion models, view synthesis and 3D model generation are restricted to low resolutions with suboptimal…

Computer Vision and Pattern Recognition · Computer Science 2025-04-30 Yihang Luo , Shangchen Zhou , Yushi Lan , Xingang Pan , Chen Change Loy

GeoMVD: Geometry-Enhanced Multi-View Generation Model Based on Geometric Information Extraction

Multi-view image generation holds significant application value in computer vision, particularly in domains like 3D reconstruction, virtual reality, and augmented reality. Most existing methods, which rely on extending single images, face…

Computer Vision and Pattern Recognition · Computer Science 2025-11-20 Jiaqi Wu , Yaosen Chen , Shuyuan Zhu

TetraDiffusion: Tetrahedral Diffusion Models for 3D Shape Generation

Probabilistic denoising diffusion models (DDMs) have set a new standard for 2D image generation. Extending DDMs for 3D content creation is an active field of research. Here, we propose TetraDiffusion, a diffusion model that operates on a…

Computer Vision and Pattern Recognition · Computer Science 2024-08-12 Nikolai Kalischek , Torben Peters , Jan D. Wegner , Konrad Schindler

Generic 3D Diffusion Adapter Using Controlled Multi-View Editing

Open-domain 3D object synthesis has been lagging behind image synthesis due to limited data and higher computational complexity. To bridge this gap, recent works have investigated multi-view diffusion but often fall short in either 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-03-20 Hansheng Chen , Ruoxi Shi , Yulin Liu , Bokui Shen , Jiayuan Gu , Gordon Wetzstein , Hao Su , Leonidas Guibas

HoloFusion: Towards Photo-realistic 3D Generative Modeling

Diffusion-based image generators can now produce high-quality and diverse samples, but their success has yet to fully translate to 3D generation: existing diffusion methods can either generate low-resolution but 3D consistent outputs, or…

Computer Vision and Pattern Recognition · Computer Science 2023-08-29 Animesh Karnewar , Niloy J. Mitra , Andrea Vedaldi , David Novotny

CDI3D: Cross-guided Dense-view Interpolation for 3D Reconstruction

3D object reconstruction from single-view image is a fundamental task in computer vision with wide-ranging applications. Recent advancements in Large Reconstruction Models (LRMs) have shown great promise in leveraging multi-view images…

Computer Vision and Pattern Recognition · Computer Science 2025-03-13 Zhiyuan Wu , Xibin Song , Senbo Wang , Weizhe Liu , Jiayu Yang , Ziang Cheng , Shenzhou Chen , Taizhang Shang , Weixuan Sun , Shan Luo , Pan Ji

V3D: Video Diffusion Models are Effective 3D Generators

Automatic 3D generation has recently attracted widespread attention. Recent methods have greatly accelerated the generation speed, but usually produce less-detailed objects due to limited model capacity or 3D data. Motivated by recent…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Zilong Chen , Yikai Wang , Feng Wang , Zhengyi Wang , Huaping Liu

Multi-view PointNet for 3D Scene Understanding

Fusion of 2D images and 3D point clouds is important because information from dense images can enhance sparse point clouds. However, fusion is challenging because 2D and 3D data live in different spaces. In this work, we propose MVPNet…

Computer Vision and Pattern Recognition · Computer Science 2019-10-01 Maximilian Jaritz , Jiayuan Gu , Hao Su

MVDiffusion: Enabling Holistic Multi-view Image Generation with Correspondence-Aware Diffusion

This paper introduces MVDiffusion, a simple yet effective method for generating consistent multi-view images from text prompts given pixel-to-pixel correspondences (e.g., perspective crops from a panorama or multi-view images given depth…

Computer Vision and Pattern Recognition · Computer Science 2023-12-27 Shitao Tang , Fuyang Zhang , Jiacheng Chen , Peng Wang , Yasutaka Furukawa

Deformable 3D Shape Diffusion Model

The Gaussian diffusion model, initially designed for image generation, has recently been adapted for 3D point cloud generation. However, these adaptations have not fully considered the intrinsic geometric characteristics of 3D shapes,…

Graphics · Computer Science 2024-08-01 Dengsheng Chen , Jie Hu , Xiaoming Wei , Enhua Wu

MVGenMaster: Scaling Multi-View Generation from Any Image via 3D Priors Enhanced Diffusion Model

We introduce MVGenMaster, a multi-view diffusion model enhanced with 3D priors to address versatile Novel View Synthesis (NVS) tasks. MVGenMaster leverages 3D priors that are warped using metric depth and camera poses, significantly…

Computer Vision and Pattern Recognition · Computer Science 2025-03-07 Chenjie Cao , Chaohui Yu , Shang Liu , Fan Wang , Xiangyang Xue , Yanwei Fu

Vivid-ZOO: Multi-View Video Generation with Diffusion Model

While diffusion models have shown impressive performance in 2D image/video generation, diffusion-based Text-to-Multi-view-Video (T2MVid) generation remains underexplored. The new challenges posed by T2MVid generation lie in the lack of…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 Bing Li , Cheng Zheng , Wenxuan Zhu , Jinjie Mai , Biao Zhang , Peter Wonka , Bernard Ghanem

Neural Point Cloud Diffusion for Disentangled 3D Shape and Appearance Generation

Controllable generation of 3D assets is important for many practical applications like content creation in movies, games and engineering, as well as in AR/VR. Recently, diffusion models have shown remarkable results in generation quality of…

Computer Vision and Pattern Recognition · Computer Science 2024-08-01 Philipp Schröppel , Christopher Wewer , Jan Eric Lenssen , Eddy Ilg , Thomas Brox