Related papers: Generic 3D Diffusion Adapter Using Controlled Mult…

MVDream: Multi-view Diffusion for 3D Generation

We introduce MVDream, a diffusion model that is able to generate consistent multi-view images from a given text prompt. Learning from both 2D and 3D data, a multi-view diffusion model can achieve the generalizability of 2D diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-04-19 Yichun Shi, Peng Wang, Jianglong Ye, Mai Long, Kejie Li, Xiao Yang

MvDrag3D: Drag-based Creative 3D Editing via Multi-view Generation-Reconstruction Priors

Drag-based editing has become popular in 2D content creation, driven by the capabilities of image generative models. However, extending this technique to 3D remains a challenge. Existing 3D drag-based editing methods, whether employing…

Computer Vision and Pattern Recognition · Computer Science 2024-10-22 Honghua Chen, Yushi Lan, Yongwei Chen, Yifan Zhou, Xingang Pan

MVD$^2$: Efficient Multiview 3D Reconstruction for Multiview Diffusion

As a promising 3D generation technique, multiview diffusion (MVD) has received a lot of attention due to its advantages in terms of generalizability, quality, and efficiency. By finetuning pretrained large image diffusion models with 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-02-23 Xin-Yang Zheng, Hao Pan, Yu-Xiao Guo, Xin Tong, Yang Liu

MVDD: Multi-View Depth Diffusion Models

Denoising diffusion models have demonstrated outstanding results in 2D image generation, yet it remains a challenge to replicate their success in 3D shape generation. In this paper, we propose leveraging multi-view depth, which represents…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Zhen Wang, Qiangeng Xu, Feitong Tan, Menglei Chai, Shichen Liu, Rohit Pandey, Sean Fanello, Achuta Kadambi, Yinda Zhang

MVDiff: Scalable and Flexible Multi-View Diffusion for 3D Object Reconstruction from Single-View

Generating consistent multiple views for 3D reconstruction tasks is still a challenge for existing image-to-3D diffusion models. Generally, incorporating 3D representations into a diffusion model decreases the model's speed as well as…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 Emmanuelle Bourigault, Pauline Bourigault

DreamEdit3D: Personalization of Multi-View Diffusion Models for 3D Editing

While 2D diffusion models have achieved remarkable success in identity-preserving personalization, extending this capability to 3D assets remains a significant challenge due to the complexities of multi-view consistency and spatial control.…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Jinxin Ai, Matthias Nießner, Ziya Erkoç

VMDiff: Visual Mixing Diffusion for Limitless Cross-Object Synthesis

Creating novel images by fusing visual cues from multiple sources is a fundamental yet underexplored problem in image-to-image generation, with broad applications in artistic creation, virtual reality and visual media. Existing methods…

Computer Vision and Pattern Recognition · Computer Science 2025-09-30 Zeren Xiong, Yue Yu, Zedong Zhang, Shuo Chen, Jian Yang, Jun Li

Fast Multi-view Consistent 3D Editing with Video Priors

Text-driven 3D editing enables user-friendly 3D object or scene editing with text instructions. Due to the lack of multi-view consistency priors, existing methods typically resort to employing 2D generation or editing models to process each…

Computer Vision and Pattern Recognition · Computer Science 2025-12-02 Liyi Chen, Ruihuang Li, Guowen Zhang, Pengfei Wang, Lei Zhang

MVDiffusion++: A Dense High-resolution Multi-view Diffusion Model for Single or Sparse-view 3D Object Reconstruction

This paper presents a neural architecture MVDiffusion++ for 3D object reconstruction that synthesizes dense and high-resolution views of an object given one or a few images without camera poses. MVDiffusion++ achieves superior flexibility…

Computer Vision and Pattern Recognition · Computer Science 2024-05-01 Shitao Tang, Jiacheng Chen, Dilin Wang, Chengzhou Tang, Fuyang Zhang, Yuchen Fan, Vikas Chandra, Yasutaka Furukawa, Rakesh Ranjan

3D-Adapter: Geometry-Consistent Multi-View Diffusion for High-Quality 3D Generation

Multi-view image diffusion models have significantly advanced open-domain 3D object generation. However, most existing models rely on 2D network architectures that lack inherent 3D biases, resulting in compromised geometric consistency. To…

Computer Vision and Pattern Recognition · Computer Science 2025-02-21 Hansheng Chen, Bokui Shen, Yulin Liu, Ruoxi Shi, Linqi Zhou, Connor Z. Lin, Jiayuan Gu, Hao Su, Gordon Wetzstein, Leonidas Guibas

SV3D: Novel Multi-view Synthesis and 3D Generation from a Single Image using Latent Video Diffusion

We present Stable Video 3D (SV3D) -- a latent video diffusion model for high-resolution, image-to-multi-view generation of orbital videos around a 3D object. Recent work on 3D generation proposes techniques to adapt 2D generative models for…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Vikram Voleti, Chun-Han Yao, Mark Boss, Adam Letts, David Pankratz, Dmitry Tochilkin, Christian Laforte, Robin Rombach, Varun Jampani

V3D: Video Diffusion Models are Effective 3D Generators

Automatic 3D generation has recently attracted widespread attention. Recent methods have greatly accelerated the generation speed, but usually produce less-detailed objects due to limited model capacity or 3D data. Motivated by recent…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Zilong Chen, Yikai Wang, Feng Wang, Zhengyi Wang, Huaping Liu

MV-Adapter: Multi-view Consistent Image Generation Made Easy

Existing multi-view image generation methods often make invasive modifications to pre-trained text-to-image (T2I) models and require full fine-tuning, leading to (1) high computational costs, especially with large base models and…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Zehuan Huang, Yuan-Chen Guo, Haoran Wang, Ran Yi, Lizhuang Ma, Yan-Pei Cao, Lu Sheng

MVHuman: Tailoring 2D Diffusion with Multi-view Sampling For Realistic 3D Human Generation

Recent months have witnessed rapid progress in 3D generation based on diffusion models. Most advances require fine-tuning existing 2D Stable Diffusions into multi-view settings or tedious distilling operations and hence fall short of 3D…

Computer Vision and Pattern Recognition · Computer Science 2023-12-19 Suyi Jiang, Haimin Luo, Haoran Jiang, Ziyu Wang, Jingyi Yu, Lan Xu

MVLight: Relightable Text-to-3D Generation via Light-conditioned Multi-View Diffusion

Recent advancements in text-to-3D generation, building on the success of high-performance text-to-image generative models, have made it possible to create imaginative and richly textured 3D objects from textual descriptions. However, a key…

Computer Vision and Pattern Recognition · Computer Science 2024-11-19 Dongseok Shim, Yichun Shi, Kejie Li, H. Jin Kim, Peng Wang

3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models

Text-guided diffusion models have shown superior performance in image/video generation and editing, while few explorations have been performed in 3D scenarios. In this paper, we discuss three fundamental and interesting problems on this…

Computer Vision and Pattern Recognition · Computer Science 2023-10-13 Gang Li, Heliang Zheng, Chaoyue Wang, Chang Li, Changwen Zheng, Dacheng Tao

View-Consistent 3D Editing with Gaussian Splatting

The advent of 3D Gaussian Splatting (3DGS) has revolutionized 3D editing, offering efficient, high-fidelity rendering and enabling precise local manipulations. Currently, diffusion-based 2D editing models are harnessed to modify multi-view…

Graphics · Computer Science 2025-02-18 Yuxuan Wang, Xuanyu Yi, Zike Wu, Na Zhao, Long Chen, Hanwang Zhang

Animate3D: Animating Any 3D Model with Multi-view Video Diffusion

Recent advances in 4D generation mainly focus on generating 4D content by distilling pre-trained text or single-view image-conditioned models. It is inconvenient for them to take advantage of various off-the-shelf 3D assets with multi-view…

Computer Vision and Pattern Recognition · Computer Science 2024-09-10 Yanqin Jiang, Chaohui Yu, Chenjie Cao, Fan Wang, Weiming Hu, Jin Gao

Make Your MoVe: Make Your 3D Contents by Adapting Multi-View Diffusion Models to External Editing

As 3D generation techniques continue to flourish, the demand for generating personalized content is rapidly rising. Users increasingly seek to apply various editing methods to polish generated 3D content, aiming to enhance its color, style,…

Computer Vision and Pattern Recognition · Computer Science 2025-08-12 Weitao Wang, Haoran Xu, Jun Meng, Haoqian Wang

DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing

While diffusion models have demonstrated remarkable progress in 2D image generation and editing, extending these capabilities to 3D editing remains challenging, particularly in maintaining multi-view consistency. Classical approaches…

Computer Vision and Pattern Recognition · Computer Science 2025-08-05 Yufeng Chi, Huimin Ma, Kafeng Wang, Jianmin Li