Related papers: ConsistentDreamer: View-Consistent Meshes Through …

SweetDreamer: Aligning Geometric Priors in 2D Diffusion for Consistent Text-to-3D

It is inherently ambiguous to lift 2D results from pre-trained diffusion models to a 3D world for text-to-3D generation. 2D diffusion models solely learn view-agnostic priors and thus lack 3D knowledge during the lifting, leading to the…

Computer Vision and Pattern Recognition · Computer Science 2023-10-23 Weiyu Li , Rui Chen , Xuelin Chen , Ping Tan

An Optimization Framework to Enforce Multi-View Consistency for Texturing 3D Meshes

A fundamental problem in the texturing of 3D meshes using pre-trained text-to-image models is to ensure multi-view consistency. State-of-the-art approaches typically use diffusion models to aggregate multi-view inputs, where common issues…

Computer Vision and Pattern Recognition · Computer Science 2024-08-05 Zhengyi Zhao , Chen Song , Xiaodong Gu , Yuan Dong , Qi Zuo , Weihao Yuan , Liefeng Bo , Zilong Dong , Qixing Huang

SyncDreamer: Generating Multiview-consistent Images from a Single-view Image

In this paper, we present a novel diffusion model called that generates multiview-consistent images from a single-view image. Using pretrained large-scale 2D diffusion models, recent work Zero123 demonstrates the ability to generate…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Yuan Liu , Cheng Lin , Zijiao Zeng , Xiaoxiao Long , Lingjie Liu , Taku Komura , Wenping Wang

ConsistDreamer: 3D-Consistent 2D Diffusion for High-Fidelity Scene Editing

This paper proposes ConsistDreamer - a novel framework that lifts 2D diffusion models with 3D awareness and 3D consistency, thus enabling high-fidelity instruction-guided scene editing. To overcome the fundamental limitation of missing 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 Jun-Kun Chen , Samuel Rota Bulò , Norman Müller , Lorenzo Porzi , Peter Kontschieder , Yu-Xiong Wang

ConsDreamer: Advancing Multi-View Consistency for Zero-Shot Text-to-3D Generation

Recent advances in zero-shot text-to-3D generation have revolutionized 3D content creation by enabling direct synthesis from textual descriptions. While state-of-the-art methods leverage 3D Gaussian Splatting with score distillation to…

Computer Vision and Pattern Recognition · Computer Science 2026-04-28 Yuan Zhou , Shilong Jin , Litao Hua , Wanjun Lv , Haoran Duan , Jungong Han

DisCo3D: Distilling Multi-View Consistency for 3D Scene Editing

While diffusion models have demonstrated remarkable progress in 2D image generation and editing, extending these capabilities to 3D editing remains challenging, particularly in maintaining multi-view consistency. Classical approaches…

Computer Vision and Pattern Recognition · Computer Science 2025-08-05 Yufeng Chi , Huimin Ma , Kafeng Wang , Jianmin Li

Consistent123: Improve Consistency for One Image to 3D Object Synthesis

Large image diffusion models enable novel view synthesis with high quality and excellent zero-shot capability. However, such models based on image-to-image translation have no guarantee of view consistency, limiting the performance for…

Computer Vision and Pattern Recognition · Computer Science 2023-10-13 Haohan Weng , Tianyu Yang , Jianan Wang , Yu Li , Tong Zhang , C. L. Philip Chen , Lei Zhang

TexPainter: Generative Mesh Texturing with Multi-view Consistency

The recent success of pre-trained diffusion models unlocks the possibility of the automatic generation of textures for arbitrary 3D meshes in the wild. However, these models are trained in the screen space, while converting them to a…

Computer Vision and Pattern Recognition · Computer Science 2024-06-28 Hongkun Zhang , Zherong Pan , Congyi Zhang , Lifeng Zhu , Xifeng Gao

VistaDream: Sampling multiview consistent images for single-view scene reconstruction

In this paper, we propose VistaDream a novel framework to reconstruct a 3D scene from a single-view image. Recent diffusion models enable generating high-quality novel-view images from a single-view input image. Most existing methods only…

Computer Vision and Pattern Recognition · Computer Science 2024-10-23 Haiping Wang , Yuan Liu , Ziwei Liu , Wenping Wang , Zhen Dong , Bisheng Yang

3DGS-Enhancer: Enhancing Unbounded 3D Gaussian Splatting with View-consistent 2D Diffusion Priors

Novel-view synthesis aims to generate novel views of a scene from multiple input images or videos, and recent advancements like 3D Gaussian splatting (3DGS) have achieved notable success in producing photorealistic renderings with efficient…

Computer Vision and Pattern Recognition · Computer Science 2024-10-22 Xi Liu , Chaoyi Zhou , Siyu Huang

Sparse3D: Distilling Multiview-Consistent Diffusion for Object Reconstruction from Sparse Views

Reconstructing 3D objects from extremely sparse views is a long-standing and challenging problem. While recent techniques employ image diffusion models for generating plausible images at novel viewpoints or for distilling pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Zi-Xin Zou , Weihao Cheng , Yan-Pei Cao , Shi-Sheng Huang , Ying Shan , Song-Hai Zhang

GeoDream: Disentangling 2D and Geometric Priors for High-Fidelity and Consistent 3D Generation

Text-to-3D generation by distilling pretrained large-scale text-to-image diffusion models has shown great promise but still suffers from inconsistent 3D geometric structures (Janus problems) and severe artifacts. The aforementioned problems…

Computer Vision and Pattern Recognition · Computer Science 2023-12-05 Baorui Ma , Haoge Deng , Junsheng Zhou , Yu-Shen Liu , Tiejun Huang , Xinlong Wang

View-Consistent Diffusion Representations for 3D-Consistent Video Generation

Video generation models have made significant progress in generating realistic content, enabling applications in simulation, gaming, and film making. However, current generated videos still contain visual artifacts arising from 3D…

Computer Vision and Pattern Recognition · Computer Science 2025-11-25 Duolikun Danier , Ge Gao , Steven McDonagh , Changjian Li , Hakan Bilen , Oisin Mac Aodha

StableDreamer: Taming Noisy Score Distillation Sampling for Text-to-3D

In the realm of text-to-3D generation, utilizing 2D diffusion models through score distillation sampling (SDS) frequently leads to issues such as blurred appearances and multi-faced geometry, primarily due to the intrinsically noisy nature…

Computer Vision and Pattern Recognition · Computer Science 2023-12-06 Pengsheng Guo , Hans Hao , Adam Caccavale , Zhongzheng Ren , Edward Zhang , Qi Shan , Aditya Sankar , Alexander G. Schwing , Alex Colburn , Fangchang Ma

Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors

3D object generation from a single image involves estimating the full 3D geometry and texture of unseen views from an unposed RGB image captured in the wild. Accurately reconstructing an object's complete 3D structure and texture has…

Computer Vision and Pattern Recognition · Computer Science 2024-11-21 Hritam Basak , Hadi Tabatabaee , Shreekant Gayaka , Ming-Feng Li , Xin Yang , Cheng-Hao Kuo , Arnie Sen , Min Sun , Zhaozheng Yin

GaussVideoDreamer: 3D Scene Generation with Video Diffusion and Inconsistency-Aware Gaussian Splatting

Single-image 3D scene reconstruction presents significant challenges due to its inherently ill-posed nature and limited input constraints. Recent advances have explored two promising directions: multiview generative models that train on 3D…

Computer Vision and Pattern Recognition · Computer Science 2025-04-17 Junlin Hao , Peiheng Wang , Haoyang Wang , Xinggong Zhang , Zongming Guo

Wonder3D: Single Image to 3D using Cross-Domain Diffusion

In this work, we introduce Wonder3D, a novel method for efficiently generating high-fidelity textured meshes from single-view images.Recent methods based on Score Distillation Sampling (SDS) have shown the potential to recover 3D geometry…

Computer Vision and Pattern Recognition · Computer Science 2023-11-10 Xiaoxiao Long , Yuan-Chen Guo , Cheng Lin , Yuan Liu , Zhiyang Dou , Lingjie Liu , Yuexin Ma , Song-Hai Zhang , Marc Habermann , Christian Theobalt , Wenping Wang

MVDream: Multi-view Diffusion for 3D Generation

We introduce MVDream, a diffusion model that is able to generate consistent multi-view images from a given text prompt. Learning from both 2D and 3D data, a multi-view diffusion model can achieve the generalizability of 2D diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-04-19 Yichun Shi , Peng Wang , Jianglong Ye , Mai Long , Kejie Li , Xiao Yang

3DEnhancer: Consistent Multi-View Diffusion for 3D Enhancement

Despite advances in neural rendering, due to the scarcity of high-quality 3D datasets and the inherent limitations of multi-view diffusion models, view synthesis and 3D model generation are restricted to low resolutions with suboptimal…

Computer Vision and Pattern Recognition · Computer Science 2025-04-30 Yihang Luo , Shangchen Zhou , Yushi Lan , Xingang Pan , Chen Change Loy

3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models

Text-guided diffusion models have shown superior performance in image/video generation and editing. While few explorations have been performed in 3D scenarios. In this paper, we discuss three fundamental and interesting problems on this…

Computer Vision and Pattern Recognition · Computer Science 2023-10-13 Gang Li , Heliang Zheng , Chaoyue Wang , Chang Li , Changwen Zheng , Dacheng Tao