English
Related papers

Related papers: ConsistentDreamer: View-Consistent Meshes Through …

200 papers

It is inherently ambiguous to lift 2D results from pre-trained diffusion models to a 3D world for text-to-3D generation. 2D diffusion models solely learn view-agnostic priors and thus lack 3D knowledge during the lifting, leading to the…

Computer Vision and Pattern Recognition · Computer Science 2023-10-23 Weiyu Li , Rui Chen , Xuelin Chen , Ping Tan

A fundamental problem in the texturing of 3D meshes using pre-trained text-to-image models is to ensure multi-view consistency. State-of-the-art approaches typically use diffusion models to aggregate multi-view inputs, where common issues…

Computer Vision and Pattern Recognition · Computer Science 2024-08-05 Zhengyi Zhao , Chen Song , Xiaodong Gu , Yuan Dong , Qi Zuo , Weihao Yuan , Liefeng Bo , Zilong Dong , Qixing Huang

In this paper, we present a novel diffusion model called that generates multiview-consistent images from a single-view image. Using pretrained large-scale 2D diffusion models, recent work Zero123 demonstrates the ability to generate…

Computer Vision and Pattern Recognition · Computer Science 2024-04-16 Yuan Liu , Cheng Lin , Zijiao Zeng , Xiaoxiao Long , Lingjie Liu , Taku Komura , Wenping Wang

This paper proposes ConsistDreamer - a novel framework that lifts 2D diffusion models with 3D awareness and 3D consistency, thus enabling high-fidelity instruction-guided scene editing. To overcome the fundamental limitation of missing 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-06-14 Jun-Kun Chen , Samuel Rota Bulò , Norman Müller , Lorenzo Porzi , Peter Kontschieder , Yu-Xiong Wang

Recent advances in zero-shot text-to-3D generation have revolutionized 3D content creation by enabling direct synthesis from textual descriptions. While state-of-the-art methods leverage 3D Gaussian Splatting with score distillation to…

Computer Vision and Pattern Recognition · Computer Science 2026-04-28 Yuan Zhou , Shilong Jin , Litao Hua , Wanjun Lv , Haoran Duan , Jungong Han

While diffusion models have demonstrated remarkable progress in 2D image generation and editing, extending these capabilities to 3D editing remains challenging, particularly in maintaining multi-view consistency. Classical approaches…

Computer Vision and Pattern Recognition · Computer Science 2025-08-05 Yufeng Chi , Huimin Ma , Kafeng Wang , Jianmin Li

Large image diffusion models enable novel view synthesis with high quality and excellent zero-shot capability. However, such models based on image-to-image translation have no guarantee of view consistency, limiting the performance for…

Computer Vision and Pattern Recognition · Computer Science 2023-10-13 Haohan Weng , Tianyu Yang , Jianan Wang , Yu Li , Tong Zhang , C. L. Philip Chen , Lei Zhang

The recent success of pre-trained diffusion models unlocks the possibility of the automatic generation of textures for arbitrary 3D meshes in the wild. However, these models are trained in the screen space, while converting them to a…

Computer Vision and Pattern Recognition · Computer Science 2024-06-28 Hongkun Zhang , Zherong Pan , Congyi Zhang , Lifeng Zhu , Xifeng Gao

In this paper, we propose VistaDream a novel framework to reconstruct a 3D scene from a single-view image. Recent diffusion models enable generating high-quality novel-view images from a single-view input image. Most existing methods only…

Computer Vision and Pattern Recognition · Computer Science 2024-10-23 Haiping Wang , Yuan Liu , Ziwei Liu , Wenping Wang , Zhen Dong , Bisheng Yang

Novel-view synthesis aims to generate novel views of a scene from multiple input images or videos, and recent advancements like 3D Gaussian splatting (3DGS) have achieved notable success in producing photorealistic renderings with efficient…

Computer Vision and Pattern Recognition · Computer Science 2024-10-22 Xi Liu , Chaoyi Zhou , Siyu Huang

Reconstructing 3D objects from extremely sparse views is a long-standing and challenging problem. While recent techniques employ image diffusion models for generating plausible images at novel viewpoints or for distilling pre-trained…

Computer Vision and Pattern Recognition · Computer Science 2023-12-21 Zi-Xin Zou , Weihao Cheng , Yan-Pei Cao , Shi-Sheng Huang , Ying Shan , Song-Hai Zhang

Text-to-3D generation by distilling pretrained large-scale text-to-image diffusion models has shown great promise but still suffers from inconsistent 3D geometric structures (Janus problems) and severe artifacts. The aforementioned problems…

Computer Vision and Pattern Recognition · Computer Science 2023-12-05 Baorui Ma , Haoge Deng , Junsheng Zhou , Yu-Shen Liu , Tiejun Huang , Xinlong Wang

Video generation models have made significant progress in generating realistic content, enabling applications in simulation, gaming, and film making. However, current generated videos still contain visual artifacts arising from 3D…

Computer Vision and Pattern Recognition · Computer Science 2025-11-25 Duolikun Danier , Ge Gao , Steven McDonagh , Changjian Li , Hakan Bilen , Oisin Mac Aodha

In the realm of text-to-3D generation, utilizing 2D diffusion models through score distillation sampling (SDS) frequently leads to issues such as blurred appearances and multi-faced geometry, primarily due to the intrinsically noisy nature…

Computer Vision and Pattern Recognition · Computer Science 2023-12-06 Pengsheng Guo , Hans Hao , Adam Caccavale , Zhongzheng Ren , Edward Zhang , Qi Shan , Aditya Sankar , Alexander G. Schwing , Alex Colburn , Fangchang Ma

3D object generation from a single image involves estimating the full 3D geometry and texture of unseen views from an unposed RGB image captured in the wild. Accurately reconstructing an object's complete 3D structure and texture has…

Computer Vision and Pattern Recognition · Computer Science 2024-11-21 Hritam Basak , Hadi Tabatabaee , Shreekant Gayaka , Ming-Feng Li , Xin Yang , Cheng-Hao Kuo , Arnie Sen , Min Sun , Zhaozheng Yin

Single-image 3D scene reconstruction presents significant challenges due to its inherently ill-posed nature and limited input constraints. Recent advances have explored two promising directions: multiview generative models that train on 3D…

Computer Vision and Pattern Recognition · Computer Science 2025-04-17 Junlin Hao , Peiheng Wang , Haoyang Wang , Xinggong Zhang , Zongming Guo

In this work, we introduce Wonder3D, a novel method for efficiently generating high-fidelity textured meshes from single-view images.Recent methods based on Score Distillation Sampling (SDS) have shown the potential to recover 3D geometry…

Computer Vision and Pattern Recognition · Computer Science 2023-11-10 Xiaoxiao Long , Yuan-Chen Guo , Cheng Lin , Yuan Liu , Zhiyang Dou , Lingjie Liu , Yuexin Ma , Song-Hai Zhang , Marc Habermann , Christian Theobalt , Wenping Wang

We introduce MVDream, a diffusion model that is able to generate consistent multi-view images from a given text prompt. Learning from both 2D and 3D data, a multi-view diffusion model can achieve the generalizability of 2D diffusion models…

Computer Vision and Pattern Recognition · Computer Science 2024-04-19 Yichun Shi , Peng Wang , Jianglong Ye , Mai Long , Kejie Li , Xiao Yang

Despite advances in neural rendering, due to the scarcity of high-quality 3D datasets and the inherent limitations of multi-view diffusion models, view synthesis and 3D model generation are restricted to low resolutions with suboptimal…

Computer Vision and Pattern Recognition · Computer Science 2025-04-30 Yihang Luo , Shangchen Zhou , Yushi Lan , Xingang Pan , Chen Change Loy

Text-guided diffusion models have shown superior performance in image/video generation and editing. While few explorations have been performed in 3D scenarios. In this paper, we discuss three fundamental and interesting problems on this…

Computer Vision and Pattern Recognition · Computer Science 2023-10-13 Gang Li , Heliang Zheng , Chaoyue Wang , Chang Li , Changwen Zheng , Dacheng Tao
‹ Prev 1 2 3 10 Next ›