Related papers: A3D: Does Diffusion Dream about 3D Alignment?

DreamCraft3D: Hierarchical 3D Generation with Bootstrapped Diffusion Prior

We present DreamCraft3D, a hierarchical 3D content generation method that produces high-fidelity and coherent 3D objects. We tackle the problem by leveraging a 2D reference image to guide the stages of geometry sculpting and texture…

Computer Vision and Pattern Recognition · Computer Science 2023-10-27 Jingxiang Sun , Bo Zhang , Ruizhi Shao , Lizhen Wang , Wen Liu , Zhenda Xie , Yebin Liu

Efficient Text-Guided 3D-Aware Portrait Generation with Score Distillation Sampling on Distribution

Text-to-3D is an emerging task that allows users to create 3D content with infinite possibilities. Existing works tackle the problem by optimizing a 3D representation with guidance from pre-trained diffusion models. An apparent drawback is…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Yiji Cheng , Fei Yin , Xiaoke Huang , Xintong Yu , Jiaxiang Liu , Shikun Feng , Yujiu Yang , Yansong Tang

DreamTime: An Improved Optimization Strategy for Diffusion-Guided 3D Generation

Text-to-image diffusion models pre-trained on billions of image-text pairs have recently enabled 3D content creation by optimizing a randomly initialized differentiable 3D representation with score distillation. However, the optimization…

Computer Vision and Pattern Recognition · Computer Science 2024-05-07 Yukun Huang , Jianan Wang , Yukai Shi , Boshi Tang , Xianbiao Qi , Lei Zhang

Dream-in-Style: Text-to-3D Generation Using Stylized Score Distillation

We present a method to generate 3D objects in styles. Our method takes a text prompt and a style reference image as input and reconstructs a neural radiance field to synthesize a 3D model with the content aligning with the text prompt and…

Computer Vision and Pattern Recognition · Computer Science 2025-02-14 Hubert Kompanowski , Binh-Son Hua

Retrieval-Augmented Score Distillation for Text-to-3D Generation

Text-to-3D generation has achieved significant success by incorporating powerful 2D diffusion models, but insufficient 3D prior knowledge also leads to the inconsistency of 3D geometry. Recently, since large-scale multi-view datasets have…

Computer Vision and Pattern Recognition · Computer Science 2024-05-03 Junyoung Seo , Susung Hong , Wooseok Jang , Inès Hyeonsu Kim , Minseop Kwak , Doyup Lee , Seungryong Kim

Enhancing Single Image to 3D Generation using Gaussian Splatting and Hybrid Diffusion Priors

3D object generation from a single image involves estimating the full 3D geometry and texture of unseen views from an unposed RGB image captured in the wild. Accurately reconstructing an object's complete 3D structure and texture has…

Computer Vision and Pattern Recognition · Computer Science 2024-11-21 Hritam Basak , Hadi Tabatabaee , Shreekant Gayaka , Ming-Feng Li , Xin Yang , Cheng-Hao Kuo , Arnie Sen , Min Sun , Zhaozheng Yin

3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models

Text-guided diffusion models have shown superior performance in image/video generation and editing. While few explorations have been performed in 3D scenarios. In this paper, we discuss three fundamental and interesting problems on this…

Computer Vision and Pattern Recognition · Computer Science 2023-10-13 Gang Li , Heliang Zheng , Chaoyue Wang , Chang Li , Changwen Zheng , Dacheng Tao

DreamFlow: High-Quality Text-to-3D Generation by Approximating Probability Flow

Recent progress in text-to-3D generation has been achieved through the utilization of score distillation methods: they make use of the pre-trained text-to-image (T2I) diffusion models by distilling via the diffusion model training…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Kyungmin Lee , Kihyuk Sohn , Jinwoo Shin

Text-to-3D Generation with Bidirectional Diffusion using both 2D and 3D priors

Most 3D generation research focuses on up-projecting 2D foundation models into the 3D space, either by minimizing 2D Score Distillation Sampling (SDS) loss or fine-tuning on multi-view datasets. Without explicit 3D priors, these methods…

Computer Vision and Pattern Recognition · Computer Science 2023-12-11 Lihe Ding , Shaocong Dong , Zhanpeng Huang , Zibin Wang , Yiyuan Zhang , Kaixiong Gong , Dan Xu , Tianfan Xue

SweetDreamer: Aligning Geometric Priors in 2D Diffusion for Consistent Text-to-3D

It is inherently ambiguous to lift 2D results from pre-trained diffusion models to a 3D world for text-to-3D generation. 2D diffusion models solely learn view-agnostic priors and thus lack 3D knowledge during the lifting, leading to the…

Computer Vision and Pattern Recognition · Computer Science 2023-10-23 Weiyu Li , Rui Chen , Xuelin Chen , Ping Tan

Semantic Score Distillation Sampling for Compositional Text-to-3D Generation

Generating high-quality 3D assets from textual descriptions remains a pivotal challenge in computer graphics and vision research. Due to the scarcity of 3D data, state-of-the-art approaches utilize pre-trained 2D diffusion priors, optimized…

Computer Vision and Pattern Recognition · Computer Science 2024-10-14 Ling Yang , Zixiang Zhang , Junlin Han , Bohan Zeng , Runjia Li , Philip Torr , Wentao Zhang

Let 2D Diffusion Model Know 3D-Consistency for Robust Text-to-3D Generation

Text-to-3D generation has shown rapid progress in recent days with the advent of score distillation, a methodology of using pretrained text-to-2D diffusion models to optimize neural radiance field (NeRF) in the zero-shot setting. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-02-07 Junyoung Seo , Wooseok Jang , Min-Seop Kwak , Hyeonsu Kim , Jaehoon Ko , Junho Kim , Jin-Hwa Kim , Jiyoung Lee , Seungryong Kim

Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication

Existing diffusion-based text-to-3D generation methods primarily focus on producing visually realistic shapes and appearances, often neglecting the physical constraints necessary for downstream tasks. Generated models frequently fail to…

Machine Learning · Computer Science 2024-11-19 Yunuo Chen , Tianyi Xie , Zeshun Zong , Xuan Li , Feng Gao , Yin Yang , Ying Nian Wu , Chenfanfu Jiang

Grounded Compositional and Diverse Text-to-3D with Pretrained Multi-View Diffusion Model

In this paper, we propose an effective two-stage approach named Grounded-Dreamer to generate 3D assets that can accurately follow complex, compositional text prompts while achieving high fidelity by using a pre-trained multi-view diffusion…

Computer Vision and Pattern Recognition · Computer Science 2024-04-30 Xiaolong Li , Jiawei Mo , Ying Wang , Chethan Parameshwara , Xiaohan Fei , Ashwin Swaminathan , CJ Taylor , Zhuowen Tu , Paolo Favaro , Stefano Soatto

Acc3D: Accelerating Single Image to 3D Diffusion Models via Edge Consistency Guided Score Distillation

We present Acc3D to tackle the challenge of accelerating the diffusion process to generate 3D models from single images. To derive high-quality reconstructions through few-step inferences, we emphasize the critical issue of regularizing the…

Computer Vision and Pattern Recognition · Computer Science 2025-03-21 Kendong Liu , Zhiyu Zhu , Hui Liu , Junhui Hou

Diffusion Models in 3D Vision: A Survey

In recent years, 3D vision has become a crucial field within computer vision, powering a wide range of applications such as autonomous driving, robotics, augmented reality, and medical imaging. This field relies on accurate perception,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Zhen Wang , Dongyuan Li , Yaozu Wu , Tianyu He , Jiang Bian , Renhe Jiang

DreamPolish: Domain Score Distillation With Progressive Geometry Generation

We introduce DreamPolish, a text-to-3D generation model that excels in producing refined geometry and high-quality textures. In the geometry construction phase, our approach leverages multiple neural representations to enhance the stability…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Yean Cheng , Ziqi Cai , Ming Ding , Wendi Zheng , Shiyu Huang , Yuxiao Dong , Jie Tang , Boxin Shi

DreamEdit3D: Personalization of Multi-View Diffusion Models for 3D Editing

While 2D diffusion models have achieved remarkable success in identity-preserving personalization, extending this capability to 3D assets remains a significant challenge due to the complexities of multi-view consistency and spatial control.…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Jinxin Ai , Matthias Nießner , Ziya Erkoç

Dive3D: Diverse Distillation-based Text-to-3D Generation via Score Implicit Matching

Distilling pre-trained 2D diffusion models into 3D assets has driven remarkable advances in text-to-3D synthesis. However, existing methods typically rely on Score Distillation Sampling (SDS) loss, which involves asymmetric KL divergence--a…

Computer Vision and Pattern Recognition · Computer Science 2025-06-17 Weimin Bai , Yubo Li , Wenzheng Chen , Weijian Luo , He Sun

Text-based Animatable 3D Avatars with Morphable Model Alignment

The generation of high-quality, animatable 3D head avatars from text has enormous potential in content creation applications such as games, movies, and embodied virtual assistants. Current text-to-3D generation methods typically combine…

Computer Vision and Pattern Recognition · Computer Science 2025-04-23 Yiqian Wu , Malte Prinzler , Xiaogang Jin , Siyu Tang