English
Related papers

Related papers: A3D: Does Diffusion Dream about 3D Alignment?

200 papers

We present DreamCraft3D, a hierarchical 3D content generation method that produces high-fidelity and coherent 3D objects. We tackle the problem by leveraging a 2D reference image to guide the stages of geometry sculpting and texture…

Computer Vision and Pattern Recognition · Computer Science 2023-10-27 Jingxiang Sun , Bo Zhang , Ruizhi Shao , Lizhen Wang , Wen Liu , Zhenda Xie , Yebin Liu

Text-to-3D is an emerging task that allows users to create 3D content with infinite possibilities. Existing works tackle the problem by optimizing a 3D representation with guidance from pre-trained diffusion models. An apparent drawback is…

Computer Vision and Pattern Recognition · Computer Science 2023-06-06 Yiji Cheng , Fei Yin , Xiaoke Huang , Xintong Yu , Jiaxiang Liu , Shikun Feng , Yujiu Yang , Yansong Tang

Text-to-image diffusion models pre-trained on billions of image-text pairs have recently enabled 3D content creation by optimizing a randomly initialized differentiable 3D representation with score distillation. However, the optimization…

Computer Vision and Pattern Recognition · Computer Science 2024-05-07 Yukun Huang , Jianan Wang , Yukai Shi , Boshi Tang , Xianbiao Qi , Lei Zhang

We present a method to generate 3D objects in styles. Our method takes a text prompt and a style reference image as input and reconstructs a neural radiance field to synthesize a 3D model with the content aligning with the text prompt and…

Computer Vision and Pattern Recognition · Computer Science 2025-02-14 Hubert Kompanowski , Binh-Son Hua

Text-to-3D generation has achieved significant success by incorporating powerful 2D diffusion models, but insufficient 3D prior knowledge also leads to the inconsistency of 3D geometry. Recently, since large-scale multi-view datasets have…

Computer Vision and Pattern Recognition · Computer Science 2024-05-03 Junyoung Seo , Susung Hong , Wooseok Jang , Inès Hyeonsu Kim , Minseop Kwak , Doyup Lee , Seungryong Kim

3D object generation from a single image involves estimating the full 3D geometry and texture of unseen views from an unposed RGB image captured in the wild. Accurately reconstructing an object's complete 3D structure and texture has…

Computer Vision and Pattern Recognition · Computer Science 2024-11-21 Hritam Basak , Hadi Tabatabaee , Shreekant Gayaka , Ming-Feng Li , Xin Yang , Cheng-Hao Kuo , Arnie Sen , Min Sun , Zhaozheng Yin

Text-guided diffusion models have shown superior performance in image/video generation and editing. While few explorations have been performed in 3D scenarios. In this paper, we discuss three fundamental and interesting problems on this…

Computer Vision and Pattern Recognition · Computer Science 2023-10-13 Gang Li , Heliang Zheng , Chaoyue Wang , Chang Li , Changwen Zheng , Dacheng Tao

Recent progress in text-to-3D generation has been achieved through the utilization of score distillation methods: they make use of the pre-trained text-to-image (T2I) diffusion models by distilling via the diffusion model training…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Kyungmin Lee , Kihyuk Sohn , Jinwoo Shin

Most 3D generation research focuses on up-projecting 2D foundation models into the 3D space, either by minimizing 2D Score Distillation Sampling (SDS) loss or fine-tuning on multi-view datasets. Without explicit 3D priors, these methods…

Computer Vision and Pattern Recognition · Computer Science 2023-12-11 Lihe Ding , Shaocong Dong , Zhanpeng Huang , Zibin Wang , Yiyuan Zhang , Kaixiong Gong , Dan Xu , Tianfan Xue

It is inherently ambiguous to lift 2D results from pre-trained diffusion models to a 3D world for text-to-3D generation. 2D diffusion models solely learn view-agnostic priors and thus lack 3D knowledge during the lifting, leading to the…

Computer Vision and Pattern Recognition · Computer Science 2023-10-23 Weiyu Li , Rui Chen , Xuelin Chen , Ping Tan

Generating high-quality 3D assets from textual descriptions remains a pivotal challenge in computer graphics and vision research. Due to the scarcity of 3D data, state-of-the-art approaches utilize pre-trained 2D diffusion priors, optimized…

Computer Vision and Pattern Recognition · Computer Science 2024-10-14 Ling Yang , Zixiang Zhang , Junlin Han , Bohan Zeng , Runjia Li , Philip Torr , Wentao Zhang

Text-to-3D generation has shown rapid progress in recent days with the advent of score distillation, a methodology of using pretrained text-to-2D diffusion models to optimize neural radiance field (NeRF) in the zero-shot setting. However,…

Computer Vision and Pattern Recognition · Computer Science 2024-02-07 Junyoung Seo , Wooseok Jang , Min-Seop Kwak , Hyeonsu Kim , Jaehoon Ko , Junho Kim , Jin-Hwa Kim , Jiyoung Lee , Seungryong Kim

Existing diffusion-based text-to-3D generation methods primarily focus on producing visually realistic shapes and appearances, often neglecting the physical constraints necessary for downstream tasks. Generated models frequently fail to…

Machine Learning · Computer Science 2024-11-19 Yunuo Chen , Tianyi Xie , Zeshun Zong , Xuan Li , Feng Gao , Yin Yang , Ying Nian Wu , Chenfanfu Jiang

In this paper, we propose an effective two-stage approach named Grounded-Dreamer to generate 3D assets that can accurately follow complex, compositional text prompts while achieving high fidelity by using a pre-trained multi-view diffusion…

Computer Vision and Pattern Recognition · Computer Science 2024-04-30 Xiaolong Li , Jiawei Mo , Ying Wang , Chethan Parameshwara , Xiaohan Fei , Ashwin Swaminathan , CJ Taylor , Zhuowen Tu , Paolo Favaro , Stefano Soatto

We present Acc3D to tackle the challenge of accelerating the diffusion process to generate 3D models from single images. To derive high-quality reconstructions through few-step inferences, we emphasize the critical issue of regularizing the…

Computer Vision and Pattern Recognition · Computer Science 2025-03-21 Kendong Liu , Zhiyu Zhu , Hui Liu , Junhui Hou

In recent years, 3D vision has become a crucial field within computer vision, powering a wide range of applications such as autonomous driving, robotics, augmented reality, and medical imaging. This field relies on accurate perception,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Zhen Wang , Dongyuan Li , Yaozu Wu , Tianyu He , Jiang Bian , Renhe Jiang

We introduce DreamPolish, a text-to-3D generation model that excels in producing refined geometry and high-quality textures. In the geometry construction phase, our approach leverages multiple neural representations to enhance the stability…

Computer Vision and Pattern Recognition · Computer Science 2024-11-05 Yean Cheng , Ziqi Cai , Ming Ding , Wendi Zheng , Shiyu Huang , Yuxiao Dong , Jie Tang , Boxin Shi

While 2D diffusion models have achieved remarkable success in identity-preserving personalization, extending this capability to 3D assets remains a significant challenge due to the complexities of multi-view consistency and spatial control.…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Jinxin Ai , Matthias Nießner , Ziya Erkoç

Distilling pre-trained 2D diffusion models into 3D assets has driven remarkable advances in text-to-3D synthesis. However, existing methods typically rely on Score Distillation Sampling (SDS) loss, which involves asymmetric KL divergence--a…

Computer Vision and Pattern Recognition · Computer Science 2025-06-17 Weimin Bai , Yubo Li , Wenzheng Chen , Weijian Luo , He Sun

The generation of high-quality, animatable 3D head avatars from text has enormous potential in content creation applications such as games, movies, and embodied virtual assistants. Current text-to-3D generation methods typically combine…

Computer Vision and Pattern Recognition · Computer Science 2025-04-23 Yiqian Wu , Malte Prinzler , Xiaogang Jin , Siyu Tang
‹ Prev 1 2 3 10 Next ›