Related papers: Zero3D: Semantic-Driven Multi-Category 3D Shape Ge…

Guided and Unguided Conditional Diffusion Mechanisms for Structured and Semantically-Aware 3D Point Cloud Generation

Generating realistic 3D point clouds is a fundamental problem in computer vision with applications in remote sensing, robotics, and digital object modeling. Existing generative approaches primarily capture geometry, and when semantics are…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Gunner Stone , Sushmita Sarker , Alireza Tavakkoli

Michelangelo: Conditional 3D Shape Generation based on Shape-Image-Text Aligned Latent Representation

We present a novel alignment-before-generation approach to tackle the challenging task of generating general 3D shapes based on 2D images or texts. Directly learning a conditional generative model from images or texts to 3D shapes is prone…

Computer Vision and Pattern Recognition · Computer Science 2023-07-04 Zibo Zhao , Wen Liu , Xin Chen , Xianfang Zeng , Rui Wang , Pei Cheng , Bin Fu , Tao Chen , Gang Yu , Shenghua Gao

3D-LDM: Neural Implicit 3D Shape Generation with Latent Diffusion Models

Diffusion models have shown great promise for image generation, beating GANs in terms of generation diversity, with comparable image quality. However, their application to 3D shapes has been limited to point or voxel representations that…

Computer Vision and Pattern Recognition · Computer Science 2022-12-16 Gimin Nam , Mariem Khlifi , Andrew Rodriguez , Alberto Tono , Linqi Zhou , Paul Guerrero

3D Semantic Subspace Traverser: Empowering 3D Generative Model with Shape Editing Capability

Shape generation is the practice of producing 3D shapes as various representations for 3D content creation. Previous studies on 3D shape generation have focused on shape quality and structure, without or less considering the importance of…

Computer Vision and Pattern Recognition · Computer Science 2023-08-17 Ruowei Wang , Yu Liu , Pei Su , Jianwei Zhang , Qijun Zhao

CTGAN: Semantic-guided Conditional Texture Generator for 3D Shapes

The entertainment industry relies on 3D visual content to create immersive experiences, but traditional methods for creating textured 3D models can be time-consuming and subjective. Generative networks such as StyleGAN have advanced image…

Computer Vision and Pattern Recognition · Computer Science 2024-02-09 Yi-Ting Pan , Chai-Rong Lee , Shu-Ho Fan , Jheng-Wei Su , Jia-Bin Huang , Yung-Yu Chuang , Hung-Kuo Chu

Category-Aware 3D Object Composition with Disentangled Texture and Shape Multi-view Diffusion

In this paper, we tackle a new task of 3D object synthesis, where a 3D model is composited with another object category to create a novel 3D model. However, most existing text/image/3D-to-3D methods struggle to effectively integrate…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Zeren Xiong , Zikun Chen , Zedong Zhang , Xiang Li , Ying Tai , Jian Yang , Jun Li

Sketch-A-Shape: Zero-Shot Sketch-to-3D Shape Generation

Significant progress has recently been made in creative applications of large pre-trained models for downstream tasks in 3D vision, such as text-to-shape generation. This motivates our investigation of how these pre-trained models can be…

Computer Vision and Pattern Recognition · Computer Science 2023-07-11 Aditya Sanghi , Pradeep Kumar Jayaraman , Arianna Rampini , Joseph Lambourne , Hooman Shayani , Evan Atherton , Saeid Asgari Taghanaki

Shape from Semantics: 3D Shape Generation from Multi-View Semantics

Existing 3D reconstruction methods utilize guidances such as 2D images, 3D point clouds, shape contours and single semantics to recover the 3D surface, which limits the creative exploration of 3D modeling. In this paper, we propose a novel…

Computer Vision and Pattern Recognition · Computer Science 2025-08-19 Liangchen Li , Caoliwen Wang , Yuqi Zhou , Bailin Deng , Juyong Zhang

Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models

Recent CLIP-guided 3D optimization methods, such as DreamFields and PureCLIPNeRF, have achieved impressive results in zero-shot text-to-3D synthesis. However, due to scratch training and random initialization without prior knowledge, these…

Computer Vision and Pattern Recognition · Computer Science 2023-04-04 Jiale Xu , Xintao Wang , Weihao Cheng , Yan-Pei Cao , Ying Shan , Xiaohu Qie , Shenghua Gao

Diffusion 3D Features (Diff3F): Decorating Untextured Shapes with Distilled Semantic Features

We present Diff3F as a simple, robust, and class-agnostic feature descriptor that can be computed for untextured input shapes (meshes or point clouds). Our method distills diffusion features from image foundational models onto input shapes.…

Computer Vision and Pattern Recognition · Computer Science 2024-04-04 Niladri Shekhar Dutt , Sanjeev Muralikrishnan , Niloy J. Mitra

SSEditor: Controllable Mask-to-Scene Generation with Diffusion Model

Recent advancements in 3D diffusion-based semantic scene generation have gained attention. However, existing methods rely on unconditional generation and require multiple resampling steps when editing scenes, which significantly limits…

Computer Vision and Pattern Recognition · Computer Science 2024-11-20 Haowen Zheng , Yanyan Liang

Controlling Style and Semantics in Weakly-Supervised Image Generation

We propose a weakly-supervised approach for conditional image generation of complex scenes where a user has fine control over objects appearing in the scene. We exploit sparse semantic maps to control object shapes and classes, as well as…

Computer Vision and Pattern Recognition · Computer Science 2020-11-23 Dario Pavllo , Aurelien Lucchi , Thomas Hofmann

3D-aware Image Generation using 2D Diffusion Models

In this paper, we introduce a novel 3D-aware image generation method that leverages 2D diffusion models. We formulate the 3D-aware image generation task as multiview 2D image set generation, and further to a sequential…

Computer Vision and Pattern Recognition · Computer Science 2023-04-03 Jianfeng Xiang , Jiaolong Yang , Binbin Huang , Xin Tong

CMD: Controllable Multiview Diffusion for 3D Editing and Progressive Generation

Recently, 3D generation methods have shown their powerful ability to automate 3D model creation. However, most 3D generation methods only rely on an input image or a text prompt to generate a 3D model, which lacks the control of each…

Computer Vision and Pattern Recognition · Computer Science 2025-07-08 Peng Li , Suizhi Ma , Jialiang Chen , Yuan Liu , Congyi Zhang , Wei Xue , Wenhan Luo , Alla Sheffer , Wenping Wang , Yike Guo

Layered 3D Human Generation via Semantic-Aware Diffusion Model

The generation of 3D clothed humans has attracted increasing attention in recent years. However, existing work cannot generate layered high-quality 3D humans with consistent body structures. As a result, these methods are unable to…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Yi Wang , Jian Ma , Ruizhi Shao , Qiao Feng , Yu-Kun Lai , Yebin Liu , Kun Li

Diffusion Probabilistic Models for Scene-Scale 3D Categorical Data

In this paper, we learn a diffusion model to generate 3D data on a scene-scale. Specifically, our model crafts a 3D scene consisting of multiple objects, while recent diffusion research has focused on a single object. To realize our goal,…

Computer Vision and Pattern Recognition · Computer Science 2023-01-03 Jumin Lee , Woobin Im , Sebin Lee , Sung-Eui Yoon

Compositional 3D Scene Generation using Locally Conditioned Diffusion

Designing complex 3D scenes has been a tedious, manual process requiring domain expertise. Emerging text-to-3D generative models show great promise for making this task more intuitive, but existing approaches are limited to object-level…

Computer Vision and Pattern Recognition · Computer Science 2023-03-24 Ryan Po , Gordon Wetzstein

Towards Implicit Text-Guided 3D Shape Generation

In this work, we explore the challenging task of generating 3D shapes from text. Beyond the existing works, we propose a new approach for text-guided 3D shape generation, capable of producing high-fidelity shapes with colors that match the…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Zhengzhe Liu , Yi Wang , Xiaojuan Qi , Chi-Wing Fu

LTM3D: Bridging Token Spaces for Conditional 3D Generation with Auto-Regressive Diffusion Framework

We present LTM3D, a Latent Token space Modeling framework for conditional 3D shape generation that integrates the strengths of diffusion and auto-regressive (AR) models. While diffusion-based methods effectively model continuous latent…

Computer Vision and Pattern Recognition · Computer Science 2025-06-02 Xin Kang , Zihan Zheng , Lei Chu , Yue Gao , Jiahao Li , Hao Pan , Xuejin Chen , Yan Lu

3D-aware Image Generation and Editing with Multi-modal Conditions

3D-consistent image generation from a single 2D semantic label is an important and challenging research topic in computer graphics and computer vision. Although some related works have made great progress in this field, most of the existing…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Bo Li , Yi-ke Li , Zhi-fen He , Bin Liu , Yun-Kun Lai