Related papers: Precise-Physics Driven Text-to-3D Generation

PhysGen: Physically Grounded 3D Shape Generation for Industrial Design

Existing generative models for 3D shapes can synthesize high-fidelity and visually plausible shapes. For certain classes of shapes that have undergone an engineering design process, the realism of the shape is tightly coupled with the…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Yingxuan You , Chen Zhao , Hantao Zhang , Ming Xu , Pascal Fua

Text-to-3D Shape Generation

Recent years have seen an explosion of work and interest in text-to-3D shape generation. Much of the progress is driven by advances in 3D representations, large-scale pretraining and representation learning for text and image data enabling…

Computer Vision and Pattern Recognition · Computer Science 2024-03-21 Han-Hung Lee , Manolis Savva , Angel X. Chang

PhysGen: Rigid-Body Physics-Grounded Image-to-Video Generation

We present PhysGen, a novel image-to-video generation method that converts a single image and an input condition (e.g., force and torque applied to an object in the image) to produce a realistic, physically plausible, and temporally…

Computer Vision and Pattern Recognition · Computer Science 2024-09-30 Shaowei Liu , Zhongzheng Ren , Saurabh Gupta , Shenlong Wang

Phys4DGen: Physics-Compliant 4D Generation with Multi-Material Composition Perception

4D content generation aims to create dynamically evolving 3D content that responds to specific input objects such as images or 3D representations. Current approaches typically incorporate physical priors to animate 3D representations, but…

Computer Vision and Pattern Recognition · Computer Science 2025-11-04 Jiajing Lin , Zhenzhong Wang , Dejun Xu , Shu Jiang , YunPeng Gong , Min Jiang

Towards High-Fidelity Text-Guided 3D Face Generation and Manipulation Using only Images

Generating 3D faces from textual descriptions has a multitude of applications, such as gaming, movie, and robotics. Recent progresses have demonstrated the success of unconditional 3D face generation and text-to-3D shape generation.…

Computer Vision and Pattern Recognition · Computer Science 2023-09-01 Cuican Yu , Guansong Lu , Yihan Zeng , Jian Sun , Xiaodan Liang , Huibin Li , Zongben Xu , Songcen Xu , Wei Zhang , Hang Xu

PI3D: Efficient Text-to-3D Generation with Pseudo-Image Diffusion

Diffusion models trained on large-scale text-image datasets have demonstrated a strong capability of controllable high-quality image generation from arbitrary text prompts. However, the generation quality and generalization ability of 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-04-23 Ying-Tian Liu , Yuan-Chen Guo , Guan Luo , Heyi Sun , Wei Yin , Song-Hai Zhang

Towards Implicit Text-Guided 3D Shape Generation

In this work, we explore the challenging task of generating 3D shapes from text. Beyond the existing works, we propose a new approach for text-guided 3D shape generation, capable of producing high-fidelity shapes with colors that match the…

Computer Vision and Pattern Recognition · Computer Science 2022-03-29 Zhengzhe Liu , Yi Wang , Xiaojuan Qi , Chi-Wing Fu

Text-to-3D Gaussian Splatting with Physics-Grounded Motion Generation

Text-to-3D generation is a valuable technology in virtual reality and digital content creation. While recent works have pushed the boundaries of text-to-3D generation, producing high-fidelity 3D objects with inefficient prompts and…

Computer Vision and Pattern Recognition · Computer Science 2024-12-10 Wenqing Wang , Yun Fu

ShapeGen: Towards High-Quality 3D Shape Synthesis

Inspired by generative paradigms in image and video, 3D shape generation has made notable progress, enabling the rapid synthesis of high-fidelity 3D assets from a single image. However, current methods still face challenges, including the…

Computer Vision and Pattern Recognition · Computer Science 2025-11-26 Yangguang Li , Xianglong He , Zi-Xin Zou , Zexiang Liu , Wanli Ouyang , Ding Liang , Yan-Pei Cao

Scene Generation at Absolute Scale: Utilizing Semantic and Geometric Guidance From Text for Accurate and Interpretable 3D Indoor Scene Generation

We present GuidedSceneGen, a text-to-3D generation framework that produces metrically accurate, globally consistent, and semantically interpretable indoor scenes. Unlike prior text-driven methods that often suffer from geometric drift or…

Computer Vision and Pattern Recognition · Computer Science 2026-03-17 Stefan Ainetter , Thomas Deixelberger , Edoardo A. Dominici , Philipp Drescher , Konstantinos Vardis , Markus Steinberger

Text2NeRF: Text-Driven 3D Scene Generation with Neural Radiance Fields

Text-driven 3D scene generation is widely applicable to video gaming, film industry, and metaverse applications that have a large demand for 3D scenes. However, existing text-to-3D generation methods are limited to producing 3D objects with…

Computer Vision and Pattern Recognition · Computer Science 2024-02-01 Jingbo Zhang , Xiaoyu Li , Ziyu Wan , Can Wang , Jing Liao

DetailGen3D: Generative 3D Geometry Enhancement via Data-Dependent Flow

Modern 3D generation methods can rapidly create shapes from sparse or single views, but their outputs often lack geometric detail due to computational constraints. We present DetailGen3D, a generative approach specifically designed to…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Ken Deng , Yuan-Chen Guo , Jingxiang Sun , Zi-Xin Zou , Yangguang Li , Xin Cai , Yan-Pei Cao , Yebin Liu , Ding Liang

Dream3D: Zero-Shot Text-to-3D Synthesis Using 3D Shape Prior and Text-to-Image Diffusion Models

Recent CLIP-guided 3D optimization methods, such as DreamFields and PureCLIPNeRF, have achieved impressive results in zero-shot text-to-3D synthesis. However, due to scratch training and random initialization without prior knowledge, these…

Computer Vision and Pattern Recognition · Computer Science 2023-04-04 Jiale Xu , Xintao Wang , Weihao Cheng , Yan-Pei Cao , Ying Shan , Xiaohu Qie , Shenghua Gao

PhysCtrl: Generative Physics for Controllable and Physics-Grounded Video Generation

Existing video generation models excel at producing photo-realistic videos from text or images, but often lack physical plausibility and 3D controllability. To overcome these limitations, we introduce PhysCtrl, a novel framework for…

Computer Vision and Pattern Recognition · Computer Science 2025-11-11 Chen Wang , Chuhao Chen , Yiming Huang , Zhiyang Dou , Yuan Liu , Jiatao Gu , Lingjie Liu

Progressive Text-to-3D Generation for Automatic 3D Prototyping

Text-to-3D generation is to craft a 3D object according to a natural language description. This can significantly reduce the workload for manually designing 3D models and provide a more natural way of interaction for users. However, this…

Computer Vision and Pattern Recognition · Computer Science 2023-09-27 Han Yi , Zhedong Zheng , Xiangyu Xu , Tat-seng Chua

Garment3DGen: 3D Garment Stylization and Texture Generation

We introduce Garment3DGen a new method to synthesize 3D garment assets from a base mesh given a single input image as guidance. Our proposed approach allows users to generate 3D textured clothes based on both real and synthetic images, such…

Computer Vision and Pattern Recognition · Computer Science 2025-05-01 Nikolaos Sarafianos , Tuur Stuyck , Xiaoyu Xiang , Yilei Li , Jovan Popovic , Rakesh Ranjan

PhysGen3D: Crafting a Miniature Interactive World from a Single Image

Envisioning physically plausible outcomes from a single image requires a deep understanding of the world's dynamics. To address this, we introduce PhysGen3D, a novel framework that transforms a single image into an amodal, camera-centric,…

Computer Vision and Pattern Recognition · Computer Science 2025-03-27 Boyuan Chen , Hanxiao Jiang , Shaowei Liu , Saurabh Gupta , Yunzhu Li , Hao Zhao , Shenlong Wang

A Survey On Text-to-3D Contents Generation In The Wild

3D content creation plays a vital role in various applications, such as gaming, robotics simulation, and virtual reality. However, the process is labor-intensive and time-consuming, requiring skilled designers to invest considerable effort…

Computer Vision and Pattern Recognition · Computer Science 2024-05-16 Chenhan Jiang

Atlas3D: Physically Constrained Self-Supporting Text-to-3D for Simulation and Fabrication

Existing diffusion-based text-to-3D generation methods primarily focus on producing visually realistic shapes and appearances, often neglecting the physical constraints necessary for downstream tasks. Generated models frequently fail to…

Machine Learning · Computer Science 2024-11-19 Yunuo Chen , Tianyi Xie , Zeshun Zong , Xuan Li , Feng Gao , Yin Yang , Ying Nian Wu , Chenfanfu Jiang

Hyper-3DG: Text-to-3D Gaussian Generation via Hypergraph

Text-to-3D generation represents an exciting field that has seen rapid advancements, facilitating the transformation of textual descriptions into detailed 3D models. However, current progress often neglects the intricate high-order…

Computer Vision and Pattern Recognition · Computer Science 2025-01-10 Donglin Di , Jiahui Yang , Chaofan Luo , Zhou Xue , Wei Chen , Xun Yang , Yue Gao