Related papers: MeshCoder: LLM-Powered Structured Mesh Code Genera…

CAD-Recode: Reverse Engineering CAD Code from Point Clouds

Computer-Aided Design (CAD) models are typically constructed by sequentially drawing parametric sketches and applying CAD operations to obtain a 3D model. The problem of 3D CAD reverse engineering consists of reconstructing the sketch and…

Computer Vision and Pattern Recognition · Computer Science 2025-03-12 Danila Rukhovich , Elona Dupont , Dimitrios Mallis , Kseniya Cherenkova , Anis Kacem , Djamila Aouada

LL3M: Large Language 3D Modelers

We present LL3M, a multi-agent system that leverages pretrained large language models (LLMs) to generate 3D assets by writing interpretable Python code in Blender. We break away from the typical generative approach that learns from a…

Graphics · Computer Science 2025-08-12 Sining Lu , Guan Chen , Nam Anh Dinh , Itai Lang , Ari Holtzman , Rana Hanocka

MeshLLM: Empowering Large Language Models to Progressively Understand and Generate 3D Mesh

We present MeshLLM, a novel framework that leverages large language models (LLMs) to understand and generate text-serialized 3D meshes. Our approach addresses key limitations in existing methods, including the limited dataset scale when…

Graphics · Computer Science 2025-08-06 Shuangkang Fang , I-Chao Shen , Yufeng Wang , Yi-Hsuan Tsai , Yi Yang , Shuchang Zhou , Wenrui Ding , Takeo Igarashi , Ming-Hsuan Yang

ToolCoder: A Systematic Code-Empowered Tool Learning Framework for Large Language Models

Tool learning has emerged as a crucial capability for large language models (LLMs) to solve complex real-world tasks through interaction with external tools. Existing approaches face significant challenges, including reliance on…

Computation and Language · Computer Science 2025-06-02 Hanxing Ding , Shuchang Tao , Liang Pang , Zihao Wei , Jinyang Gao , Bolin Ding , Huawei Shen , Xueqi Cheng

BrepCoder: A Unified Multimodal Large Language Model for Multi-task B-rep Reasoning

Recent advancements in deep learning have actively addressed complex challenges within the Computer-Aided Design (CAD) domain.However, most existing approaches rely on task-specifi c models requiring structural modifi cations for new tasks,…

Machine Learning · Computer Science 2026-03-03 Mingi Kim , Yongjun Kim , Jungwoo Kang , Hyungki Kim

Don't Mesh with Me: Generating Constructive Solid Geometry Instead of Meshes by Fine-Tuning a Code-Generation LLM

While recent advancements in machine learning, such as LLMs, are revolutionizing software development and creative industries, they have had minimal impact on engineers designing mechanical parts, which remains largely a manual process.…

Machine Learning · Computer Science 2025-05-07 Maximilian Mews , Ansar Aynetdinov , Vivian Schiller , Peter Eisert , Alan Akbik

3D Shape Reconstruction from Sketches via Multi-view Convolutional Networks

We propose a method for reconstructing 3D shapes from 2D sketches in the form of line drawings. Our method takes as input a single sketch, or multiple sketches, and outputs a dense point cloud representing a 3D reconstruction of the input…

Computer Vision and Pattern Recognition · Computer Science 2017-10-02 Zhaoliang Lun , Matheus Gadelha , Evangelos Kalogerakis , Subhransu Maji , Rui Wang

Learning From Design Procedure To Generate CAD Programs for Data Augmentation

Large Language Models (LLMs) have demonstrated impressive capabilities in a wide range of code generation tasks. However, generating code for certain domains remains challenging. One such domain is Computer-Aided Design (CAD) program, where…

Machine Learning · Computer Science 2026-03-10 Yan-Ying Chen , Dule Shu , Matthew Hong , Andrew Taber , Jonathan Li , Matthew Klenk

LAM3D: Large Image-Point-Cloud Alignment Model for 3D Reconstruction from Single Image

Large Reconstruction Models have made significant strides in the realm of automated 3D content generation from single or multiple input images. Despite their success, these models often produce 3D meshes with geometric inaccuracies,…

Computer Vision and Pattern Recognition · Computer Science 2024-05-27 Ruikai Cui , Xibin Song , Weixuan Sun , Senbo Wang , Weizhe Liu , Shenzhou Chen , Taizhang Shang , Yang Li , Nick Barnes , Hongdong Li , Pan Ji

Point Cloud as a Foreign Language for Multi-modal Large Language Model

Multi-modal large language models (MLLMs) have shown remarkable progress in integrating visual and linguistic understanding. Recent efforts have extended these capabilities to 3D understanding through encoder-based architectures that rely…

Computer Vision and Pattern Recognition · Computer Science 2026-03-11 Sneha Paul , Zachary Patterson , Nizar Bouguila

Multilingual Multimodal Software Developer for Code Generation

The rapid advancement of Large Language Models (LLMs) has significantly improved code generation, yet most models remain text-only, neglecting crucial visual aids like diagrams and flowcharts used in real-world software development. To…

Computation and Language · Computer Science 2025-07-14 Linzheng Chai , Jian Yang , Shukai Liu , Wei Zhang , Liran Wang , Ke Jin , Tao Sun , Congnan Liu , Chenchen Zhang , Hualei Zhu , Jiaheng Liu , Xianjie Wu , Ge Zhang , Tianyu Liu , Zhoujun Li

CAD-Coder: An Open-Source Vision-Language Model for Computer-Aided Design Code Generation

Efficient creation of accurate and editable 3D CAD models is critical in engineering design, significantly impacting cost and time-to-market in product innovation. Current manual workflows remain highly time-consuming and demand extensive…

Computer Vision and Pattern Recognition · Computer Science 2025-05-21 Anna C. Doris , Md Ferdous Alam , Amin Heyrani Nobari , Faez Ahmed

3D Mesh Editing using Masked LRMs

We present a novel approach to shape editing, building on recent progress in 3D reconstruction from multi-view images. We formulate shape editing as a conditional reconstruction problem, where the model must reconstruct the input shape with…

Computer Vision and Pattern Recognition · Computer Science 2025-09-16 Will Gao , Dilin Wang , Yuchen Fan , Aljaz Bozic , Tuur Stuyck , Zhengqin Li , Zhao Dong , Rakesh Ranjan , Nikolaos Sarafianos

Part-X-MLLM: Part-aware 3D Multimodal Large Language Model

We introduce Part-X-MLLM, a native 3D multimodal large language model that unifies diverse 3D tasks by formulating them as programs in a structured, executable grammar. Given an RGB point cloud and a natural language prompt, our model…

Computer Vision and Pattern Recognition · Computer Science 2025-11-18 Chunshi Wang , Junliang Ye , Yunhan Yang , Yang Li , Zizhuo Lin , Jun Zhu , Zhuo Chen , Yawei Luo , Chunchao Guo

PointLLM: Empowering Large Language Models to Understand Point Clouds

The unprecedented advancements in Large Language Models (LLMs) have shown a profound impact on natural language processing but are yet to fully embrace the realm of 3D understanding. This paper introduces PointLLM, a preliminary effort to…

Computer Vision and Pattern Recognition · Computer Science 2024-09-10 Runsen Xu , Xiaolong Wang , Tai Wang , Yilun Chen , Jiangmiao Pang , Dahua Lin

Point Linguist Model: Segment Any Object via Bridged Large 3D-Language Model

3D object segmentation with Large Language Models (LLMs) has become a prevailing paradigm due to its broad semantics, task flexibility, and strong generalization. However, this paradigm is hindered by representation misalignment: LLMs…

Computer Vision and Pattern Recognition · Computer Science 2026-02-20 Zhuoxu Huang , Mingqi Gao , Jungong Han

A Review of Deep Learning-Powered Mesh Reconstruction Methods

With the recent advances in hardware and rendering techniques, 3D models have emerged everywhere in our life. Yet creating 3D shapes is arduous and requires significant professional knowledge. Meanwhile, Deep learning has enabled…

Computer Vision and Pattern Recognition · Computer Science 2023-03-07 Zhiqin Chen

Learning to Generate Dense Point Clouds with Textures on Multiple Categories

3D reconstruction from images is a core problem in computer vision. With recent advances in deep learning, it has become possible to recover plausible 3D shapes even from single RGB images for the first time. However, obtaining detailed…

Computer Vision and Pattern Recognition · Computer Science 2019-12-24 Tao Hu , Geng Lin , Zhizhong Han , Matthias Zwicker

Point2Building: Reconstructing Buildings from Airborne LiDAR Point Clouds

We present a learning-based approach to reconstruct buildings as 3D polygonal meshes from airborne LiDAR point clouds. What makes 3D building reconstruction from airborne LiDAR hard is the large diversity of building designs and especially…

Computer Vision and Pattern Recognition · Computer Science 2024-07-30 Yujia Liu , Anton Obukhov , Jan Dirk Wegner , Konrad Schindler

ShapeLLM: Universal 3D Object Understanding for Embodied Interaction

This paper presents ShapeLLM, the first 3D Multimodal Large Language Model (LLM) designed for embodied interaction, exploring a universal 3D object understanding with 3D point clouds and languages. ShapeLLM is built upon an improved 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-07-15 Zekun Qi , Runpei Dong , Shaochen Zhang , Haoran Geng , Chunrui Han , Zheng Ge , Li Yi , Kaisheng Ma