Related papers: Control3Diff: Learning Controllable 3D Diffusion M…

WildFusion: Learning 3D-Aware Latent Diffusion Models in View Space

Modern learning-based approaches to 3D-aware image synthesis achieve high photorealism and 3D-consistent viewpoint changes for the generated images. Existing approaches represent instances in a shared canonical space. However, for…

Computer Vision and Pattern Recognition · Computer Science 2024-04-15 Katja Schwarz , Seung Wook Kim , Jun Gao , Sanja Fidler , Andreas Geiger , Karsten Kreis

GANFusion: Feed-Forward Text-to-3D with Diffusion in GAN Space

We train a feed-forward text-to-3D diffusion generator for human characters using only single-view 2D data for supervision. Existing 3D generative models cannot yet match the fidelity of image or video generative models. State-of-the-art 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-12-24 Souhaib Attaiki , Paul Guerrero , Duygu Ceylan , Niloy J. Mitra , Maks Ovsjanikov

Realiz3D: 3D Generation Made Photorealistic via Domain-Aware Learning

We often aim to generate images that are both photorealistic and 3D-consistent, adhering to precise geometry, material, and viewpoint controls. Typically, this is achieved by fine-tuning an image generator, pre-trained on billions of real…

Graphics · Computer Science 2026-05-15 Ido Sobol , Kihyuk Sohn , Yoav Blum , Egor Zakharov , Max Bluvstein , Andrea Vedaldi , Or Litany

MultiDiffusion: Fusing Diffusion Paths for Controlled Image Generation

Recent advances in text-to-image generation with diffusion models present transformative capabilities in image quality. However, user controllability of the generated image, and fast adaptation to new tasks still remains an open challenge,…

Computer Vision and Pattern Recognition · Computer Science 2023-02-17 Omer Bar-Tal , Lior Yariv , Yaron Lipman , Tali Dekel

3DDesigner: Towards Photorealistic 3D Object Generation and Editing with Text-guided Diffusion Models

Text-guided diffusion models have shown superior performance in image/video generation and editing. While few explorations have been performed in 3D scenarios. In this paper, we discuss three fundamental and interesting problems on this…

Computer Vision and Pattern Recognition · Computer Science 2023-10-13 Gang Li , Heliang Zheng , Chaoyue Wang , Chang Li , Changwen Zheng , Dacheng Tao

Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy

Creating realistic 3D objects and clothed avatars from a single RGB image is an attractive yet challenging problem. Due to its ill-posed nature, recent works leverage powerful prior from 2D diffusion models pretrained on large datasets.…

Computer Vision and Pattern Recognition · Computer Science 2025-11-27 Yuxuan Xue , Xianghui Xie , Riccardo Marin , Gerard Pons-Moll

Diffusion Models in 3D Vision: A Survey

In recent years, 3D vision has become a crucial field within computer vision, powering a wide range of applications such as autonomous driving, robotics, augmented reality, and medical imaging. This field relies on accurate perception,…

Computer Vision and Pattern Recognition · Computer Science 2025-04-02 Zhen Wang , Dongyuan Li , Yaozu Wu , Tianyu He , Jiang Bian , Renhe Jiang

DiffusionGAN3D: Boosting Text-guided 3D Generation and Domain Adaptation by Combining 3D GANs and Diffusion Priors

Text-guided domain adaptation and generation of 3D-aware portraits find many applications in various fields. However, due to the lack of training data and the challenges in handling the high variety of geometry and appearance, the existing…

Computer Vision and Pattern Recognition · Computer Science 2024-04-15 Biwen Lei , Kai Yu , Mengyang Feng , Miaomiao Cui , Xuansong Xie

Control3D: Towards Controllable Text-to-3D Generation

Recent remarkable advances in large-scale text-to-image diffusion models have inspired a significant breakthrough in text-to-3D generation, pursuing 3D content creation solely from a given text prompt. However, existing text-to-3D…

Computer Vision and Pattern Recognition · Computer Science 2023-11-10 Yang Chen , Yingwei Pan , Yehao Li , Ting Yao , Tao Mei

DiffPortrait3D: Controllable Diffusion for Zero-Shot Portrait View Synthesis

We present DiffPortrait3D, a conditional diffusion model that is capable of synthesizing 3D-consistent photo-realistic novel views from as few as a single in-the-wild portrait. Specifically, given a single RGB input, we aim to synthesize…

Computer Vision and Pattern Recognition · Computer Science 2025-03-21 Yuming Gu , You Xie , Hongyi Xu , Guoxian Song , Yichun Shi , Di Chang , Jing Yang , Linjie Luo

Diffusion Models are Geometry Critics: Single Image 3D Editing Using Pre-Trained Diffusion Priors

We propose a novel image editing technique that enables 3D manipulations on single images, such as object rotation and translation. Existing 3D-aware image editing approaches typically rely on synthetic multi-view datasets for training…

Computer Vision and Pattern Recognition · Computer Science 2024-07-16 Ruicheng Wang , Jianfeng Xiang , Jiaolong Yang , Xin Tong

DiffSketching: Sketch Control Image Synthesis with Diffusion Models

Creative sketch is a universal way of visual expression, but translating images from an abstract sketch is very challenging. Traditionally, creating a deep learning model for sketch-to-image synthesis needs to overcome the distorted input…

Computer Vision and Pattern Recognition · Computer Science 2023-05-31 Qiang Wang , Di Kong , Fengyin Lin , Yonggang Qi

RenderDiffusion: Image Diffusion for 3D Reconstruction, Inpainting and Generation

Diffusion models currently achieve state-of-the-art performance for both conditional and unconditional image generation. However, so far, image diffusion models do not support tasks required for 3D understanding, such as view-consistent 3D…

Computer Vision and Pattern Recognition · Computer Science 2024-02-22 Titas Anciukevičius , Zexiang Xu , Matthew Fisher , Paul Henderson , Hakan Bilen , Niloy J. Mitra , Paul Guerrero

Controlling Avatar Diffusion with Learnable Gaussian Embedding

Recent advances in diffusion models have made significant progress in digital human generation. However, most existing models still struggle to maintain 3D consistency, temporal coherence, and motion accuracy. A key reason for these…

Graphics · Computer Science 2025-03-21 Xuan Gao , Jingtao Zhou , Dongyu Liu , Yuqi Zhou , Juyong Zhang

ControlMat: A Controlled Generative Approach to Material Capture

Material reconstruction from a photograph is a key component of 3D content creation democratization. We propose to formulate this ill-posed problem as a controlled synthesis one, leveraging the recent progress in generative deep networks.…

Computer Vision and Pattern Recognition · Computer Science 2024-08-28 Giuseppe Vecchio , Rosalie Martin , Arthur Roullier , Adrien Kaiser , Romain Rouffet , Valentin Deschaintre , Tamy Boubekeur

3DiffTection: 3D Object Detection with Geometry-Aware Diffusion Features

We present 3DiffTection, a state-of-the-art method for 3D object detection from single images, leveraging features from a 3D-aware diffusion model. Annotating large-scale image data for 3D detection is resource-intensive and time-consuming.…

Computer Vision and Pattern Recognition · Computer Science 2023-11-09 Chenfeng Xu , Huan Ling , Sanja Fidler , Or Litany

ControlCom: Controllable Image Composition using Diffusion Model

Image composition targets at synthesizing a realistic composite image from a pair of foreground and background images. Recently, generative composition methods are built on large pretrained diffusion models to generate composite images,…

Computer Vision and Pattern Recognition · Computer Science 2023-08-22 Bo Zhang , Yuxuan Duan , Jun Lan , Yan Hong , Huijia Zhu , Weiqiang Wang , Li Niu

CAD: Photorealistic 3D Generation via Adversarial Distillation

The increased demand for 3D data in AR/VR, robotics and gaming applications, gave rise to powerful generative pipelines capable of synthesizing high-quality 3D objects. Most of these models rely on the Score Distillation Sampling (SDS)…

Computer Vision and Pattern Recognition · Computer Science 2023-12-12 Ziyu Wan , Despoina Paschalidou , Ian Huang , Hongyu Liu , Bokui Shen , Xiaoyu Xiang , Jing Liao , Leonidas Guibas

IT3D: Improved Text-to-3D Generation with Explicit View Synthesis

Recent strides in Text-to-3D techniques have been propelled by distilling knowledge from powerful large text-to-image diffusion models (LDMs). Nonetheless, existing Text-to-3D approaches often grapple with challenges such as…

Computer Vision and Pattern Recognition · Computer Science 2023-08-23 Yiwen Chen , Chi Zhang , Xiaofeng Yang , Zhongang Cai , Gang Yu , Lei Yang , Guosheng Lin

Enhanced Controllability of Diffusion Models via Feature Disentanglement and Realism-Enhanced Sampling Methods

As Diffusion Models have shown promising performance, a lot of efforts have been made to improve the controllability of Diffusion Models. However, how to train Diffusion Models to have the disentangled latent spaces and how to naturally…

Computer Vision and Pattern Recognition · Computer Science 2025-07-02 Wonwoong Cho , Hareesh Ravi , Midhun Harikumar , Vinh Khuc , Krishna Kumar Singh , Jingwan Lu , David I. Inouye , Ajinkya Kale