Related papers: Unaligned 2D to 3D Translation with Conditional Ve…

Uncertainty-Aware Diffusion Guided Refinement of 3D Scenes

Reconstructing 3D scenes from a single image is a fundamentally ill-posed task due to the severely under-constrained nature of the problem. Consequently, when the scene is rendered from novel camera views, existing single image to 3D…

Computer Vision and Pattern Recognition · Computer Science 2025-10-10 Sarosij Bose , Arindam Dutta , Sayak Nag , Junge Zhang , Jiachen Li , Konstantinos Karydis , Amit K. Roy Chowdhury

LACONIC: A 3D Layout Adapter for Controllable Image Creation

Existing generative approaches for guided image synthesis of multi-object scenes typically rely on 2D controls in the image or text space. As a result, these methods struggle to maintain and respect consistent three-dimensional geometric…

Computer Vision and Pattern Recognition · Computer Science 2025-08-05 Léopold Maillard , Tom Durand , Adrien Ramanana Rahary , Maks Ovsjanikov

IT3D: Improved Text-to-3D Generation with Explicit View Synthesis

Recent strides in Text-to-3D techniques have been propelled by distilling knowledge from powerful large text-to-image diffusion models (LDMs). Nonetheless, existing Text-to-3D approaches often grapple with challenges such as…

Computer Vision and Pattern Recognition · Computer Science 2023-08-23 Yiwen Chen , Chi Zhang , Xiaofeng Yang , Zhongang Cai , Gang Yu , Lei Yang , Guosheng Lin

Towards Unsupervised Learning of Generative Models for 3D Controllable Image Synthesis

In recent years, Generative Adversarial Networks have achieved impressive results in photorealistic image synthesis. This progress nurtures hopes that one day the classical rendering pipeline can be replaced by efficient models that are…

Computer Vision and Pattern Recognition · Computer Science 2020-03-25 Yiyi Liao , Katja Schwarz , Lars Mescheder , Andreas Geiger

Guided and Unguided Conditional Diffusion Mechanisms for Structured and Semantically-Aware 3D Point Cloud Generation

Generating realistic 3D point clouds is a fundamental problem in computer vision with applications in remote sensing, robotics, and digital object modeling. Existing generative approaches primarily capture geometry, and when semantics are…

Computer Vision and Pattern Recognition · Computer Science 2025-09-23 Gunner Stone , Sushmita Sarker , Alireza Tavakkoli

GRAF: Generative Radiance Fields for 3D-Aware Image Synthesis

While 2D generative adversarial networks have enabled high-resolution image synthesis, they largely lack an understanding of the 3D world and the image formation process. Thus, they do not provide precise control over camera viewpoint or…

Computer Vision and Pattern Recognition · Computer Science 2021-03-31 Katja Schwarz , Yiyi Liao , Michael Niemeyer , Andreas Geiger

Compositional 3D Scene Generation using Locally Conditioned Diffusion

Designing complex 3D scenes has been a tedious, manual process requiring domain expertise. Emerging text-to-3D generative models show great promise for making this task more intuitive, but existing approaches are limited to object-level…

Computer Vision and Pattern Recognition · Computer Science 2023-03-24 Ryan Po , Gordon Wetzstein

Gen-3Diffusion: Realistic Image-to-3D Generation via 2D & 3D Diffusion Synergy

Creating realistic 3D objects and clothed avatars from a single RGB image is an attractive yet challenging problem. Due to its ill-posed nature, recent works leverage powerful prior from 2D diffusion models pretrained on large datasets.…

Computer Vision and Pattern Recognition · Computer Science 2025-11-27 Yuxuan Xue , Xianghui Xie , Riccardo Marin , Gerard Pons-Moll

Realistic Image Synthesis with Configurable 3D Scene Layouts

Recent conditional image synthesis approaches provide high-quality synthesized images. However, it is still challenging to accurately adjust image contents such as the positions and orientations of objects, and synthesized images often have…

Computer Vision and Pattern Recognition · Computer Science 2021-08-25 Jaebong Jeong , Janghun Jo , Jingdong Wang , Sunghyun Cho , Jaesik Park

Points2Pix: 3D Point-Cloud to Image Translation using conditional Generative Adversarial Networks

We present the first approach for 3D point-cloud to image translation based on conditional Generative Adversarial Networks (cGAN). The model handles multi-modal information sources from different domains, i.e. raw point-sets and images. The…

Computer Vision and Pattern Recognition · Computer Science 2019-09-17 Stefan Milz , Martin Simon , Kai Fischer , Maximillian Pöpperl

Text-Image Conditioned Diffusion for Consistent Text-to-3D Generation

By lifting the pre-trained 2D diffusion models into Neural Radiance Fields (NeRFs), text-to-3D generation methods have made great progress. Many state-of-the-art approaches usually apply score distillation sampling (SDS) to optimize the…

Computer Vision and Pattern Recognition · Computer Science 2023-12-20 Yuze He , Yushi Bai , Matthieu Lin , Jenny Sheng , Yubin Hu , Qi Wang , Yu-Hui Wen , Yong-Jin Liu

3D-aware Image Generation and Editing with Multi-modal Conditions

3D-consistent image generation from a single 2D semantic label is an important and challenging research topic in computer graphics and computer vision. Although some related works have made great progress in this field, most of the existing…

Computer Vision and Pattern Recognition · Computer Science 2024-03-12 Bo Li , Yi-ke Li , Zhi-fen He , Bin Liu , Yun-Kun Lai

Conditional Image Synthesis with Diffusion Models: A Survey

Conditional image synthesis based on user-specified requirements is a key component in creating complex visual content. In recent years, diffusion-based generative modeling has become a highly effective way for conditional image synthesis,…

Computer Vision and Pattern Recognition · Computer Science 2025-06-03 Zheyuan Zhan , Defang Chen , Jian-Ping Mei , Zhenghe Zhao , Jiawei Chen , Chun Chen , Siwei Lyu , Can Wang

d-Sketch: Improving Visual Fidelity of Sketch-to-Image Translation with Pretrained Latent Diffusion Models without Retraining

Structural guidance in an image-to-image translation allows intricate control over the shapes of synthesized images. Generating high-quality realistic images from user-specified rough hand-drawn sketches is one such task that aims to impose…

Graphics · Computer Science 2025-02-24 Prasun Roy , Saumik Bhattacharya , Subhankar Ghosh , Umapada Pal , Michael Blumenstein

GIRAFFE: Representing Scenes as Compositional Generative Neural Feature Fields

Deep generative models allow for photorealistic image synthesis at high resolutions. But for many applications, this is not enough: content creation also needs to be controllable. While several recent works investigate how to disentangle…

Computer Vision and Pattern Recognition · Computer Science 2021-04-30 Michael Niemeyer , Andreas Geiger

3D-aware Image Generation using 2D Diffusion Models

In this paper, we introduce a novel 3D-aware image generation method that leverages 2D diffusion models. We formulate the 3D-aware image generation task as multiview 2D image set generation, and further to a sequential…

Computer Vision and Pattern Recognition · Computer Science 2023-04-03 Jianfeng Xiang , Jiaolong Yang , Binbin Huang , Xin Tong

World-consistent Video Diffusion with Explicit 3D Modeling

Recent advancements in diffusion models have set new benchmarks in image and video generation, enabling realistic visual synthesis across single- and multi-frame contexts. However, these models still struggle with efficiently and explicitly…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Qihang Zhang , Shuangfei Zhai , Miguel Angel Bautista , Kevin Miao , Alexander Toshev , Joshua Susskind , Jiatao Gu

Vector Quantized Image-to-Image Translation

Current image-to-image translation methods formulate the task with conditional generation models, leading to learning only the recolorization or regional changes as being constrained by the rich structural information provided by the…

Computer Vision and Pattern Recognition · Computer Science 2022-07-28 Yu-Jie Chen , Shin-I Cheng , Wei-Chen Chiu , Hung-Yu Tseng , Hsin-Ying Lee

Category-Aware 3D Object Composition with Disentangled Texture and Shape Multi-view Diffusion

In this paper, we tackle a new task of 3D object synthesis, where a 3D model is composited with another object category to create a novel 3D model. However, most existing text/image/3D-to-3D methods struggle to effectively integrate…

Computer Vision and Pattern Recognition · Computer Science 2025-09-03 Zeren Xiong , Zikun Chen , Zedong Zhang , Xiang Li , Ying Tai , Jian Yang , Jun Li

GenCAD: Image-Conditioned Computer-Aided Design Generation with Transformer-Based Contrastive Representation and Diffusion Priors

The creation of manufacturable and editable 3D shapes through Computer-Aided Design (CAD) remains a highly manual and time-consuming task, hampered by the complex topology of boundary representations of 3D solids and unintuitive design…

Computer Vision and Pattern Recognition · Computer Science 2025-04-10 Md Ferdous Alam , Faez Ahmed