English
Related papers

Related papers: Pose-Aware Diffusion for 3D Generation

200 papers

Diffusion models have demonstrated impressive capabilities in modeling complex data distributions and are increasingly applied in various generative tasks. In this work, we propose Pose Analysis by Diffusion Synthesis PADS, a unified…

Computer Vision and Pattern Recognition · Computer Science 2025-12-16 Haorui Ji , Hongdong Li

We propose SparseFusion, a sparse view 3D reconstruction approach that unifies recent advances in neural rendering and probabilistic image generation. Existing approaches typically build on neural rendering with re-projected features but…

Computer Vision and Pattern Recognition · Computer Science 2023-02-17 Zhizhuo Zhou , Shubham Tulsiani

We introduce GazeD, a new 3D gaze estimation method that jointly provides 3D gaze and human pose from a single RGB image. Leveraging the ability of diffusion models to deal with uncertainty, it generates multiple plausible 3D gaze and pose…

Computer Vision and Pattern Recognition · Computer Science 2026-01-26 Riccardo Catalini , Davide Di Nucci , Guido Borghi , Davide Davoli , Lorenzo Garattoni , Gianpiero Francesca , Yuki Kawana , Roberto Vezzani

Automated 3D scene generation is pivotal for applications spanning virtual reality, digital content creation, and Embodied AI. While computer graphics prioritizes aesthetic layouts, vision and robotics demand scenes that mirror real-world…

Graphics · Computer Science 2026-03-31 Minzhang Li , Kuixiang Shao , Xuebing Li , Yuyang Jiao , Yinuo Bai , Hengan Zhou , Sixian Shen , Jiayuan Gu , Jingyi Yu

Existing methods for 3D-aware image synthesis largely depend on the 3D pose distribution pre-estimated on the training set. An inaccurate estimation may mislead the model into learning faulty geometry. This work proposes PoF3D that frees…

Computer Vision and Pattern Recognition · Computer Science 2023-03-24 Zifan Shi , Yujun Shen , Yinghao Xu , Sida Peng , Yiyi Liao , Sheng Guo , Qifeng Chen , Dit-Yan Yeung

We present iFusion, a novel 3D object reconstruction framework that requires only two views with unknown camera poses. While single-view reconstruction yields visually appealing results, it can deviate significantly from the actual object,…

Computer Vision and Pattern Recognition · Computer Science 2023-12-29 Chin-Hsuan Wu , Yen-Chun Chen , Bolivar Solarte , Lu Yuan , Min Sun

Object pose estimation from a single view remains a challenging problem. In particular, partial observability, occlusions, and object symmetries eventually result in pose ambiguity. To account for this multimodality, this work proposes…

Computer Vision and Pattern Recognition · Computer Science 2024-12-03 Christian Möller , Niklas Funk , Jan Peters

We present SPAD, a novel approach for creating consistent multi-view images from text prompts or single images. To enable multi-view generation, we repurpose a pretrained 2D diffusion model by extending its self-attention layers with…

Computer Vision and Pattern Recognition · Computer Science 2024-02-09 Yash Kant , Ziyi Wu , Michael Vasilkovsky , Guocheng Qian , Jian Ren , Riza Alp Guler , Bernard Ghanem , Sergey Tulyakov , Igor Gilitschenski , Aliaksandr Siarohin

We propose a diffusion-based approach for Text-to-Image (T2I) generation with consistent and interactive 3D layout control and editing. While prior methods improve spatial adherence using 2D cues or iterative copy-warp-paste strategies,…

Computer Vision and Pattern Recognition · Computer Science 2026-01-21 Andrea Rigo , Luca Stornaiuolo , Weijie Wang , Mauro Martino , Bruno Lepri , Nicu Sebe

Controllable generation of 3D assets is important for many practical applications like content creation in movies, games and engineering, as well as in AR/VR. Recently, diffusion models have shown remarkable results in generation quality of…

Computer Vision and Pattern Recognition · Computer Science 2024-08-01 Philipp Schröppel , Christopher Wewer , Jan Eric Lenssen , Eddy Ilg , Thomas Brox

Modern learning-based approaches to 3D-aware image synthesis achieve high photorealism and 3D-consistent viewpoint changes for the generated images. Existing approaches represent instances in a shared canonical space. However, for…

Computer Vision and Pattern Recognition · Computer Science 2024-04-15 Katja Schwarz , Seung Wook Kim , Jun Gao , Sanja Fidler , Andreas Geiger , Karsten Kreis

Continuous diffusion models have demonstrated their effectiveness in addressing the inherent uncertainty and indeterminacy in monocular 3D human pose estimation (HPE). Despite their strengths, the need for large search spaces and the…

Computer Vision and Pattern Recognition · Computer Science 2024-05-28 Weiquan Wang , Jun Xiao , Chunping Wang , Wei Liu , Zhao Wang , Long Chen

Single-view 3D shape retrieval is a fundamental yet challenging task that is increasingly important with the growth of available 3D data. Existing approaches largely fall into two categories: those using contrastive learning to map point…

Computer Vision and Pattern Recognition · Computer Science 2026-04-30 Jiaxin Shi , Guofeng Zhang , Wufei Ma , Naifu Liang , Adam Kortylewski , Alan Yuille

Multi-view image diffusion models have significantly advanced open-domain 3D object generation. However, most existing models rely on 2D network architectures that lack inherent 3D biases, resulting in compromised geometric consistency. To…

Computer Vision and Pattern Recognition · Computer Science 2025-02-21 Hansheng Chen , Bokui Shen , Yulin Liu , Ruoxi Shi , Linqi Zhou , Connor Z. Lin , Jiayuan Gu , Hao Su , Gordon Wetzstein , Leonidas Guibas

Humans can infer 3D structure from 2D images of an object based on past experience and improve their 3D understanding as they see more images. Inspired by this behavior, we introduce SAP3D, a system for 3D reconstruction and novel view…

Computer Vision and Pattern Recognition · Computer Science 2024-04-05 Xinyang Han , Zelin Gao , Angjoo Kanazawa , Shubham Goel , Yossi Gandelsman

Monocular 3D human pose estimation is quite challenging due to the inherent ambiguity and occlusion, which often lead to high uncertainty and indeterminacy. On the other hand, diffusion models have recently emerged as an effective tool for…

Computer Vision and Pattern Recognition · Computer Science 2023-04-11 Jia Gong , Lin Geng Foo , Zhipeng Fan , Qiuhong Ke , Hossein Rahmani , Jun Liu

Face aging is the process of converting an individual's appearance to a younger or older version of themselves. Existing face aging techniques have been limited to 2D settings, which often weaken their applications as there is a growing…

Computer Vision and Pattern Recognition · Computer Science 2024-08-29 Junaid Wahid , Fangneng Zhan , Pramod Rao , Christian Theobalt

Existing autoregressive (AR) methods for generating artist-designed meshes struggle to balance global structural consistency with high-fidelity local details, and are susceptible to error accumulation. To address this, we propose…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Yichen Yang , Hong Li , Haodong Zhu , Linin Yang , Guojun Lei , Sheng Xu , Baochang Zhang

We present a diffusion-based model for 3D-aware generative novel view synthesis from as few as a single input image. Our model samples from the distribution of possible renderings consistent with the input and, even in the presence of…

Computer Vision and Pattern Recognition · Computer Science 2023-04-06 Eric R. Chan , Koki Nagano , Matthew A. Chan , Alexander W. Bergman , Jeong Joon Park , Axel Levy , Miika Aittala , Shalini De Mello , Tero Karras , Gordon Wetzstein

While generalizable 3D Gaussian splatting enables efficient, high-quality rendering of unseen scenes, it heavily depends on precise camera poses for accurate geometry. In real-world scenarios, obtaining accurate poses is challenging,…

Computer Vision and Pattern Recognition · Computer Science 2025-10-22 Youngju Na , Taeyeon Kim , Jumin Lee , Kyu Beom Han , Woo Jae Kim , Sung-eui Yoon
‹ Prev 1 2 3 10 Next ›