Related papers: D3PRefiner: A Diffusion-based Denoise Method for 3…

DiffPose: Toward More Reliable 3D Pose Estimation

Monocular 3D human pose estimation is quite challenging due to the inherent ambiguity and occlusion, which often lead to high uncertainty and indeterminacy. On the other hand, diffusion models have recently emerged as an effective tool for…

Computer Vision and Pattern Recognition · Computer Science · 2023-04-11 · Jia Gong, Lin Geng Foo, Zhipeng Fan, Qiuhong Ke, Hossein Rahmani, Jun Liu

Diffusion-based Pose Refinement and Multi-hypothesis Generation for 3D Human Pose Estimation

Previous probabilistic models for 3D Human Pose Estimation (3DHPE) aimed to enhance pose accuracy by generating multiple hypotheses. However, most of the hypotheses generated deviate substantially from the true pose. Compared to…

Computer Vision and Pattern Recognition · Computer Science · 2024-01-11 · Hongbo Kang, Yong Wang, Mengyuan Liu, Doudou Wu, Peng Liu, Xinlin Yuan, Wenming Yang

DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model

Thanks to the development of 2D keypoint detectors, monocular 3D human pose estimation (HPE) via 2D-to-3D uplifting approaches have achieved remarkable improvements. Still, monocular 3D HPE is a challenging problem due to the inherent depth…

Computer Vision and Pattern Recognition · Computer Science · 2023-08-04 · Jeongjun Choi, Dongseok Shim, H. Jin Kim

DPoser: Diffusion Model as Robust 3D Human Pose Prior

This work targets to construct a robust human pose prior. However, it remains a persistent challenge due to biomechanical constraints and diverse human movements. Traditional priors like VAEs and NDFs often exhibit shortcomings in realism…

Computer Vision and Pattern Recognition · Computer Science · 2024-03-26 · Junzhe Lu, Jing Lin, Hongkun Dou, Ailing Zeng, Yue Deng, Yulun Zhang, Haoqian Wang

HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation

Monocular 3D human pose estimation (HPE) often encounters challenges such as depth ambiguity and occlusion during the 2D-to-3D lifting process. Additionally, traditional methods may overlook multi-scale skeleton features when utilizing…

Computer Vision and Pattern Recognition · Computer Science · 2025-08-21 · Bing Han, Yuhua Huang, Pan Gao

Denoising Diffusion for 3D Hand Pose Estimation from Images

Hand pose estimation from a single image has many applications. However, approaches to full 3D body pose estimation are typically trained on day-to-day activities or actions. As such, detailed hand-to-hand interactions are poorly…

Computer Vision and Pattern Recognition · Computer Science · 2023-08-21 · Maksym Ivashechkin, Oscar Mendez, Richard Bowden

A generic diffusion-based approach for 3D human pose prediction in the wild

Predicting 3D human poses in real-world scenarios, also known as human pose forecasting, is inevitably subject to noisy inputs arising from inaccurate 3D pose estimations and occlusions. To address these challenges, we propose a…

Computer Vision and Pattern Recognition · Computer Science · 2023-03-16 · Saeed Saadatnejad, Ali Rasekh, Mohammadreza Mofayezi, Yasamin Medghalchi, Sara Rajabzadeh, Taylor Mordan, Alexandre Alahi

Diffusion-Based 3D Human Pose Estimation with Multi-Hypothesis Aggregation

In this paper, a novel Diffusion-based 3D Pose estimation (D3DP) method with Joint-wise reProjection-based Multi-hypothesis Aggregation (JPMA) is proposed for probabilistic 3D human pose estimation. On the one hand, D3DP generates multiple…

Computer Vision and Pattern Recognition · Computer Science · 2023-08-24 · Wenkang Shan, Zhenhua Liu, Xinfeng Zhang, Zhao Wang, Kai Han, Shanshe Wang, Siwei Ma, Wen Gao

Flexible Geometric Guidance for Probabilistic Human Pose Estimation with Diffusion Models

3D human pose estimation from 2D images is a challenging problem due to depth ambiguity and occlusion. Because of these challenges the task is underdetermined, where there exists multiple -- possibly infinite -- poses that are plausible…

Computer Vision and Pattern Recognition · Computer Science · 2026-02-04 · Francis Snelgar, Ming Xu, Stephen Gould, Liang Zheng, Akshay Asthana

FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models

The 3D Human Pose Estimation (3D HPE) task uses 2D images or videos to predict human joint coordinates in 3D space. Despite recent advancements in deep learning-based methods, they mostly ignore the capability of coupling accessible texts…

Computer Vision and Pattern Recognition · Computer Science · 2024-05-09 · Jinglin Xu, Yijie Guo, Yuxin Peng

DeProPose: Deficiency-Proof 3D Human Pose Estimation via Adaptive Multi-View Fusion

3D human pose estimation has wide applications in fields such as intelligent surveillance, motion capture, and virtual reality. However, in real-world scenarios, issues such as occlusion, noise interference, and missing viewpoints can…

Computer Vision and Pattern Recognition · Computer Science · 2025-02-25 · Jianbin Jiao, Xina Cheng, Kailun Yang, Xiangrong Zhang, Licheng Jiao

DreamPose3D: Hallucinative Diffusion with Prompt Learning for 3D Human Pose Estimation

Accurate 3D human pose estimation remains a critical yet unresolved challenge, requiring both temporal coherence across frames and fine-grained modeling of joint relationships. However, most existing methods rely solely on geometric cues…

Computer Vision and Pattern Recognition · Computer Science · 2025-11-13 · Jerrin Bright, Yuhao Chen, John S. Zelek

DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion

We present an innovative approach to 3D Human Pose Estimation (3D-HPE) by integrating cutting-edge diffusion models, which have revolutionized diverse fields, but are relatively unexplored in 3D-HPE. We show that diffusion models enhance…

Computer Vision and Pattern Recognition · Computer Science · 2023-09-06 · Cédric Rommel, Eduardo Valle, Mickaël Chen, Souhaiel Khalfaoui, Renaud Marlet, Matthieu Cord, Patrick Pérez

MonoSE(3)-Diffusion: A Monocular SE(3) Diffusion Framework for Robust Camera-to-Robot Pose Estimation

We propose MonoSE(3)-Diffusion, a monocular SE(3) diffusion framework that formulates markerless, image-based robot pose estimation as a conditional denoising diffusion process. The framework consists of two processes: a…

Computer Vision and Pattern Recognition · Computer Science · 2025-10-14 · Kangjian Zhu, Haobo Jiang, Yigong Zhang, Jianjun Qian, Jian Yang, Jin Xie

$\text{Di}^2\text{Pose}$: Discrete Diffusion Model for Occluded 3D Human Pose Estimation

Continuous diffusion models have demonstrated their effectiveness in addressing the inherent uncertainty and indeterminacy in monocular 3D human pose estimation (HPE). Despite their strengths, the need for large search spaces and the…

Computer Vision and Pattern Recognition · Computer Science · 2024-05-28 · Weiquan Wang, Jun Xiao, Chunping Wang, Wei Liu, Zhao Wang, Long Chen

DiffPose: Multi-hypothesis Human Pose Estimation using Diffusion models

Traditionally, monocular 3D human pose estimation employs a machine learning model to predict the most likely 3D pose for a given input image. However, a single image can be highly ambiguous and induces multiple plausible solutions for the…

Computer Vision and Pattern Recognition · Computer Science · 2022-11-30 · Karl Holmquist, Bastian Wandt

ID-Pose: Sparse-view Camera Pose Estimation by Inverting Diffusion Models

Given sparse views of a 3D object, estimating their camera poses is a long-standing and intractable problem. Toward this goal, we consider harnessing the pre-trained diffusion model of novel views conditioned on viewpoints (Zero-1-to-3). We…

Computer Vision and Pattern Recognition · Computer Science · 2023-12-01 · Weihao Cheng, Yan-Pei Cao, Ying Shan

DiffBody: Diffusion-based Pose and Shape Editing of Human Images

Pose and body shape editing in a human image has received increasing attention. However, current methods often struggle with dataset biases and deteriorate realism and the person's identity when users make large edits. We propose a one-shot…

Computer Vision and Pattern Recognition · Computer Science · 2024-01-09 · Yuta Okuyama, Yuki Endo, Yoshihiro Kanamori

SnapPose3D: Diffusion-Based Single-Frame 2D-to-3D Lifting of Human Poses

Depth ambiguity and joint uncertainty are the two main obstacles in obtaining accurate human pose predictions by 2D-to-3D lifting methods proposed in the literature. In particular, these issues are caused by 2D joint locations that can be…

Computer Vision and Pattern Recognition · Computer Science · 2026-04-30 · Alessandro Simoni, Riccardo Catalini, Davide Di Nucci, Guido Borghi, Davide Davoli, Lorenzo Garattoni, Gianpiero Francesca, Yuki Kawana, Roberto Vezzani

Diffusion Model is a Good Pose Estimator from 3D RF-Vision

Human pose estimation (HPE) from Radio Frequency vision (RF-vision) performs human sensing using RF signals that penetrate obstacles without revealing privacy (e.g., facial information). Recently, mmWave radar has emerged as a promising…

Computer Vision and Pattern Recognition · Computer Science · 2024-07-23 · Junqiao Fan, Jianfei Yang, Yuecong Xu, Lihua Xie