Related papers: Learning Structure-Guided Diffusion Model for 2D H…

DiffuPose: Monocular 3D Human Pose Estimation via Denoising Diffusion Probabilistic Model

Thanks to the development of 2D keypoint detectors, monocular 3D human pose estimation (HPE) via 2D-to-3D uplifting approaches have achieved remarkable improvements. Still, monocular 3D HPE is a challenging problem due to the inherent depth…

Computer Vision and Pattern Recognition · Computer Science 2023-08-04 Jeongjun Choi , Dongseok Shim , H. Jin Kim

DiffPose: SpatioTemporal Diffusion Model for Video-Based Human Pose Estimation

Denoising diffusion probabilistic models that were initially proposed for realistic image generation have recently shown success in various perception tasks (e.g., object detection and image segmentation) and are increasingly gaining…

Computer Vision and Pattern Recognition · Computer Science 2023-08-08 Runyang Feng , Yixing Gao , Tze Ho Elden Tse , Xueqing Ma , Hyung Jin Chang

DiffPose: Multi-hypothesis Human Pose Estimation using Diffusion models

Traditionally, monocular 3D human pose estimation employs a machine learning model to predict the most likely 3D pose for a given input image. However, a single image can be highly ambiguous and induces multiple plausible solutions for the…

Computer Vision and Pattern Recognition · Computer Science 2022-11-30 Karl Holmquist , Bastian Wandt

Flexible Geometric Guidance for Probabilistic Human Pose Estimation with Diffusion Models

3D human pose estimation from 2D images is a challenging problem due to depth ambiguity and occlusion. Because of these challenges the task is underdetermined, where there exists multiple -- possibly infinite -- poses that are plausible…

Computer Vision and Pattern Recognition · Computer Science 2026-02-04 Francis Snelgar , Ming Xu , Stephen Gould , Liang Zheng , Akshay Asthana

DiffHPE: Robust, Coherent 3D Human Pose Lifting with Diffusion

We present an innovative approach to 3D Human Pose Estimation (3D-HPE) by integrating cutting-edge diffusion models, which have revolutionized diverse fields, but are relatively unexplored in 3D-HPE. We show that diffusion models enhance…

Computer Vision and Pattern Recognition · Computer Science 2023-09-06 Cédric Rommel , Eduardo Valle , Mickaël Chen , Souhaiel Khalfaoui , Renaud Marlet , Matthieu Cord , Patrick Pérez

FinePOSE: Fine-Grained Prompt-Driven 3D Human Pose Estimation via Diffusion Models

The 3D Human Pose Estimation (3D HPE) task uses 2D images or videos to predict human joint coordinates in 3D space. Despite recent advancements in deep learning-based methods, they mostly ignore the capability of coupling accessible texts…

Computer Vision and Pattern Recognition · Computer Science 2024-05-09 Jinglin Xu , Yijie Guo , Yuxin Peng

Diffusion-based Pose Refinement and Muti-hypothesis Generation for 3D Human Pose Estimaiton

Previous probabilistic models for 3D Human Pose Estimation (3DHPE) aimed to enhance pose accuracy by generating multiple hypotheses. However, most of the hypotheses generated deviate substantially from the true pose. Compared to…

Computer Vision and Pattern Recognition · Computer Science 2024-01-11 Hongbo Kang , Yong Wang , Mengyuan Liu , Doudou Wu , Peng Liu , Xinlin Yuan , Wenming Yang

DiffPose: Toward More Reliable 3D Pose Estimation

Monocular 3D human pose estimation is quite challenging due to the inherent ambiguity and occlusion, which often lead to high uncertainty and indeterminacy. On the other hand, diffusion models have recently emerged as an effective tool for…

Computer Vision and Pattern Recognition · Computer Science 2023-04-11 Jia Gong , Lin Geng Foo , Zhipeng Fan , Qiuhong Ke , Hossein Rahmani , Jun Liu

HyperDiff: Hypergraph Guided Diffusion Model for 3D Human Pose Estimation

Monocular 3D human pose estimation (HPE) often encounters challenges such as depth ambiguity and occlusion during the 2D-to-3D lifting process. Additionally, traditional methods may overlook multi-scale skeleton features when utilizing…

Computer Vision and Pattern Recognition · Computer Science 2025-08-21 Bing Han , Yuhua Huang , Pan Gao

Simple Multi-Resolution Representation Learning for Human Pose Estimation

Human pose estimation - the process of recognizing human keypoints in a given image - is one of the most important tasks in computer vision and has a wide range of applications including movement diagnostics, surveillance, or self-driving…

Computer Vision and Pattern Recognition · Computer Science 2021-01-25 Trung Q. Tran , Giang V. Nguyen , Daeyoung Kim

6D-Diff: A Keypoint Diffusion Framework for 6D Object Pose Estimation

Estimating the 6D object pose from a single RGB image often involves noise and indeterminacy due to challenges such as occlusions and cluttered backgrounds. Meanwhile, diffusion models have shown appealing performance in generating…

Computer Vision and Pattern Recognition · Computer Science 2024-03-25 Li Xu , Haoxuan Qu , Yujun Cai , Jun Liu

Diffusion Model is a Good Pose Estimator from 3D RF-Vision

Human pose estimation (HPE) from Radio Frequency vision (RF-vision) performs human sensing using RF signals that penetrate obstacles without revealing privacy (e.g., facial information). Recently, mmWave radar has emerged as a promising…

Computer Vision and Pattern Recognition · Computer Science 2024-07-23 Junqiao Fan , Jianfei Yang , Yuecong Xu , Lihua Xie

$\text{Di}^2\text{Pose}$: Discrete Diffusion Model for Occluded 3D Human Pose Estimation

Continuous diffusion models have demonstrated their effectiveness in addressing the inherent uncertainty and indeterminacy in monocular 3D human pose estimation (HPE). Despite their strengths, the need for large search spaces and the…

Computer Vision and Pattern Recognition · Computer Science 2024-05-28 Weiquan Wang , Jun Xiao , Chunping Wang , Wei Liu , Zhao Wang , Long Chen

FastDDHPose: Towards Unified, Efficient, and Disentangled 3D Human Pose Estimation

Recent approaches for monocular 3D human pose estimation (3D HPE) have achieved leading performance by directly regressing 3D poses from 2D keypoint sequences. Despite the rapid progress in 3D HPE, existing methods are typically trained and…

Computer Vision and Pattern Recognition · Computer Science 2025-12-17 Qingyuan Cai , Linxin Zhang , Xuecai Hu , Saihui Hou , Yongzhen Huang

Learning Heatmap-Style Jigsaw Puzzles Provides Good Pretraining for 2D Human Pose Estimation

The target of 2D human pose estimation is to locate the keypoints of body parts from input 2D images. State-of-the-art methods for pose estimation usually construct pixel-wise heatmaps from keypoints as labels for learning convolution…

Computer Vision and Pattern Recognition · Computer Science 2020-12-15 Kun Zhang , Rui Wu , Ping Yao , Kai Deng , Ding Li , Renbiao Liu , Chuanguang Yang , Ge Chen , Min Du , Tianyao Zheng

DiffPose-Animal: A Language-Conditioned Diffusion Framework for Animal Pose Estimation

Animal pose estimation is a fundamental task in computer vision, with growing importance in ecological monitoring, behavioral analysis, and intelligent livestock management. Compared to human pose estimation, animal pose estimation is more…

Computer Vision and Pattern Recognition · Computer Science 2025-12-16 Tianyu Xiong , Dayi Tan , Wei Tian

Disentangled Diffusion-Based 3D Human Pose Estimation with Hierarchical Spatial and Temporal Denoiser

Recently, diffusion-based methods for monocular 3D human pose estimation have achieved state-of-the-art (SOTA) performance by directly regressing the 3D joint coordinates from the 2D pose sequence. Although some methods decompose the task…

Computer Vision and Pattern Recognition · Computer Science 2025-07-01 Qingyuan Cai , Xuecai Hu , Saihui Hou , Li Yao , Yongzhen Huang

Back to Optimization: Diffusion-based Zero-Shot 3D Human Pose Estimation

Learning-based methods have dominated the 3D human pose estimation (HPE) tasks with significantly better performance in most benchmarks than traditional optimization-based methods. Nonetheless, 3D HPE in the wild is still the biggest…

Computer Vision and Pattern Recognition · Computer Science 2023-10-26 Zhongyu Jiang , Zhuoran Zhou , Lei Li , Wenhao Chai , Cheng-Yen Yang , Jenq-Neng Hwang

StarPose: 3D Human Pose Estimation via Spatial-Temporal Autoregressive Diffusion

Monocular 3D human pose estimation remains a challenging task due to inherent depth ambiguities and occlusions. Compared to traditional methods based on Transformers or Convolutional Neural Networks (CNNs), recent diffusion-based approaches…

Computer Vision and Pattern Recognition · Computer Science 2025-08-12 Haoxin Yang , Weihong Chen , Xuemiao Xu , Cheng Xu , Peng Xiao , Cuifeng Sun , Shaoyu Huang , Shengfeng He

2D Human Pose Estimation: A Survey

Human pose estimation aims at localizing human anatomical keypoints or body parts in the input data (e.g., images, videos, or signals). It forms a crucial component in enabling machines to have an insightful understanding of the behaviors…

Computer Vision and Pattern Recognition · Computer Science 2022-04-18 Haoming Chen , Runyang Feng , Sifan Wu , Hao Xu , Fengcheng Zhou , Zhenguang Liu