Related papers: Learning Complex 3D Human Self-Contact

Reconstructing Three-Dimensional Models of Interacting Humans

Understanding 3d human interactions is fundamental for fine-grained scene analysis and behavioural modeling. However, most of the existing models predict incorrect, lifeless 3d estimates, that miss the subtle human contact aspects--the…

Computer Vision and Pattern Recognition · Computer Science 2023-08-07 Mihai Fieraru , Mihai Zanfir , Elisabeta Oneata , Alin-Ionut Popa , Vlad Olaru , Cristian Sminchisescu

SPARK: Self-supervised Personalized Real-time Monocular Face Capture

Feedforward monocular face capture methods seek to reconstruct posed faces from a single image of a person. Current state of the art approaches have the ability to regress parametric 3D face models in real-time across a wide range of…

Computer Vision and Pattern Recognition · Computer Science 2024-09-13 Kelian Baert , Shrisha Bharadwaj , Fabien Castan , Benoit Maujean , Marc Christie , Victoria Abrevaya , Adnane Boukhayma

MonoCap: Monocular Human Motion Capture using a CNN Coupled with a Geometric Prior

Recovering 3D full-body human pose is a challenging problem with many applications. It has been successfully addressed by motion capture systems with body worn markers and multiple cameras. In this paper, we address the more challenging…

Computer Vision and Pattern Recognition · Computer Science 2018-03-12 Xiaowei Zhou , Menglong Zhu , Georgios Pavlakos , Spyridon Leonardos , Kostantinos G. Derpanis , Kostas Daniilidis

Learning Explicit Contact for Implicit Reconstruction of Hand-held Objects from Monocular Images

Reconstructing hand-held objects from monocular RGB images is an appealing yet challenging task. In this task, contacts between hands and objects provide important cues for recovering the 3D geometry of the hand-held objects. Though recent…

Computer Vision and Pattern Recognition · Computer Science 2024-01-17 Junxing Hu , Hongwen Zhang , Zerui Chen , Mengcheng Li , Yunlong Wang , Yebin Liu , Zhenan Sun

Estimating 3D Motion and Forces of Person-Object Interactions from Monocular Video

In this paper, we introduce a method to automatically reconstruct the 3D motion of a person interacting with an object from a single RGB video. Our method estimates the 3D poses of the person and the object, contact positions, and forces…

Computer Vision and Pattern Recognition · Computer Science 2019-06-18 Zongmian Li , Jiri Sedlar , Justin Carpentier , Ivan Laptev , Nicolas Mansard , Josef Sivic

GraphiContact: Pose-aware Human-Scene Robust Contact Perception for Interactive Systems

Monocular vertex-level human-scene contact prediction is a fundamental capability for interactive systems such as assistive monitoring, embodied AI, and rehabilitation analysis. In this work, we study this task jointly with single-image 3D…

Computer Vision and Pattern Recognition · Computer Science 2026-03-24 Xiaojian Lin , Yaomin Shen , Junyuan Ma , Yujie Sun , Chengqing Bu , Wenxin Zhang , Zongzheng Zhang , Hao Fei , Lei Jin , Hao Zhao

Single-image coherent reconstruction of objects and humans

Existing methods for reconstructing objects and humans from a monocular image suffer from severe mesh collisions and performance limitations for interacting occluding objects. This paper introduces a method to obtain a globally consistent…

Computer Vision and Pattern Recognition · Computer Science 2024-08-16 Sarthak Batra , Partha P. Chakrabarti , Simon Hadfield , Armin Mustafa

On Self-Contact and Human Pose

People touch their face 23 times an hour, they cross their arms and legs, put their hands on their hips, etc. While many images of people contain some form of self-contact, current 3D human pose and shape (HPS) regression methods typically…

Computer Vision and Pattern Recognition · Computer Science 2021-04-09 Lea Müller , Ahmed A. A. Osman , Siyu Tang , Chun-Hao P. Huang , Michael J. Black

Synthetic Training for Monocular Human Mesh Recovery

Recovering 3D human mesh from monocular images is a popular topic in computer vision and has a wide range of applications. This paper aims to estimate 3D mesh of multiple body parts (e.g., body, hands) with large-scale differences from a…

Computer Vision and Pattern Recognition · Computer Science 2020-10-28 Yu Sun , Qian Bao , Wu Liu , Wenpeng Gao , Yili Fu , Chuang Gan , Tao Mei

Towards Accurate Reconstruction of 3D Scene Shape from A Single Monocular Image

Despite significant progress made in the past few years, challenges remain for depth estimation using a single monocular image. First, it is nontrivial to train a metric-depth prediction model that can generalize well to diverse scenes…

Computer Vision and Pattern Recognition · Computer Science 2022-09-07 Wei Yin , Jianming Zhang , Oliver Wang , Simon Niklaus , Simon Chen , Yifan Liu , Chunhua Shen

Toward a Real-Time Framework for Accurate Monocular 3D Human Pose Estimation with Geometric Priors

Monocular 3D human pose estimation remains a challenging and ill-posed problem, particularly in real-time settings and unconstrained environments. While direct imageto-3D approaches require large annotated datasets and heavy models,…

Computer Vision and Pattern Recognition · Computer Science 2025-07-24 Mohamed Adjel

Coupling Top-down and Bottom-up Methods for 3D Human Pose and Shape Estimation from Monocular Image Sequences

Until recently Intelligence, Surveillance, and Reconnaissance (ISR) focused on acquiring behavioral information of the targets and their activities. Continuous evolution of intelligence being gathered of the human centric activities has put…

Computer Vision and Pattern Recognition · Computer Science 2014-10-07 Atul Kanaujia

UniCon3R: Unified Contact-aware 4D Human-Scene Reconstruction from Monocular Video

We introduce UniCon3R, a unified feed-forward framework for online human-scene 4D reconstruction from monocular video. Current feed-forward human-scene reconstruction methods suffer from artifacts, where bodies float above the ground or…

Computer Vision and Pattern Recognition · Computer Science 2026-05-12 Tanuj Sur , Shashank Tripathi , Nikos Athanasiou , Ha Linh Nguyen , Kai Xu , Michael J. Black , Angela Yao

Recovering 3D Human Mesh from Monocular Images: A Survey

Estimating human pose and shape from monocular images is a long-standing problem in computer vision. Since the release of statistical body models, 3D human mesh recovery has been drawing broader attention. With the same goal of obtaining…

Computer Vision and Pattern Recognition · Computer Science 2024-01-03 Yating Tian , Hongwen Zhang , Yebin Liu , Limin Wang

3D Human Shape and Pose from a Single Low-Resolution Image with Self-Supervised Learning

3D human shape and pose estimation from monocular images has been an active area of research in computer vision, having a substantial impact on the development of new applications, from activity recognition to creating virtual avatars.…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Xiangyu Xu , Hao Chen , Francesc Moreno-Noguer , Laszlo A. Jeni , Fernando De la Torre

3D Object Aided Self-Supervised Monocular Depth Estimation

Monocular depth estimation has been actively studied in fields such as robot vision, autonomous driving, and 3D scene understanding. Given a sequence of color images, unsupervised learning methods based on the framework of…

Computer Vision and Pattern Recognition · Computer Science 2022-12-06 Songlin Wei , Guodong Chen , Wenzheng Chi , Zhenhua Wang , Lining Sun

S$^2$Contact: Graph-based Network for 3D Hand-Object Contact Estimation with Semi-Supervised Learning

Despite the recent efforts in accurate 3D annotations in hand and object datasets, there still exist gaps in 3D hand and object reconstructions. Existing works leverage contact maps to refine inaccurate hand-object pose estimations and…

Computer Vision and Pattern Recognition · Computer Science 2023-08-04 Tze Ho Elden Tse , Zhongqun Zhang , Kwang In Kim , Ales Leonardis , Feng Zheng , Hyung Jin Chang

Self-Supervised Multi-View Synchronization Learning for 3D Pose Estimation

Current state-of-the-art methods cast monocular 3D human pose estimation as a learning problem by training neural networks on large data sets of images and corresponding skeleton poses. In contrast, we propose an approach that can exploit…

Computer Vision and Pattern Recognition · Computer Science 2020-10-14 Simon Jenni , Paolo Favaro

HULC: 3D Human Motion Capture with Pose Manifold Sampling and Dense Contact Guidance

Marker-less monocular 3D human motion capture (MoCap) with scene interactions is a challenging research topic relevant for extended reality, robotics and virtual avatar generation. Due to the inherent depth ambiguity of monocular settings,…

Computer Vision and Pattern Recognition · Computer Science 2022-07-27 Soshi Shimada , Vladislav Golyanik , Zhi Li , Patrick Pérez , Weipeng Xu , Christian Theobalt

COSMU: Complete 3D human shape from monocular unconstrained images

We present a novel framework to reconstruct complete 3D human shapes from a given target image by leveraging monocular unconstrained images. The objective of this work is to reproduce high-quality details in regions of the reconstructed…

Computer Vision and Pattern Recognition · Computer Science 2024-07-16 Marco Pesavento , Marco Volino , Adrian Hilton