Related papers: InterTrack: Tracking Human Object Interaction with…

InterTracker: Discovering and Tracking General Objects Interacting with Hands in the Wild

Understanding human interaction with objects is an important research topic for embodied Artificial Intelligence and identifying the objects that humans are interacting with is a primary problem for interaction understanding. Existing…

Computer Vision and Pattern Recognition · Computer Science 2023-08-15 Yanyan Shao , Qi Ye , Wenhan Luo , Kaihao Zhang , Jiming Chen

InterCap: Joint Markerless 3D Tracking of Humans and Objects in Interaction

Humans constantly interact with daily objects to accomplish tasks. To understand such interactions, computers need to reconstruct these from cameras observing whole-body interaction with scenes. This is challenging due to occlusion between…

Computer Vision and Pattern Recognition · Computer Science 2022-10-04 Yinghao Huang , Omid Tehari , Michael J. Black , Dimitrios Tzionas

Dynamic Template Tracking and Recognition

In this paper we address the problem of tracking non-rigid objects whose local appearance and motion changes as a function of time. This class of objects includes dynamic textures such as steam, fire, smoke, water, etc., as well as…

Computer Vision and Pattern Recognition · Computer Science 2012-04-23 Rizwan Chaudhry , Gregory Hager , Rene Vidal

Tracking by 3D Model Estimation of Unknown Objects in Videos

Most model-free visual object tracking methods formulate the tracking task as object location estimation given by a 2D segmentation or a bounding box in each video frame. We argue that this representation is limited and instead propose to…

Computer Vision and Pattern Recognition · Computer Science 2023-04-14 Denys Rozumnyi , Jiri Matas , Marc Pollefeys , Vittorio Ferrari , Martin R. Oswald

TRec: Learning Hand-Object Interactions through 2D Point Track Motion

We present a novel approach for hand-object action recognition that leverages 2D point tracks as an additional motion cue. While most existing methods rely on RGB appearance, human pose estimation, or their combination, our work…

Computer Vision and Pattern Recognition · Computer Science 2026-01-12 Dennis Holzmann , Sven Wachsmuth

A Data-driven Approach for Human Pose Tracking Based on Spatio-temporal Pictorial Structure

In this paper, we present a data-driven approach for human pose tracking in video data. We formulate the human pose tracking problem as a discrete optimization problem based on spatio-temporal pictorial structure model and solve this…

Computer Vision and Pattern Recognition · Computer Science 2016-08-02 Soumitra Samanta , Bhabatosh Chanda

Correspondence-free online human motion retargeting

We present a data-driven framework for unsupervised human motion retargeting that animates a target subject with the motion of a source subject. Our method is correspondence-free, requiring neither spatial correspondences between the source…

Computer Vision and Pattern Recognition · Computer Science 2024-03-05 Rim Rekik , Mathieu Marsot , Anne-Hélène Olivier , Jean-Sébastien Franco , Stefanie Wuhrer

Visibility Aware Human-Object Interaction Tracking from Single RGB Camera

Capturing the interactions between humans and their environment in 3D is important for many applications in robotics, graphics, and vision. Recent works to reconstruct the 3D human and object from a single RGB image do not have consistent…

Computer Vision and Pattern Recognition · Computer Science 2023-11-01 Xianghui Xie , Bharat Lal Bhatnagar , Gerard Pons-Moll

InterPose: Learning to Generate Human-Object Interactions from Large-Scale Web Videos

Human motion generation has shown great advances thanks to the recent diffusion models trained on large-scale motion capture data. Most of existing works, however, currently target animation of isolated people in empty scenes. Meanwhile,…

Computer Vision and Pattern Recognition · Computer Science 2025-12-23 Yangsong Zhang , Abdul Ahad Butt , Gül Varol , Ivan Laptev

BEHAVE: Dataset and Method for Tracking Human Object Interactions

Modelling interactions between humans and objects in natural environments is central to many applications including gaming, virtual and mixed reality, as well as human behavior analysis and human-robot collaboration. This challenging…

Computer Vision and Pattern Recognition · Computer Science 2022-04-15 Bharat Lal Bhatnagar , Xianghui Xie , Ilya A. Petrov , Cristian Sminchisescu , Christian Theobalt , Gerard Pons-Moll

Collecting Consistently High Quality Object Tracks with Minimal Human Involvement by Using Self-Supervised Learning to Detect Tracker Errors

We propose a hybrid framework for consistently producing high-quality object tracks by combining an automated object tracker with little human input. The key idea is to tailor a module for each dataset to intelligently decide when an object…

Computer Vision and Pattern Recognition · Computer Science 2024-05-07 Samreen Anjum , Suyog Jain , Danna Gurari

Attend and Interact: Higher-Order Object Interactions for Video Understanding

Human actions often involve complex interactions across several inter-related objects in the scene. However, existing approaches to fine-grained video understanding or visual relationship detection often rely on single object representation…

Computer Vision and Pattern Recognition · Computer Science 2018-03-22 Chih-Yao Ma , Asim Kadav , Iain Melvin , Zsolt Kira , Ghassan AlRegib , Hans Peter Graf

Self-Attentive 3D Human Pose and Shape Estimation from Videos

We consider the task of estimating 3D human pose and shape from videos. While existing frame-based approaches have made significant progress, these methods are independently applied to each image, thereby often leading to inconsistent…

Computer Vision and Pattern Recognition · Computer Science 2021-09-08 Yun-Chun Chen , Marco Piccirilli , Robinson Piramuthu , Ming-Hsuan Yang

ArtTrack: Articulated Multi-person Tracking in the Wild

In this paper we propose an approach for articulated tracking of multiple people in unconstrained videos. Our starting point is a model that resembles existing architectures for single-frame pose estimation but is substantially faster. We…

Computer Vision and Pattern Recognition · Computer Science 2017-05-10 Eldar Insafutdinov , Mykhaylo Andriluka , Leonid Pishchulin , Siyu Tang , Evgeny Levinkov , Bjoern Andres , Bernt Schiele

Neural Rendering and Reenactment of Human Actor Videos

We propose a method for generating video-realistic animations of real humans under user control. In contrast to conventional human character rendering, we do not require the availability of a production-quality photo-realistic 3D model of…

Computer Vision and Pattern Recognition · Computer Science 2019-05-13 Lingjie Liu , Weipeng Xu , Michael Zollhoefer , Hyeongwoo Kim , Florian Bernard , Marc Habermann , Wenping Wang , Christian Theobalt

ComPose: When to Trust Hands for Object Pose Tracking

Reconstructing the motion of objects from videos is a key component for embodied AI and robot manipulation. While diverse approaches to object pose tracking have been studied, they rely heavily on strong external priors, such as depth data…

Computer Vision and Pattern Recognition · Computer Science 2026-05-25 Jisu Shin , Junoh Lee , JunGyu Lee , Inhwan Bae , Dohyeon Lee , Hokyun Im , Youngwoon Lee , Hae-Gon Jeon

Self-supervised Keypoint Correspondences for Multi-Person Pose Estimation and Tracking in Videos

Video annotation is expensive and time consuming. Consequently, datasets for multi-person pose estimation and tracking are less diverse and have more sparse annotations compared to large scale image datasets for human pose estimation. This…

Computer Vision and Pattern Recognition · Computer Science 2021-03-16 Umer Rafi , Andreas Doering , Bastian Leibe , Juergen Gall

Human Interaction Recognition Framework based on Interacting Body Part Attention

Human activity recognition in videos has been widely studied and has recently gained significant advances with deep learning approaches; however, it remains a challenging task. In this paper, we propose a novel framework that simultaneously…

Computer Vision and Pattern Recognition · Computer Science 2021-01-25 Dong-Gyu Lee , Seong-Whan Lee

PoseTrack: Joint Multi-Person Pose Estimation and Tracking

In this work, we introduce the challenging problem of joint multi-person pose estimation and tracking of an unknown number of persons in unconstrained videos. Existing methods for multi-person pose estimation in images cannot be applied…

Computer Vision and Pattern Recognition · Computer Science 2017-04-10 Umar Iqbal , Anton Milan , Juergen Gall

Human Performance Capture from Monocular Video in the Wild

Capturing the dynamically deforming 3D shape of clothed human is essential for numerous applications, including VR/AR, autonomous driving, and human-computer interaction. Existing methods either require a highly specialized capturing setup,…

Computer Vision and Pattern Recognition · Computer Science 2021-12-01 Chen Guo , Xu Chen , Jie Song , Otmar Hilliges