Related papers: Human-Object Interaction Detection Collaborated wi…

Boosting Human-Object Interaction Detection with Text-to-Image Diffusion Model

This paper investigates the problem of the current HOI detection methods and introduces DiffHOI, a novel HOI detection scheme grounded on a pre-trained text-image diffusion model, which enhances the detector's performance via improved data…

Computer Vision and Pattern Recognition · Computer Science 2023-05-23 Jie Yang, Bingliang Li, Fengyu Yang, Ailing Zeng, Lei Zhang, Ruimao Zhang

An Image-like Diffusion Method for Human-Object Interaction Detection

Human-object interaction (HOI) detection often faces high levels of ambiguity and indeterminacy, as the same interaction can appear vastly different across different human-object pairs. Additionally, the indeterminacy can be further…

Computer Vision and Pattern Recognition · Computer Science 2025-03-25 Xiaofei Hui, Haoxuan Qu, Hossein Rahmani, Jun Liu

HOI-Diff: Text-Driven Synthesis of 3D Human-Object Interactions using Diffusion Models

We address the problem of generating realistic 3D human-object interactions (HOIs) driven by textual prompts. To this end, we take a modular design and decompose the complex task into simpler sub-tasks. We first develop a dual-branch…

Computer Vision and Pattern Recognition · Computer Science 2025-07-08 Xiaogang Peng, Yiming Xie, Zizhao Wu, Varun Jampani, Deqing Sun, Huaizu Jiang

A Review of Human-Object Interaction Detection

Human-object interaction (HOI) detection plays a key role in high-level visual understanding, facilitating a deep comprehension of human activities. Specifically, HOI detection aims to locate the humans and objects involved in interactions…

Computer Vision and Pattern Recognition · Computer Science 2025-03-19 Yuxiao Wang, Yu Lei, Li Cui, Weiying Xue, Qi Liu, Zhenao Wei

Guiding Human-Object Interactions with Rich Geometry and Relations

Human-object interaction (HOI) synthesis is crucial for creating immersive and realistic experiences for applications such as virtual reality. Existing methods often rely on simplified object representations, such as the object's centroid…

Computer Vision and Pattern Recognition · Computer Science 2025-03-27 Mengqing Xue, Yifei Liu, Ling Guo, Shaoli Huang, Changxing Ding

InterDiff: Generating 3D Human-Object Interactions with Physics-Informed Diffusion

This paper addresses a novel task of anticipating 3D human-object interactions (HOIs). Most existing research on HOI synthesis lacks comprehensive whole-body interactions with dynamic objects, e.g., often limited to manipulating small or…

Computer Vision and Pattern Recognition · Computer Science 2023-09-01 Sirui Xu, Zhengyuan Li, Yu-Xiong Wang, Liang-Yan Gui

InteractDiffusion: Interaction Control in Text-to-Image Diffusion Models

Large-scale text-to-image (T2I) diffusion models have showcased incredible capabilities in generating coherent images based on textual descriptions, enabling vast applications in content generation. While recent advancements have introduced…

Computer Vision and Pattern Recognition · Computer Science 2024-02-28 Jiun Tian Hoe, Xudong Jiang, Chee Seng Chan, Yap-Peng Tan, Weipeng Hu

DreamHOI: Subject-Driven Generation of 3D Human-Object Interactions with Diffusion Priors

We present DreamHOI, a novel method for zero-shot synthesis of human-object interactions (HOIs), enabling a 3D human model to realistically interact with any given object based on a textual description. This task is complicated by the…

Computer Vision and Pattern Recognition · Computer Science 2024-09-13 Thomas Hanwen Zhu, Ruining Li, Tomas Jakab

ScoreHOI: Physically Plausible Reconstruction of Human-Object Interaction via Score-Guided Diffusion

Joint reconstruction of human-object interaction marks a significant milestone in comprehending the intricate interrelations between humans and their surrounding environment. Nevertheless, previous optimization methods often struggle to…

Computer Vision and Pattern Recognition · Computer Science 2025-09-10 Ao Li, Jinpeng Liu, Yixuan Zhu, Yansong Tang

GenHOI: Generalizing Text-driven 4D Human-Object Interaction Synthesis for Unseen Objects

While diffusion models and large-scale motion datasets have advanced text-driven human motion synthesis, extending these advances to 4D human-object interaction (HOI) remains challenging, mainly due to the limited availability of…

Computer Vision and Pattern Recognition · Computer Science 2025-06-19 Shujia Li, Haiyu Zhang, Xinyuan Chen, Yaohui Wang, Yutong Ban

THOR: Text to Human-Object Interaction Diffusion via Relation Intervention

This paper addresses new methodologies to deal with the challenging task of generating dynamic Human-Object Interactions from textual descriptions (Text2HOI). While most existing works assume interactions with limited body parts or static…

Computer Vision and Pattern Recognition · Computer Science 2024-03-19 Qianyang Wu, Ye Shi, Xiaoshui Huang, Jingyi Yu, Lan Xu, Jingya Wang

Towards Zero-shot Human-Object Interaction Detection via Vision-Language Integration

Human-object interaction (HOI) detection aims to locate human-object pairs and identify their interaction categories in images. Most existing methods primarily focus on supervised learning, which relies on extensive manual HOI annotations.…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Weiying Xue, Qi Liu, Qiwei Xiong, Yuxiao Wang, Zhenao Wei, Xiaofen Xing, Xiangmin Xu

HOIDiNi: Human-Object Interaction through Diffusion Noise Optimization

We present HOIDiNi, a text-driven diffusion framework for synthesizing realistic and plausible human-object interaction (HOI). HOI generation is extremely challenging since it induces strict contact accuracies alongside a diverse motion…

Computer Vision and Pattern Recognition · Computer Science 2025-10-22 Roey Ron, Guy Tevet, Haim Sawdayee, Amit H. Bermano

A Study of Failure Modes in Two-Stage Human-Object Interaction Detection

Human-object interaction (HOI) detection aims to detect interactions between humans and objects in images. While recent advances have improved performance on existing benchmarks, their evaluations mainly focus on overall prediction accuracy…

Computer Vision and Pattern Recognition · Computer Science 2026-04-16 Lemeng Wang, Qinqian Lei, Vidhi Bakshi, Daniel Yi, Yifan Liu, Jiacheng Hou, Asher Seng Hao, Zheda Mai, Wei-Lun Chao, Robby T. Tan, Bo Wang

Learning Human-Object Interaction as Groups

Human-Object Interaction Detection (HOI-DET) aims to localize human-object pairs and identify their interactive relationships. To aggregate contextual cues, existing methods typically propagate information across all detected entities via…

Computer Vision and Pattern Recognition · Computer Science 2025-10-22 Jiajun Hong, Jianan Wei, Wenguan Wang

UAHOI: Uncertainty-aware Robust Interaction Learning for HOI Detection

This paper focuses on Human-Object Interaction (HOI) detection, addressing the challenge of identifying and understanding the interactions between humans and objects within a given image or video frame. Spearheaded by Detection Transformer…

Computer Vision and Pattern Recognition · Computer Science 2024-08-15 Mu Chen, Minghan Chen, Yi Yang

OneHOI: Unifying Human-Object Interaction Generation and Editing

Human-Object Interaction (HOI) modelling captures how humans act upon and relate to objects, typically expressed as <person, action, object> triplets. Existing approaches split into two disjoint families: HOI generation synthesises scenes…

Computer Vision and Pattern Recognition · Computer Science 2026-04-16 Jiun Tian Hoe, Weipeng Hu, Xudong Jiang, Yap-Peng Tan, Chee Seng Chan

Towards Unconstrained Human-Object Interaction

Human-Object Interaction (HOI) detection is a longstanding computer vision problem concerned with predicting the interaction between humans and objects. Current HOI models rely on a vocabulary of interactions at training and inference time,…

Computer Vision and Pattern Recognition · Computer Science 2026-04-16 Francesco Tonini, Alessandro Conti, Lorenzo Vaquero, Cigdem Beyan, Elisa Ricci

CycleHOI: Improving Human-Object Interaction Detection with Cycle Consistency of Detection and Generation

Recognition and generation are two fundamental tasks in computer vision, which are often investigated separately in the existing literature. However, these two tasks are highly correlated in essence as they both require understanding the…

Computer Vision and Pattern Recognition · Computer Science 2024-07-17 Yisen Wang, Yao Teng, Limin Wang

Learning to Generate Human-Human-Object Interactions from Textual Descriptions

The way humans interact with each other, including interpersonal distances, spatial configuration, and motion, varies significantly across different situations. To enable machines to understand such complex, context-dependent behaviors, it…

Computer Vision and Pattern Recognition · Computer Science 2025-12-25 Jeonghyeon Na, Sangwon Baik, Inhee Lee, Junyoung Lee, Hanbyul Joo