Related papers: Visual Compositional Learning for Human-Object Int…

Detecting Human-Object Interaction via Fabricated Compositional Learning

Human-Object Interaction (HOI) detection, inferring the relationships between human and objects from images/videos, is a fundamental task for high-level scene understanding. However, HOI detection usually suffers from the open long-tailed…

Computer Vision and Pattern Recognition · Computer Science 2021-03-26 Zhi Hou , Baosheng Yu , Yu Qiao , Xiaojiang Peng , Dacheng Tao

Discovering Human-Object Interaction Concepts via Self-Compositional Learning

A comprehensive understanding of human-object interaction (HOI) requires detecting not only a small portion of predefined HOI concepts (or categories) but also other reasonable HOI concepts, while current approaches usually fail to explore…

Computer Vision and Pattern Recognition · Computer Science 2022-07-26 Zhi Hou , Baosheng Yu , Dacheng Tao

Compositional Learning in Transformer-Based Human-Object Interaction Detection

Human-object interaction (HOI) detection is an important part of understanding human activities and visual scenes. The long-tailed distribution of labeled instances is a primary challenge in HOI detection, promoting research in few-shot and…

Computer Vision and Pattern Recognition · Computer Science 2023-08-14 Zikun Zhuang , Ruihao Qian , Chi Xie , Shuang Liang

Improving Human-Object Interaction Detection via Virtual Image Learning

Human-Object Interaction (HOI) detection aims to understand the interactions between humans and objects, which plays a curtail role in high-level semantic understanding tasks. However, most works pursue designing better architectures to…

Computer Vision and Pattern Recognition · Computer Science 2023-08-08 Shuman Fang , Shuai Liu , Jie Li , Guannan Jiang , Xianming Lin , Rongrong Ji

Towards Zero-shot Human-Object Interaction Detection via Vision-Language Integration

Human-object interaction (HOI) detection aims to locate human-object pairs and identify their interaction categories in images. Most existing methods primarily focus on supervised learning, which relies on extensive manual HOI annotations.…

Computer Vision and Pattern Recognition · Computer Science 2024-03-13 Weiying Xue , Qi Liu , Qiwei Xiong , Yuxiao Wang , Zhenao Wei , Xiaofen Xing , Xiangmin Xu

Learning Human-Object Interaction Detection using Interaction Points

Understanding interactions between humans and objects is one of the fundamental problems in visual classification and an essential step towards detailed scene understanding. Human-object interaction (HOI) detection strives to localize both…

Computer Vision and Pattern Recognition · Computer Science 2020-04-01 Tiancai Wang , Tong Yang , Martin Danelljan , Fahad Shahbaz Khan , Xiangyu Zhang , Jian Sun

HOICLIP: Efficient Knowledge Transfer for HOI Detection with Vision-Language Models

Human-Object Interaction (HOI) detection aims to localize human-object pairs and recognize their interactions. Recently, Contrastive Language-Image Pre-training (CLIP) has shown great potential in providing interaction prior for HOI…

Computer Vision and Pattern Recognition · Computer Science 2023-07-27 Shan Ning , Longtian Qiu , Yongfei Liu , Xuming He

Weakly-supervised HOI Detection via Prior-guided Bi-level Representation Learning

Human object interaction (HOI) detection plays a crucial role in human-centric scene understanding and serves as a fundamental building-block for many vision tasks. One generalizable and scalable strategy for HOI detection is to use weak…

Computer Vision and Pattern Recognition · Computer Science 2023-03-03 Bo Wan , Yongfei Liu , Desen Zhou , Tinne Tuytelaars , Xuming He

CL-HOI: Cross-Level Human-Object Interaction Distillation from Vision Large Language Models

Human-object interaction (HOI) detection has seen advancements with Vision Language Models (VLMs), but these methods often depend on extensive manual annotations. Vision Large Language Models (VLLMs) can inherently recognize and reason…

Computer Vision and Pattern Recognition · Computer Science 2024-10-22 Jianjun Gao , Chen Cai , Ruoyu Wang , Wenyang Liu , Kim-Hui Yap , Kratika Garg , Boon-Siew Han

GEN-VLKT: Simplify Association and Enhance Interaction Understanding for HOI Detection

The task of Human-Object Interaction~(HOI) detection could be divided into two core problems, i.e., human-object association and interaction understanding. In this paper, we reveal and address the disadvantages of the conventional…

Computer Vision and Pattern Recognition · Computer Science 2022-04-15 Yue Liao , Aixi Zhang , Miao Lu , Yongliang Wang , Xiaobo Li , Si Liu

Exploiting CLIP for Zero-shot HOI Detection Requires Knowledge Distillation at Multiple Levels

In this paper, we investigate the task of zero-shot human-object interaction (HOI) detection, a novel paradigm for identifying HOIs without the need for task-specific annotations. To address this challenging task, we employ CLIP, a…

Computer Vision and Pattern Recognition · Computer Science 2023-09-12 Bo Wan , Tinne Tuytelaars

HOI Analysis: Integrating and Decomposing Human-Object Interaction

Human-Object Interaction (HOI) consists of human, object and implicit interaction/verb. Different from previous methods that directly map pixels to HOI semantics, we propose a novel perspective for HOI learning in an analytical manner. In…

Computer Vision and Pattern Recognition · Computer Science 2020-11-10 Yong-Lu Li , Xinpeng Liu , Xiaoqian Wu , Yizhuo Li , Cewu Lu

A Review of Human-Object Interaction Detection

Human-object interaction (HOI) detection plays a key role in high-level visual understanding, facilitating a deep comprehension of human activities. Specifically, HOI detection aims to locate the humans and objects involved in interactions…

Computer Vision and Pattern Recognition · Computer Science 2025-03-19 Yuxiao Wang , Yu Lei , Li Cui , Weiying Xue , Qi Liu , Zhenao Wei

Exploring the Potential of Large Foundation Models for Open-Vocabulary HOI Detection

Open-vocabulary human-object interaction (HOI) detection, which is concerned with the problem of detecting novel HOIs guided by natural language, is crucial for understanding human-centric scenes. However, prior zero-shot HOI detectors…

Computer Vision and Pattern Recognition · Computer Science 2024-04-11 Ting Lei , Shaofeng Yin , Yang Liu

Detecting Human-Object Interactions with Object-Guided Cross-Modal Calibrated Semantics

Human-Object Interaction (HOI) detection is an essential task to understand human-centric images from a fine-grained perspective. Although end-to-end HOI detection models thrive, their paradigm of parallel human/object detection and verb…

Computer Vision and Pattern Recognition · Computer Science 2022-02-02 Hangjie Yuan , Mang Wang , Dong Ni , Liangpeng Xu

Generative Human-Object Interaction Detection via Differentiable Cognitive Steering of Multi-modal LLMs

Human-object interaction (HOI) detection aims to localize human-object pairs and the interactions between them. Existing methods operate under a closed-world assumption, treating the task as a classification problem over a small, predefined…

Computer Vision and Pattern Recognition · Computer Science 2025-12-22 Zhaolin Cai , Huiyu Duan , Zitong Xu , Fan Li , Zhi Liu , Jing Liu , Wei Shen , Xiongkuo Min , Guangtao Zhai

Improving Human-Object Interaction Detection via Phrase Learning and Label Composition

Human-Object Interaction (HOI) detection is a fundamental task in high-level human-centric scene understanding. We propose PhraseHOI, containing a HOI branch and a novel phrase branch, to leverage language prior and improve relation…

Computer Vision and Pattern Recognition · Computer Science 2022-01-19 Zhimin Li , Cheng Zou , Yu Zhao , Boxun Li , Sheng Zhong

Detecting Any Human-Object Interaction Relationship: Universal HOI Detector with Spatial Prompt Learning on Foundation Models

Human-object interaction (HOI) detection aims to comprehend the intricate relationships between humans and objects, predicting $<human, action, object>$ triplets, and serving as the foundation for numerous computer vision tasks. The…

Computer Vision and Pattern Recognition · Computer Science 2023-11-08 Yichao Cao , Qingfei Tang , Xiu Su , Chen Song , Shan You , Xiaobo Lu , Chang Xu

VLM-HOI: Vision Language Models for Interpretable Human-Object Interaction Analysis

The Large Vision Language Model (VLM) has recently addressed remarkable progress in bridging two fundamental modalities. VLM, trained by a sufficiently large dataset, exhibits a comprehensive understanding of both visual and linguistic to…

Computer Vision and Pattern Recognition · Computer Science 2024-11-28 Donggoo Kang , Dasol Jeong , Hyunmin Lee , Sangwoo Park , Hasil Park , Sunkyu Kwon , Yeongjoon Kim , Joonki Paik

Bilateral Collaboration with Large Vision-Language Models for Open Vocabulary Human-Object Interaction Detection

Open vocabulary Human-Object Interaction (HOI) detection is a challenging task that detects all <human, verb, object> triplets of interest in an image, even those that are not pre-defined in the training set. Existing approaches typically…

Computer Vision and Pattern Recognition · Computer Science 2025-07-10 Yupeng Hu , Changxing Ding , Chang Sun , Shaoli Huang , Xiangmin Xu