English

Learning to Detect Human-Object Interactions

Computer Vision and Pattern Recognition 2018-03-02 v2

Abstract

We study the problem of detecting human-object interactions (HOI) in static images, defined as predicting a human and an object bounding box with an interaction class label that connects them. HOI detection is a fundamental problem in computer vision as it provides semantic information about the interactions among the detected objects. We introduce HICO-DET, a new large benchmark for HOI detection, by augmenting the current HICO classification benchmark with instance annotations. To solve the task, we propose Human-Object Region-based Convolutional Neural Networks (HO-RCNN). At the core of our HO-RCNN is the Interaction Pattern, a novel DNN input that characterizes the spatial relations between two bounding boxes. Experiments on HICO-DET demonstrate that our HO-RCNN, by exploiting human-object spatial relations through Interaction Patterns, significantly improves the performance of HOI detection over baseline approaches.

Keywords

Cite

@article{arxiv.1702.05448,
  title  = {Learning to Detect Human-Object Interactions},
  author = {Yu-Wei Chao and Yunfan Liu and Xieyang Liu and Huayi Zeng and Jia Deng},
  journal= {arXiv preprint arXiv:1702.05448},
  year   = {2018}
}

Comments

Accepted in WACV 2018

R2 v1 2026-06-22T18:21:30.280Z