English

Category Query Learning for Human-Object Interaction Classification

Computer Vision and Pattern Recognition 2025-06-09 v1 Artificial Intelligence

Abstract

Unlike most previous HOI methods that focus on learning better human-object features, we propose a novel and complementary approach called category query learning. Such queries are explicitly associated to interaction categories, converted to image specific category representation via a transformer decoder, and learnt via an auxiliary image-level classification task. This idea is motivated by an earlier multi-label image classification method, but is for the first time applied for the challenging human-object interaction classification task. Our method is simple, general and effective. It is validated on three representative HOI baselines and achieves new state-of-the-art results on two benchmarks.

Keywords

Cite

@article{arxiv.2303.14005,
  title  = {Category Query Learning for Human-Object Interaction Classification},
  author = {Chi Xie and Fangao Zeng and Yue Hu and Shuang Liang and Yichen Wei},
  journal= {arXiv preprint arXiv:2303.14005},
  year   = {2025}
}

Comments

Accepted by CVPR 2023

R2 v1 2026-06-28T09:32:13.020Z