HintPose

Computer Vision and Pattern Recognition 2020-03-05 v1

Abstract

Most top-down pose estimation models assume that each bounding box contains exactly one person. This assumption does not always hold. In this technical report, we add two ideas, instance cue and recurrent refinement, to an existing pose estimator so that the model can properly handle detection boxes that contain multiple persons. Evaluated on the COCO17 keypoints dataset, our model showed a non-negligible improvement over its baseline. It achieved 76.2 mAP as a single model and 77.3 mAP as an ensemble on the test-dev set without additional training data. After additional post-processing with a separate refinement network, our final predictions achieved 77.8 mAP on the COCO test-dev set.

Cite

@article{arxiv.2003.02170,
  title  = {HintPose},
  author = {Sanghoon Hong and Hunchul Park and Jonghyuk Park and Sukhyun Cho and Heewoong Park},
  journal = {arXiv preprint arXiv:2003.02170},
  year   = {2020}
}

Comments

Presented at "Joint COCO and Mapillary Workshop at ICCV 2019: Keypoint Detection Challenge Track"
