English

UnitBox: An Advanced Object Detection Network

Computer Vision and Pattern Recognition 2016-08-05 v1

Abstract

In present object detection systems, the deep convolutional neural networks (CNNs) are utilized to predict bounding boxes of object candidates, and have gained performance advantages over the traditional region proposal methods. However, existing deep CNN methods assume the object bounds to be four independent variables, which could be regressed by the 2\ell_2 loss separately. Such an oversimplified assumption is contrary to the well-received observation, that those variables are correlated, resulting to less accurate localization. To address the issue, we firstly introduce a novel Intersection over Union (IoUIoU) loss function for bounding box prediction, which regresses the four bounds of a predicted box as a whole unit. By taking the advantages of IoUIoU loss and deep fully convolutional networks, the UnitBox is introduced, which performs accurate and efficient localization, shows robust to objects of varied shapes and scales, and converges fast. We apply UnitBox on face detection task and achieve the best performance among all published methods on the FDDB benchmark.

Keywords

Cite

@article{arxiv.1608.01471,
  title  = {UnitBox: An Advanced Object Detection Network},
  author = {Jiahui Yu and Yuning Jiang and Zhangyang Wang and Zhimin Cao and Thomas Huang},
  journal= {arXiv preprint arXiv:1608.01471},
  year   = {2016}
}

Comments

To appear in ACM MM 2016, 5 pages, 6 figures

R2 v1 2026-06-22T15:12:03.141Z