English

Multiple Instance Learning Convolutional Neural Networks for Object Recognition

Computer Vision and Pattern Recognition 2016-10-12 v1

Abstract

Convolutional Neural Networks (CNN) have demon- strated its successful applications in computer vision, speech recognition, and natural language processing. For object recog- nition, CNNs might be limited by its strict label requirement and an implicit assumption that images are supposed to be target- object-dominated for optimal solutions. However, the labeling procedure, necessitating laying out the locations of target ob- jects, is very tedious, making high-quality large-scale dataset prohibitively expensive. Data augmentation schemes are widely used when deep networks suffer the insufficient training data problem. All the images produced through data augmentation share the same label, which may be problematic since not all data augmentation methods are label-preserving. In this paper, we propose a weakly supervised CNN framework named Multiple Instance Learning Convolutional Neural Networks (MILCNN) to solve this problem. We apply MILCNN framework to object recognition and report state-of-the-art performance on three benchmark datasets: CIFAR10, CIFAR100 and ILSVRC2015 classification dataset.

Keywords

Cite

@article{arxiv.1610.03155,
  title  = {Multiple Instance Learning Convolutional Neural Networks for Object Recognition},
  author = {Miao Sun and Tony X. Han and Ming-Chang Liu and Ahmad Khodayari-Rostamabad},
  journal= {arXiv preprint arXiv:1610.03155},
  year   = {2016}
}

Comments

International Conference on Pattern Recognition(ICPR) 2016, Oral paper

R2 v1 2026-06-22T16:17:10.090Z