English

SIM: Semantic-aware Instance Mask Generation for Box-Supervised Instance Segmentation

Computer Vision and Pattern Recognition 2023-03-16 v1

Abstract

Weakly supervised instance segmentation using only bounding box annotations has recently attracted much research attention. Most of the current efforts leverage low-level image features as extra supervision without explicitly exploiting the high-level semantic information of the objects, which will become ineffective when the foreground objects have similar appearances to the background or other objects nearby. We propose a new box-supervised instance segmentation approach by developing a Semantic-aware Instance Mask (SIM) generation paradigm. Instead of heavily relying on local pair-wise affinities among neighboring pixels, we construct a group of category-wise feature centroids as prototypes to identify foreground objects and assign them semantic-level pseudo labels. Considering that the semantic-aware prototypes cannot distinguish different instances of the same semantics, we propose a self-correction mechanism to rectify the falsely activated regions while enhancing the correct ones. Furthermore, to handle the occlusions between objects, we tailor the Copy-Paste operation for the weakly-supervised instance segmentation task to augment challenging training data. Extensive experimental results demonstrate the superiority of our proposed SIM approach over other state-of-the-art methods. The source code: https://github.com/lslrh/SIM.

Keywords

Cite

@article{arxiv.2303.08578,
  title  = {SIM: Semantic-aware Instance Mask Generation for Box-Supervised Instance Segmentation},
  author = {Ruihuang Li and Chenhang He and Yabin Zhang and Shuai Li and Liyi Chen and Lei Zhang},
  journal= {arXiv preprint arXiv:2303.08578},
  year   = {2023}
}

Comments

Accepted by CVPR2023

R2 v1 2026-06-28T09:18:23.290Z