English

Deep Object Co-Segmentation

Computer Vision and Pattern Recognition 2019-05-29 v2

Abstract

This work presents a deep object co-segmentation (DOCS) approach for segmenting common objects of the same class within a pair of images. This means that the method learns to ignore common, or uncommon, background stuff and focuses on objects. If multiple object classes are presented in the image pair, they are jointly extracted as foreground. To address this task, we propose a CNN-based Siamese encoder-decoder architecture. The encoder extracts high-level semantic features of the foreground objects, a mutual correlation layer detects the common objects, and finally, the decoder generates the output foreground masks for each image. To train our model, we compile a large object co-segmentation dataset consisting of image pairs from the PASCAL VOC dataset with common objects masks. We evaluate our approach on commonly used datasets for co-segmentation tasks and observe that our approach consistently outperforms competing methods, for both seen and unseen object classes.

Keywords

Cite

@article{arxiv.1804.06423,
  title  = {Deep Object Co-Segmentation},
  author = {Weihao Li and Omid Hosseini Jafari and Carsten Rother},
  journal= {arXiv preprint arXiv:1804.06423},
  year   = {2019}
}

Comments

Accepted at ACCV 2018

R2 v1 2026-06-23T01:26:52.464Z