English

Context-based Image Segment Labeling (CBISL)

Computer Vision and Pattern Recognition 2020-11-03 v1

Abstract

Working with images, one often faces problems with incomplete or unclear information. Image inpainting can be used to restore missing image regions but focuses, however, on low-level image features such as pixel intensity, pixel gradient orientation, and color. This paper aims to recover semantic image features (objects and positions) in images. Based on published gated PixelCNNs, we demonstrate a new approach referred to as quadro-directional PixelCNN to recover missing objects and return probable positions for objects based on the context. We call this approach context-based image segment labeling (CBISL). The results suggest that our four-directional model outperforms one-directional models (gated PixelCNN) and returns a human-comparable performance.

Keywords

Cite

@article{arxiv.2011.00784,
  title  = {Context-based Image Segment Labeling (CBISL)},
  author = {Tobias Schlagenhauf and Yefeng Xia and Jürgen Fleischer},
  journal= {arXiv preprint arXiv:2011.00784},
  year   = {2020}
}

Comments

11 pages, 4 figures

R2 v1 2026-06-23T19:50:12.765Z