English

Image Augmentation with Controlled Diffusion for Weakly-Supervised Semantic Segmentation

Computer Vision and Pattern Recognition 2024-11-27 v3

Abstract

Weakly-supervised semantic segmentation (WSSS), which aims to train segmentation models solely using image-level labels, has achieved significant attention. Existing methods primarily focus on generating high-quality pseudo labels using available images and their image-level labels. However, the quality of pseudo labels degrades significantly when the size of available dataset is limited. Thus, in this paper, we tackle this problem from a different view by introducing a novel approach called Image Augmentation with Controlled Diffusion (IACD). This framework effectively augments existing labeled datasets by generating diverse images through controlled diffusion, where the available images and image-level labels are served as the controlling information. Moreover, we also propose a high-quality image selection strategy to mitigate the potential noise introduced by the randomness of diffusion models. In the experiments, our proposed IACD approach clearly surpasses existing state-of-the-art methods. This effect is more obvious when the amount of available data is small, demonstrating the effectiveness of our method.

Keywords

Cite

@article{arxiv.2310.09760,
  title  = {Image Augmentation with Controlled Diffusion for Weakly-Supervised Semantic Segmentation},
  author = {Wangyu Wu and Tianhong Dai and Xiaowei Huang and Fei Ma and Jimin Xiao},
  journal= {arXiv preprint arXiv:2310.09760},
  year   = {2024}
}

Comments

Accepted by ICASSP 2024

R2 v1 2026-06-28T12:50:55.411Z