English

Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval

Computer Vision and Pattern Recognition 2023-04-25 v1

Abstract

Using a discriminative representation obtained by supervised deep learning methods showed promising results on diverse Content-Based Image Retrieval (CBIR) problems. However, existing methods exploiting labels during training try to discriminate all available classes, which is not ideal in cases where the retrieval problem focuses on a class of interest. In this paper, we propose a regularized loss for Variational Auto-Encoders (VAEs) forcing the model to focus on a given class of interest. As a result, the model learns to discriminate the data belonging to the class of interest from any other possibility, making the learnt latent space of the VAE suitable for class-specific retrieval tasks. The proposed Class-Specific Variational Auto-Encoder (CS-VAE) is evaluated on three public and one custom datasets, and its performance is compared with that of three related VAE-based methods. Experimental results show that the proposed method outperforms its competition in both in-domain and out-of-domain retrieval problems.

Keywords

Cite

@article{arxiv.2304.11734,
  title  = {Class-Specific Variational Auto-Encoder for Content-Based Image Retrieval},
  author = {Mehdi Rafiei and Alexandros Iosifidis},
  journal= {arXiv preprint arXiv:2304.11734},
  year   = {2023}
}

Comments

8 pages, 7 figures, 6 tables, accepted at IJCNN conference