English

Deep Triplet Quantization

Computer Vision and Pattern Recognition 2019-02-04 v1

Abstract

Deep hashing establishes efficient and effective image retrieval by end-to-end learning of deep representations and hash codes from similarity data. We present a compact coding solution, focusing on deep learning to quantization approach that has shown superior performance over hashing solutions for similarity retrieval. We propose Deep Triplet Quantization (DTQ), a novel approach to learning deep quantization models from the similarity triplets. To enable more effective triplet training, we design a new triplet selection approach, Group Hard, that randomly selects hard triplets in each image group. To generate compact binary codes, we further apply a triplet quantization with weak orthogonality during triplet training. The quantization loss reduces the codebook redundancy and enhances the quantizability of deep representations through back-propagation. Extensive experiments demonstrate that DTQ can generate high-quality and compact binary codes, which yields state-of-the-art image retrieval performance on three benchmark datasets, NUS-WIDE, CIFAR-10, and MS-COCO.

Keywords

Cite

@article{arxiv.1902.00153,
  title  = {Deep Triplet Quantization},
  author = {Bin Liu and Yue Cao and Mingsheng Long and Jianmin Wang and Jingdong Wang},
  journal= {arXiv preprint arXiv:1902.00153},
  year   = {2019}
}

Comments

Accepted by ACM Multimedia 2018 as oral paper