English

Efficient Image Gallery Representations at Scale Through Multi-Task Learning

Computer Vision and Pattern Recognition 2020-07-27 v3 Information Retrieval Machine Learning

Abstract

Image galleries provide a rich source of diverse information about a product which can be leveraged across many recommendation and retrieval applications. We study the problem of building a universal image gallery encoder through multi-task learning (MTL) approach and demonstrate that it is indeed a practical way to achieve generalizability of learned representations to new downstream tasks. Additionally, we analyze the relative predictive performance of MTL-trained solutions against optimal and substantially more expensive solutions, and find signals that MTL can be a useful mechanism to address sparsity in low-resource binary tasks.

Keywords

Cite

@article{arxiv.2005.09027,
  title  = {Efficient Image Gallery Representations at Scale Through Multi-Task Learning},
  author = {Benjamin Gutelman and Pavel Levin},
  journal= {arXiv preprint arXiv:2005.09027},
  year   = {2020}
}

Comments

Proceedings of the 43rd International ACM SIGIR Conference on Research and Development in Information Retrieval

R2 v1 2026-06-23T15:38:29.475Z