English

Zero-Shot Recognition using Dual Visual-Semantic Mapping Paths

Computer Vision and Pattern Recognition 2017-03-21 v2

Abstract

Zero-shot recognition aims to accurately recognize objects of unseen classes by using a shared visual-semantic mapping between the image feature space and the semantic embedding space. This mapping is learned on training data of seen classes and is expected to have transfer ability to unseen classes. In this paper, we tackle this problem by exploiting the intrinsic relationship between the semantic space manifold and the transfer ability of visual-semantic mapping. We formalize their connection and cast zero-shot recognition as a joint optimization problem. Motivated by this, we propose a novel framework for zero-shot recognition, which contains dual visual-semantic mapping paths. Our analysis shows this framework can not only apply prior semantic knowledge to infer underlying semantic manifold in the image feature space, but also generate optimized semantic embedding space, which can enhance the transfer ability of the visual-semantic mapping to unseen classes. The proposed method is evaluated for zero-shot recognition on four benchmark datasets, achieving outstanding results.

Keywords

Cite

@article{arxiv.1703.05002,
  title  = {Zero-Shot Recognition using Dual Visual-Semantic Mapping Paths},
  author = {Yanan Li and Donghui Wang and Huanhang Hu and Yuetan Lin and Yueting Zhuang},
  journal= {arXiv preprint arXiv:1703.05002},
  year   = {2017}
}

Comments

Accepted as a full paper in IEEE Computer Vision and Pattern Recognition (CVPR) 2017

R2 v1 2026-06-22T18:45:56.632Z