English

Learning Surrogates via Deep Embedding

Computer Vision and Pattern Recognition 2020-07-20 v2

Abstract

This paper proposes a technique for training a neural network by minimizing a surrogate loss that approximates the target evaluation metric, which may be non-differentiable. The surrogate is learned via a deep embedding where the Euclidean distance between the prediction and the ground truth corresponds to the value of the evaluation metric. The effectiveness of the proposed technique is demonstrated in a post-tuning setup, where a trained model is tuned using the learned surrogate. Without a significant computational overhead and any bells and whistles, improvements are demonstrated on challenging and practical tasks of scene-text recognition and detection. In the recognition task, the model is tuned using a surrogate approximating the edit distance metric and achieves up to 39%39\% relative improvement in the total edit distance. In the detection task, the surrogate approximates the intersection over union metric for rotated bounding boxes and yields up to 4.25%4.25\% relative improvement in the F1F_{1} score.

Keywords

Cite

@article{arxiv.2007.00799,
  title  = {Learning Surrogates via Deep Embedding},
  author = {Yash Patel and Tomas Hodan and Jiri Matas},
  journal= {arXiv preprint arXiv:2007.00799},
  year   = {2020}
}

Comments

ECCV 2020 camera-ready version