English

Scene Text Recognition Models Explainability Using Local Features

Computer Vision and Pattern Recognition 2023-10-18 v1

Abstract

Explainable AI (XAI) is the study on how humans can be able to understand the cause of a model's prediction. In this work, the problem of interest is Scene Text Recognition (STR) Explainability, using XAI to understand the cause of an STR model's prediction. Recent XAI literatures on STR only provide a simple analysis and do not fully explore other XAI methods. In this study, we specifically work on data explainability frameworks, called attribution-based methods, that explain the important parts of an input data in deep learning models. However, integrating them into STR produces inconsistent and ineffective explanations, because they only explain the model in the global context. To solve this problem, we propose a new method, STRExp, to take into consideration the local explanations, i.e. the individual character prediction explanations. This is then benchmarked across different attribution-based methods on different STR datasets and evaluated across different STR models.

Keywords

Cite

@article{arxiv.2310.09549,
  title  = {Scene Text Recognition Models Explainability Using Local Features},
  author = {Mark Vincent Ty and Rowel Atienza},
  journal= {arXiv preprint arXiv:2310.09549},
  year   = {2023}
}

Comments

T2023 IEEE International Conference on Image Processing (ICIP). IEEE, 2023

R2 v1 2026-06-28T12:50:36.668Z