Scene Text Recognition Models Explainability Using Local Features

Mark Vincent Ty; Rowel Atienza

doi:10.1109/ICIP49359.2023.10222406

Scene Text Recognition Models Explainability Using Local Features

Computer Vision and Pattern Recognition 2023-10-18 v1

Authors: Mark Vincent Ty , Rowel Atienza

View on arXiv ↗ PDF ↗ DOI ↗

Abstract

Explainable AI (XAI) is the study on how humans can be able to understand the cause of a model's prediction. In this work, the problem of interest is Scene Text Recognition (STR) Explainability, using XAI to understand the cause of an STR model's prediction. Recent XAI literatures on STR only provide a simple analysis and do not fully explore other XAI methods. In this study, we specifically work on data explainability frameworks, called attribution-based methods, that explain the important parts of an input data in deep learning models. However, integrating them into STR produces inconsistent and ineffective explanations, because they only explain the model in the global context. To solve this problem, we propose a new method, STRExp, to take into consideration the local explanations, i.e. the individual character prediction explanations. This is then benchmarked across different attribution-based methods on different STR datasets and evaluated across different STR models.

Keywords

interpretability interpretable machine learning explainable artificial intelligence

Cite

@article{arxiv.2310.09549,
  title  = {Scene Text Recognition Models Explainability Using Local Features},
  author = {Mark Vincent Ty and Rowel Atienza},
  journal= {arXiv preprint arXiv:2310.09549},
  year   = {2023}
}

Comments

T2023 IEEE International Conference on Image Processing (ICIP). IEEE, 2023

Scene Text Recognition Models Explainability Using Local Features

Abstract

Keywords

Cite

Comments

Related papers