Automatic Script Identification in the Wild

Baoguang Shi; Cong Yao; Chengquan Zhang; Xiaowei Guo; Feiyue Huang; Xiang Bai

Automatic Script Identification in the Wild

Computer Vision and Pattern Recognition 2015-05-13 v1

Authors: Baoguang Shi , Cong Yao , Chengquan Zhang , Xiaowei Guo , Feiyue Huang , Xiang Bai

Abstract

With the rapid increase of transnational communication and cooperation, people frequently encounter multilingual scenarios in various situations. In this paper, we are concerned with a relatively new problem: script identification at word or line levels in natural scenes. A large-scale dataset with a great quantity of natural images and 10 types of widely used languages is constructed and released. In allusion to the challenges in script identification in real-world scenarios, a deep learning based algorithm is proposed. The experiments on the proposed dataset demonstrate that our algorithm achieves superior performance, compared with conventional image classification methods, such as the original CNN architecture and LLC.

Keywords

handwritten character recognition natural language processing narrative generation

Cite

@article{arxiv.1505.02982,
  title  = {Automatic Script Identification in the Wild},
  author = {Baoguang Shi and Cong Yao and Chengquan Zhang and Xiaowei Guo and Feiyue Huang and Xiang Bai},
  journal= {arXiv preprint arXiv:1505.02982},
  year   = {2015}
}

Comments

5 pages, 7 figures, submitted to ICDAR 2015

Automatic Script Identification in the Wild

Abstract

Keywords

Cite

Comments

Related papers