Related papers: Scene Text recognition with Full Normalization

Scene Text Detection via Holistic, Multi-Channel Prediction

Recently, scene text detection has become an active research topic in computer vision and document analysis, because of its great importance and significant challenge. However, vast majority of the existing methods detect text within local…

Computer Vision and Pattern Recognition · Computer Science 2016-07-06 Cong Yao , Xiang Bai , Nong Sang , Xinyu Zhou , Shuchang Zhou , Zhimin Cao

Scene Text Detection and Recognition: The Deep Learning Era

With the rise and development of deep learning, computer vision has been tremendously transformed and reshaped. As an important research area in computer vision, scene text detection and recognition has been inescapably influenced by this…

Computer Vision and Pattern Recognition · Computer Science 2020-08-11 Shangbang Long , Xin He , Cong Yao

STN-OCR: A single Neural Network for Text Detection and Text Recognition

Detecting and recognizing text in natural scene images is a challenging, yet not completely solved task. In re- cent years several new systems that try to solve at least one of the two sub-tasks (text detection and text recognition) have…

Computer Vision and Pattern Recognition · Computer Science 2017-07-28 Christian Bartz , Haojin Yang , Christoph Meinel

Image Processing Based Scene-Text Detection and Recognition with Tesseract

Text Recognition is one of the challenging tasks of computer vision with considerable practical interest. Optical character recognition (OCR) enables different applications for automation. This project focuses on word detection and…

Computer Vision and Pattern Recognition · Computer Science 2020-04-20 Ebin Zacharias , Martin Teuchler , Bénédicte Bernier

Text Recognition in the Wild: A Survey

The history of text can be traced back over thousands of years. Rich and precise semantic information carried by text is important in a wide range of vision-based application scenarios. Therefore, text recognition in natural scenes has been…

Computer Vision and Pattern Recognition · Computer Science 2020-12-04 Xiaoxue Chen , Lianwen Jin , Yuanzhi Zhu , Canjie Luo , Tianwei Wang

Why You Should Try the Real Data for the Scene Text Recognition

Recent works in the text recognition area have pushed forward the recognition results to the new horizons. But for a long time a lack of large human-labeled natural text recognition datasets has been forcing researchers to use synthetic…

Computer Vision and Pattern Recognition · Computer Science 2021-07-30 Vladimir Loginov

Pushing the Performance Limit of Scene Text Recognizer without Human Annotation

Scene text recognition (STR) attracts much attention over the years because of its wide application. Most methods train STR model in a fully supervised manner which requires large amounts of labeled data. Although synthetic data contributes…

Computer Vision and Pattern Recognition · Computer Science 2022-05-24 Caiyuan Zheng , Hui Li , Seon-Min Rhee , Seungju Han , Jae-Joon Han , Peng Wang

JSTR: Judgment Improves Scene Text Recognition

In this paper, we present a method for enhancing the accuracy of scene text recognition tasks by judging whether the image and text match each other. While previous studies focused on generating the recognition results from input images,…

Computer Vision and Pattern Recognition · Computer Science 2024-04-10 Masato Fujitake

Text Detection and Recognition in the Wild: A Review

Detection and recognition of text in natural images are two main problems in the field of computer vision that have a wide variety of applications in analysis of sports videos, autonomous driving, industrial automation, to name a few. They…

Computer Vision and Pattern Recognition · Computer Science 2020-07-02 Zobeir Raisi , Mohamed A. Naiel , Paul Fieguth , Steven Wardell , John Zelek

ODM: A Text-Image Further Alignment Pre-training Approach for Scene Text Detection and Spotting

In recent years, text-image joint pre-training techniques have shown promising results in various tasks. However, in Optical Character Recognition (OCR) tasks, aligning text instances with their corresponding text regions in images poses a…

Computer Vision and Pattern Recognition · Computer Science 2024-04-18 Chen Duan , Pei Fu , Shan Guo , Qianyi Jiang , Xiaoming Wei

Advances of Scene Text Datasets

This article introduces publicly available datasets in scene text detection and recognition. The information is as of 2017.

Computer Vision and Pattern Recognition · Computer Science 2018-12-14 Masakazu Iwamura

COCO-Text: Dataset and Benchmark for Text Detection and Recognition in Natural Images

This paper describes the COCO-Text dataset. In recent years large-scale datasets like SUN and Imagenet drove the advancement of scene understanding and object recognition. The goal of COCO-Text is to advance state-of-the-art in text…

Computer Vision and Pattern Recognition · Computer Science 2016-06-21 Andreas Veit , Tomas Matera , Lukas Neumann , Jiri Matas , Serge Belongie

What Is Wrong With Scene Text Recognition Model Comparisons? Dataset and Model Analysis

Many new proposals for scene text recognition (STR) models have been introduced in recent years. While each claim to have pushed the boundary of the technology, a holistic and fair comparison has been largely missing in the field due to the…

Computer Vision and Pattern Recognition · Computer Science 2019-12-19 Jeonghun Baek , Geewook Kim , Junyeop Lee , Sungrae Park , Dongyoon Han , Sangdoo Yun , Seong Joon Oh , Hwalsuk Lee

Geometric Perception based Efficient Text Recognition

Every Scene Text Recognition (STR) task consists of text localization \& text recognition as the prominent sub-tasks. However, in real-world applications with fixed camera positions such as equipment monitor reading, image-based data entry,…

Computer Vision and Pattern Recognition · Computer Science 2023-02-09 P. N. Deelaka , D. R. Jayakodi , D. Y. Silva

Scene Text Recognition with Semantics

Scene Text Recognition (STR) models have achieved high performance in recent years on benchmark datasets where text images are presented with minimal noise. Traditional STR recognition pipelines take a cropped image as sole input and…

Computer Vision and Pattern Recognition · Computer Science 2022-10-21 Joshua Cesare Placidi , Yishu Miao , Zixu Wang , Lucia Specia

TextOCR: Towards large-scale end-to-end reasoning for arbitrary-shaped scene text

A crucial component for the scene text based reasoning required for TextVQA and TextCaps datasets involve detecting and recognizing text present in the images using an optical character recognition (OCR) system. The current systems are…

Computer Vision and Pattern Recognition · Computer Science 2021-05-13 Amanpreet Singh , Guan Pang , Mandy Toh , Jing Huang , Wojciech Galuba , Tal Hassner

Text Detection Forgot About Document OCR

Detection and recognition of text from scans and other images, commonly denoted as Optical Character Recognition (OCR), is a widely used form of automated document processing with a number of methods available. Yet OCR systems still do not…

Computer Vision and Pattern Recognition · Computer Science 2023-01-24 Krzysztof Olejniczak , Milan Šulc

An Effective Data Augmentation Method by Asking Questions about Scene Text Images

Scene text recognition (STR) and handwritten text recognition (HTR) face significant challenges in accurately transcribing textual content from images into machine-readable formats. Conventional OCR models often predict transcriptions…

Computer Vision and Pattern Recognition · Computer Science 2026-03-05 Xu Yao , Lei Kang

An End-to-End Trainable Neural Network for Image-based Sequence Recognition and Its Application to Scene Text Recognition

Image-based sequence recognition has been a long-standing research topic in computer vision. In this paper, we investigate the problem of scene text recognition, which is among the most important and challenging tasks in image-based…

Computer Vision and Pattern Recognition · Computer Science 2015-07-22 Baoguang Shi , Xiang Bai , Cong Yao

Revisiting Scene Text Recognition: A Data Perspective

This paper aims to re-assess scene text recognition (STR) from a data-oriented perspective. We begin by revisiting the six commonly used benchmarks in STR and observe a trend of performance saturation, whereby only 2.91% of the benchmark…

Computer Vision and Pattern Recognition · Computer Science 2023-07-20 Qing Jiang , Jiapeng Wang , Dezhi Peng , Chongyu Liu , Lianwen Jin