Related papers: Text Detection Forgot About Document OCR

A Survey on Optical Character Recognition System

Optical Character Recognition (OCR) has been a topic of interest for many years. It is defined as the process of digitizing a document image into its constituent characters. Despite decades of intense research, developing OCR with…

Computer Vision and Pattern Recognition · Computer Science 2017-10-17 Noman Islam , Zeeshan Islam , Nazia Noor

Recognition of Text Image Using Multilayer Perceptron

The biggest challenge in the field of image processing is to recognize documents both in printed and handwritten format. Optical Character Recognition OCR is a type of document image analysis where scanned digital image that contains either…

Computer Vision and Pattern Recognition · Computer Science 2016-12-05 Singh Vijendra , Nisha Vasudeva , Hem Jyotsana Parashar

Detection Masking for Improved OCR on Noisy Documents

Optical Character Recognition (OCR), the task of extracting textual information from scanned documents is a vital and broadly used technology for digitizing and indexing physical documents. Existing technologies perform well for clean…

Computer Vision and Pattern Recognition · Computer Science 2022-05-18 Daniel Rotman , Ophir Azulai , Inbar Shapira , Yevgeny Burshtein , Udi Barzelay

Text Detection and Recognition in the Wild: A Review

Detection and recognition of text in natural images are two main problems in the field of computer vision that have a wide variety of applications in analysis of sports videos, autonomous driving, industrial automation, to name a few. They…

Computer Vision and Pattern Recognition · Computer Science 2020-07-02 Zobeir Raisi , Mohamed A. Naiel , Paul Fieguth , Steven Wardell , John Zelek

Handwritten Text Recognition Using Convolutional Neural Network

OCR (Optical Character Recognition) is a technology that offers comprehensive alphanumeric recognition of handwritten and printed characters at electronic speed by merely scanning the document. Recently, the understanding of visual data has…

Computer Vision and Pattern Recognition · Computer Science 2023-07-12 Atman Mishra , A. Sharath Ram , Kavyashree C

Image Processing Based Scene-Text Detection and Recognition with Tesseract

Text Recognition is one of the challenging tasks of computer vision with considerable practical interest. Optical character recognition (OCR) enables different applications for automation. This project focuses on word detection and…

Computer Vision and Pattern Recognition · Computer Science 2020-04-20 Ebin Zacharias , Martin Teuchler , Bénédicte Bernier

Digitization of Document and Information Extraction using OCR

Retrieving accurate details from documents is a crucial task, especially when handling a combination of scanned images and native digital formats. This document presents a combined framework for text extraction that merges Optical Character…

Computer Vision and Pattern Recognition · Computer Science 2025-06-16 Rasha Sinha , Rekha B S

Advancements and Challenges in Arabic Optical Character Recognition: A Comprehensive Survey

Optical character recognition (OCR) is a vital process that involves the extraction of handwritten or printed text from scanned or printed images, converting it into a format that can be understood and processed by machines. This enables…

Computer Vision and Pattern Recognition · Computer Science 2023-12-20 Mahmoud SalahEldin Kasem , Mohamed Mahmoud , Hyun-Soo Kang

Handwritten Optical Character Recognition (OCR): A Comprehensive Systematic Literature Review (SLR)

Given the ubiquity of handwritten documents in human transactions, Optical Character Recognition (OCR) of documents have invaluable practical worth. Optical character recognition is a science that enables to translate various types of…

Computer Vision and Pattern Recognition · Computer Science 2020-01-03 Jamshed Memon , Maira Sami , Rizwan Ahmed Khan

Text Change Detection in Multilingual Documents Using Image Comparison

Document comparison typically relies on optical character recognition (OCR) as its core technology. However, OCR requires the selection of appropriate language models for each document and the performance of multilingual or hybrid models…

Computer Vision and Pattern Recognition · Computer Science 2024-12-06 Doyoung Park , Naresh Reddy Yarram , Sunjin Kim , Minkyu Kim , Seongho Cho , Taehee Lee

OCR accuracy improvement on document images through a novel pre-processing approach

Digital camera and mobile document image acquisition are new trends arising in the world of Optical Character Recognition and text detection. In some cases, such process integrates many distortions and produces poorly scanned text or…

Computer Vision and Pattern Recognition · Computer Science 2015-09-14 Abdeslam El Harraj , Naoufal Raissouni

Design of an Optical Character Recognition System for Camera-based Handheld Devices

This paper presents a complete Optical Character Recognition (OCR) system for camera captured image/graphics embedded textual documents for handheld devices. At first, text regions are extracted and skew corrected. Then, these regions are…

Computer Vision and Pattern Recognition · Computer Science 2011-09-16 Ayatullah Faruk Mollah , Nabamita Majumder , Subhadip Basu , Mita Nasipuri

A survey of modern optical character recognition techniques

This report explores the latest advances in the field of digital document recognition. With the focus on printed document imagery, we discuss the major developments in optical character recognition (OCR) and document image…

Computer Vision and Pattern Recognition · Computer Science 2014-12-16 Eugene Borovikov

STN-OCR: A single Neural Network for Text Detection and Text Recognition

Detecting and recognizing text in natural scene images is a challenging, yet not completely solved task. In re- cent years several new systems that try to solve at least one of the two sub-tasks (text detection and text recognition) have…

Computer Vision and Pattern Recognition · Computer Science 2017-07-28 Christian Bartz , Haojin Yang , Christoph Meinel

OCR Graph Features for Manipulation Detection in Documents

Detecting manipulations in digital documents is becoming increasingly important for information verification purposes. Due to the proliferation of image editing software, altering key information in documents has become widely accessible.…

Computer Vision and Pattern Recognition · Computer Science 2023-06-27 Hailey Joren , Otkrist Gupta , Dan Raviv

Text Recognition in the Wild: A Survey

The history of text can be traced back over thousands of years. Rich and precise semantic information carried by text is important in a wide range of vision-based application scenarios. Therefore, text recognition in natural scenes has been…

Computer Vision and Pattern Recognition · Computer Science 2020-12-04 Xiaoxue Chen , Lianwen Jin , Yuanzhi Zhu , Canjie Luo , Tianwei Wang

OCR Post-Processing Error Correction Algorithm using Google Online Spelling Suggestion

With the advent of digital optical scanners, a lot of paper-based books, textbooks, magazines, articles, and documents are being transformed into an electronic version that can be manipulated by a computer. For this purpose, OCR, short for…

Computation and Language · Computer Science 2012-04-03 Youssef Bassil , Mohammad Alwani

OCR Context-Sensitive Error Correction Based on Google Web 1T 5-Gram Data Set

Since the dawn of the computing era, information has been represented digitally so that it can be processed by electronic computers. Paper books and documents were abundant and widely being published at that time; and hence, there was a…

Computation and Language · Computer Science 2012-04-03 Youssef Bassil , Mohammad Alwani

How Do Large Vision-Language Models See Text in Image? Unveiling the Distinctive Role of OCR Heads

Despite significant advancements in Large Vision Language Models (LVLMs), a gap remains, particularly regarding their interpretability and how they locate and interpret textual information within images. In this paper, we explore various…

Computer Vision and Pattern Recognition · Computer Science 2025-10-01 Ingeol Baek , Hwan Chang , Sunghyun Ryu , Hwanhee Lee

Words as Geometric Features: Estimating Homography using Optical Character Recognition as Compressed Image Representation

Document alignment and registration play a crucial role in numerous real-world applications, such as automated form processing, anomaly detection, and workflow automation. Traditional methods for document alignment rely on image-based…

Computer Vision and Pattern Recognition · Computer Science 2025-05-27 Ross Greer , Alisha Ukani , Katherine Izhikevich , Earlence Fernandes , Stefan Savage , Alex C. Snoeren