Related papers: Efficient, Lexicon-Free OCR using Deep Learning

Telugu OCR Framework using Deep Learning

In this paper, we address the task of Optical Character Recognition(OCR) for the Telugu script. We present an end-to-end framework that segments the text image, classifies the characters and extracts lines using a language model. The…

Machine Learning · Statistics 2017-02-16 Rakesh Achanta , Trevor Hastie

STN-OCR: A single Neural Network for Text Detection and Text Recognition

Detecting and recognizing text in natural scene images is a challenging, yet not completely solved task. In re- cent years several new systems that try to solve at least one of the two sub-tasks (text detection and text recognition) have…

Computer Vision and Pattern Recognition · Computer Science 2017-07-28 Christian Bartz , Haojin Yang , Christoph Meinel

Chargrid-OCR: End-to-end Trainable Optical Character Recognition for Printed Documents using Instance Segmentation

We present an end-to-end trainable approach for Optical Character Recognition (OCR) on printed documents. Specifically, we propose a model that predicts a) a two-dimensional character grid (\emph{chargrid}) representation of a document…

Computer Vision and Pattern Recognition · Computer Science 2020-02-28 Christian Reisswig , Anoop R Katti , Marco Spinaci , Johannes Höhne

Efficient OCR for Building a Diverse Digital History

Thousands of users consult digital archives daily, but the information they can access is unrepresentative of the diversity of documentary history. The sequence-to-sequence architecture typically used for optical character recognition (OCR)…

Computer Vision and Pattern Recognition · Computer Science 2024-07-29 Jacob Carlson , Tom Bryan , Melissa Dell

An Efficient Language-Independent Multi-Font OCR for Arabic Script

Optical Character Recognition (OCR) is the process of extracting digitized text from images of scanned documents. While OCR systems have already matured in many languages, they still have shortcomings in cursive languages with overlapping…

Computer Vision and Pattern Recognition · Computer Science 2020-09-22 Hussein Osman , Karim Zaghw , Mostafa Hazem , Seifeldin Elsehely

Digitization of Document and Information Extraction using OCR

Retrieving accurate details from documents is a crucial task, especially when handling a combination of scanned images and native digital formats. This document presents a combined framework for text extraction that merges Optical Character…

Computer Vision and Pattern Recognition · Computer Science 2025-06-16 Rasha Sinha , Rekha B S

Why Stop at Words? Unveiling the Bigger Picture through Line-Level OCR

Conventional optical character recognition (OCR) techniques segmented each character and then recognized. This made them prone to error in character segmentation, and devoid of context to exploit language models. Advances in sequence to…

Computer Vision and Pattern Recognition · Computer Science 2025-09-01 Shashank Vempati , Nishit Anand , Gaurav Talebailkar , Arpan Garai , Chetan Arora

Efficient Multi-domain Text Recognition Deep Neural Network Parameterization with Residual Adapters

Recent advancements in deep neural networks have markedly enhanced the performance of computer vision tasks, yet the specialized nature of these networks often necessitates extensive data and high computational power. Addressing these…

Computer Vision and Pattern Recognition · Computer Science 2024-01-03 Jiayou Chao , Wei Zhu

An Effective Data Augmentation Method by Asking Questions about Scene Text Images

Scene text recognition (STR) and handwritten text recognition (HTR) face significant challenges in accurately transcribing textual content from images into machine-readable formats. Conventional OCR models often predict transcriptions…

Computer Vision and Pattern Recognition · Computer Science 2026-03-05 Xu Yao , Lei Kang

Recognition of Text Image Using Multilayer Perceptron

The biggest challenge in the field of image processing is to recognize documents both in printed and handwritten format. Optical Character Recognition OCR is a type of document image analysis where scanned digital image that contains either…

Computer Vision and Pattern Recognition · Computer Science 2016-12-05 Singh Vijendra , Nisha Vasudeva , Hem Jyotsana Parashar

Text Detection Forgot About Document OCR

Detection and recognition of text from scans and other images, commonly denoted as Optical Character Recognition (OCR), is a widely used form of automated document processing with a number of methods available. Yet OCR systems still do not…

Computer Vision and Pattern Recognition · Computer Science 2023-01-24 Krzysztof Olejniczak , Milan Šulc

Focus-Enhanced Scene Text Recognition with Deformable Convolutions

Recently, scene text recognition methods based on deep learning have sprung up in computer vision area. The existing methods achieved great performances, but the recognition of irregular text is still challenging due to the various shapes…

Computer Vision and Pattern Recognition · Computer Science 2022-05-06 Linjie Deng , Yanxiang Gong , Xinchen Lu , Xin Yi , Zheng Ma , Mei Xie

A Compositional Textual Model for Recognition of Imperfect Word Images

Printed text recognition is an important problem for industrial OCR systems. Printed text is constructed in a standard procedural fashion in most settings. We develop a mathematical model for this process that can be applied to the backward…

Computer Vision and Pattern Recognition · Computer Science 2018-11-29 Wei Tang , John Corring , Ying Wu , Gang Hua

Image Processing Based Scene-Text Detection and Recognition with Tesseract

Text Recognition is one of the challenging tasks of computer vision with considerable practical interest. Optical character recognition (OCR) enables different applications for automation. This project focuses on word detection and…

Computer Vision and Pattern Recognition · Computer Science 2020-04-20 Ebin Zacharias , Martin Teuchler , Bénédicte Bernier

Automated Invoice Data Extraction: Using LLM and OCR

Conventional Optical Character Recognition (OCR) systems are challenged by variant invoice layouts, handwritten text, and low-quality scans, which are often caused by strong template dependencies that restrict their flexibility across…

Computer Vision and Pattern Recognition · Computer Science 2026-01-09 Khushi Khanchandani , Advait Thakur , Akshita Shetty , Chaitravi Reddy , Ritisa Behera

Reading Scene Text in Deep Convolutional Sequences

We develop a Deep-Text Recurrent Network (DTRN) that regards scene text reading as a sequence labelling problem. We leverage recent advances of deep convolutional neural networks to generate an ordered high-level sequence from a whole word…

Computer Vision and Pattern Recognition · Computer Science 2015-12-22 Pan He , Weilin Huang , Yu Qiao , Chen Change Loy , Xiaoou Tang

Lexically Aware Semi-Supervised Learning for OCR Post-Correction

Much of the existing linguistic data in many languages of the world is locked away in non-digitized books and documents. Optical character recognition (OCR) can be used to produce digitized text, and previous work has demonstrated the…

Computation and Language · Computer Science 2021-11-05 Shruti Rijhwani , Daisy Rosenblum , Antonios Anastasopoulos , Graham Neubig

Handwritten Text Recognition Using Convolutional Neural Network

OCR (Optical Character Recognition) is a technology that offers comprehensive alphanumeric recognition of handwritten and printed characters at electronic speed by merely scanning the document. Recently, the understanding of visual data has…

Computer Vision and Pattern Recognition · Computer Science 2023-07-12 Atman Mishra , A. Sharath Ram , Kavyashree C

OCR accuracy improvement on document images through a novel pre-processing approach

Digital camera and mobile document image acquisition are new trends arising in the world of Optical Character Recognition and text detection. In some cases, such process integrates many distortions and produces poorly scanned text or…

Computer Vision and Pattern Recognition · Computer Science 2015-09-14 Abdeslam El Harraj , Naoufal Raissouni

EASTER: Efficient and Scalable Text Recognizer

Recent progress in deep learning has led to the development of Optical Character Recognition (OCR) systems which perform remarkably well. Most research has been around recurrent networks as well as complex gated layers which make the…

Computer Vision and Pattern Recognition · Computer Science 2022-06-01 Kartik Chaudhary , Raghav Bali