English
Related papers

Related papers: EfficientOCR: An Extensible, Open-Source Package f…

200 papers

Thousands of users consult digital archives daily, but the information they can access is unrepresentative of the diversity of documentary history. The sequence-to-sequence architecture typically used for optical character recognition (OCR)…

Computer Vision and Pattern Recognition · Computer Science 2024-07-29 Jacob Carlson , Tom Bryan , Melissa Dell

Optical Character Recognition (OCR) technology is widely used to extract text from images of documents, facilitating efficient digitization and data retrieval. However, merely extracting text is insufficient when dealing with complex…

A crucial component for the scene text based reasoning required for TextVQA and TextCaps datasets involve detecting and recognizing text present in the images using an optical character recognition (OCR) system. The current systems are…

Computer Vision and Pattern Recognition · Computer Science 2021-05-13 Amanpreet Singh , Guan Pang , Mandy Toh , Jing Huang , Wojciech Galuba , Tal Hassner

Information Extraction from visually rich documents is a challenging task that has gained a lot of attention in recent years due to its importance in several document-control based applications and its widespread commercial value. The…

Computer Vision and Pattern Recognition · Computer Science 2023-05-03 Mohamed Dhouib , Ghassen Bettaieb , Aymen Shabou

Optical character recognition (OCR) has advanced rapidly with deep learning and multimodal models, yet most methods focus on well-resourced scripts such as Latin and Chinese. Ethnic minority languages remain underexplored due to complex…

Computer Vision and Pattern Recognition · Computer Science 2026-02-25 Bonan Liu , Zeyu Zhang , Bingbing Meng , Han Wang , Hanshuo Zhang , Chengping Wang , Daji Ergu , Ying Cai

Contrary to popular belief, Optical Character Recognition (OCR) remains a challenging problem when text occurs in unconstrained environments, like natural scenes, due to geometrical distortions, complex backgrounds, and diverse fonts. In…

Computer Vision and Pattern Recognition · Computer Science 2019-06-06 Marcin Namysl , Iuliu Konya

Conventional optical character recognition (OCR) techniques segmented each character and then recognized. This made them prone to error in character segmentation, and devoid of context to exploit language models. Advances in sequence to…

Computer Vision and Pattern Recognition · Computer Science 2025-09-01 Shashank Vempati , Nishit Anand , Gaurav Talebailkar , Arpan Garai , Chetan Arora

Optical character recognition (OCR) is a widely used pattern recognition application in numerous domains. There are several feature-rich, general-purpose OCR solutions available for consumers, which can provide moderate to excellent…

Computer Vision and Pattern Recognition · Computer Science 2021-05-18 Ayantha Randika , Nilanjan Ray , Xiao Xiao , Allegra Latimer

Optical Character Recognition (OCR) is the process of extracting digitized text from images of scanned documents. While OCR systems have already matured in many languages, they still have shortcomings in cursive languages with overlapping…

Computer Vision and Pattern Recognition · Computer Science 2020-09-22 Hussein Osman , Karim Zaghw , Mostafa Hazem , Seifeldin Elsehely

Vision-Language Models (VLMs) have shown strong promise on Optical Character Recognition (OCR), yet the sheer number of visual tokens required to encode dense documents incurs prohibitive inference cost. Existing pruning methods rely on…

Computer Vision and Pattern Recognition · Computer Science 2026-05-19 Zihan Tang , Leqi Shen , Hui Chen , Ao Wang , Ben Wan , Yan Feng , Ke Zhang , Sicheng Zhao , Tongxuan Liu , Guiguang Ding

Optical Character Recognition (OCR) on historical printings is a challenging task mainly due to the complexity of the layout and the highly variant typography. Nevertheless, in the last few years great progress has been made in the area of…

Computer Vision and Pattern Recognition · Computer Science 2021-06-01 Christian Reul , Dennis Christ , Alexander Hartelt , Nico Balbach , Maximilian Wehner , Uwe Springmann , Christoph Wick , Christine Grundig , Andreas Büttner , Frank Puppe

Recent advancements in deep neural networks have markedly enhanced the performance of computer vision tasks, yet the specialized nature of these networks often necessitates extensive data and high computational power. Addressing these…

Computer Vision and Pattern Recognition · Computer Science 2024-01-03 Jiayou Chao , Wei Zhu

We present an end-to-end trainable approach for Optical Character Recognition (OCR) on printed documents. Specifically, we propose a model that predicts a) a two-dimensional character grid (\emph{chargrid}) representation of a document…

Computer Vision and Pattern Recognition · Computer Science 2020-02-28 Christian Reisswig , Anoop R Katti , Marco Spinaci , Johannes Höhne

Detection and recognition of text from scans and other images, commonly denoted as Optical Character Recognition (OCR), is a widely used form of automated document processing with a number of methods available. Yet OCR systems still do not…

Computer Vision and Pattern Recognition · Computer Science 2023-01-24 Krzysztof Olejniczak , Milan Šulc

Optical character recognition (OCR) is a vital process that involves the extraction of handwritten or printed text from scanned or printed images, converting it into a format that can be understood and processed by machines. This enables…

Computer Vision and Pattern Recognition · Computer Science 2023-12-20 Mahmoud SalahEldin Kasem , Mohamed Mahmoud , Hyun-Soo Kang

Word error rate of an ocr is often higher than its character error rate. This is especially true when ocrs are designed by recognizing characters. High word accuracies are critical to tasks like the creation of content in digital libraries…

Computer Vision and Pattern Recognition · Computer Science 2019-05-29 Deepayan Das , Jerin Philip , Minesh Mathew , C. V. Jawahar

Retrieving accurate details from documents is a crucial task, especially when handling a combination of scanned images and native digital formats. This document presents a combined framework for text extraction that merges Optical Character…

Computer Vision and Pattern Recognition · Computer Science 2025-06-16 Rasha Sinha , Rekha B S

The Optical Character Recognition (OCR) systems have been widely used in various of application scenarios, such as office automation (OA) systems, factory automations, online educations, map productions etc. However, OCR is still a…

Computer Vision and Pattern Recognition · Computer Science 2020-10-16 Yuning Du , Chenxia Li , Ruoyu Guo , Xiaoting Yin , Weiwei Liu , Jun Zhou , Yifan Bai , Zilin Yu , Yehua Yang , Qingqing Dang , Haoshuang Wang

Much of the existing linguistic data in many languages of the world is locked away in non-digitized books and documents. Optical character recognition (OCR) can be used to produce digitized text, and previous work has demonstrated the…

Computation and Language · Computer Science 2021-11-05 Shruti Rijhwani , Daisy Rosenblum , Antonios Anastasopoulos , Graham Neubig

Despite the existence of numerous Optical Character Recognition (OCR) tools, the lack of comprehensive open-source systems hampers the progress of document digitization in various low-resource languages, including Bengali. Low-resource…

‹ Prev 1 2 3 10 Next ›