Related papers: DeepOtsu: Document Enhancement and Binarization us…

Improving Document Binarization via Adversarial Noise-Texture Augmentation

Binarization of degraded document images is an elementary step in most of the problems in document image analysis domain. The paper re-visits the binarization problem by introducing an adversarial learning approach. We construct a Texture…

Computer Vision and Pattern Recognition · Computer Science 2019-05-02 Ankan Kumar Bhunia , Ayan Kumar Bhunia , Aneeshan Sain , Partha Pratim Roy

Learning Document Image Binarization from Data

In this paper we present a fully trainable binarization solution for degraded document images. Unlike previous attempts that often used simple features with a series of pre- and post-processing, our solution encodes all heuristics about…

Computer Vision and Pattern Recognition · Computer Science 2015-05-05 Yue Wu , Stephen Rawls , Wael AbdAlmageed , Premkumar Natarajan

A Fair Evaluation of Various Deep Learning-Based Document Image Binarization Approaches

Binarization of document images is an important pre-processing step in the field of document analysis. Traditional image binarization techniques usually rely on histograms or local statistics to identify a valid threshold to differentiate…

Computer Vision and Pattern Recognition · Computer Science 2024-01-23 Richin Sukesh , Mathias Seuret , Anguelos Nicolaou , Martin Mayr , Vincent Christlein

BiNet: Degraded-Manuscript Binarization in Diverse Document Textures and Layouts using Deep Encoder-Decoder Networks

Handwritten document-image binarization is a semantic segmentation process to differentiate ink pixels from background pixels. It is one of the essential steps towards character recognition, writer identification, and script-style evolution…

Computer Vision and Pattern Recognition · Computer Science 2026-01-06 Maruf A. Dhali , Jan Willem de Wit , Lambert Schomaker

A selectional auto-encoder approach for document image binarization

Binarization plays a key role in the automatic information retrieval from document images. This process is usually performed in the first stages of documents analysis systems, and serves as a basis for subsequent steps. Hence it has to be…

Computer Vision and Pattern Recognition · Computer Science 2018-09-07 Jorge Calvo-Zaragoza , Antonio-Javier Gallego

An Analytical Study of different Document Image Binarization Methods

Document image has been the area of research for a couple of decades because of its potential application in the area of text recognition, line recognition or any other shape recognition from the image. For most of these purposes…

Computer Vision and Pattern Recognition · Computer Science 2015-02-02 Mahua Nandy , Satadal Saha

Two-stage generative adversarial networks for document image binarization with color noise and background removal

Document image enhancement and binarization methods are often used to improve the accuracy and efficiency of document image analysis tasks such as text recognition. Traditional non-machine-learning methods are constructed on low-level…

Computer Vision and Pattern Recognition · Computer Science 2021-04-28 Sungho Suh , Jihun Kim , Paul Lukowicz , Yong Oh Lee

Variational Augmentation for Enhancing Historical Document Image Binarization

Historical Document Image Binarization is a well-known segmentation problem in image processing. Despite ubiquity, traditional thresholding algorithms achieved limited success on severely degraded document images. With the advent of deep…

Computer Vision and Pattern Recognition · Computer Science 2022-11-15 Avirup Dey , Nibaran Das , Mita Nasipuri

Iterative Joint Image Demosaicking and Denoising using a Residual Denoising Network

Modern digital cameras rely on the sequential execution of separate image processing steps to produce realistic images. The first two steps are usually related to denoising and demosaicking where the former aims to reduce noise from the…

Computer Vision and Pattern Recognition · Computer Science 2019-04-02 Filippos Kokkinos , Stamatios Lefkimmiatis

A Survey on Deep learning based Document Image Enhancement

Digitized documents such as scientific articles, tax forms, invoices, contract papers, historic texts are widely used nowadays. These document images could be degraded or damaged due to various reasons including poor lighting conditions,…

Computer Vision and Pattern Recognition · Computer Science 2022-01-04 Zahra Anvari , Vassilis Athitsos

Recurrent Neural Networks to Correct Satellite Image Classification Maps

While initially devised for image categorization, convolutional neural networks (CNNs) are being increasingly used for the pixelwise semantic labeling of images. However, the proper nature of the most common CNN architectures makes them…

Computer Vision and Pattern Recognition · Computer Science 2017-04-24 Emmanuel Maggiori , Guillaume Charpiat , Yuliya Tarabalka , Pierre Alliez

PDNet: Semantic Segmentation integrated with a Primal-Dual Network for Document binarization

Binarization of digital documents is the task of classifying each pixel in an image of the document as belonging to the background (parchment/paper) or foreground (text/ink). Historical documents are often subjected to degradations, that…

Machine Learning · Statistics 2018-05-18 Kalyan Ram Ayyalasomayajula , Filip Malmberg , Anders Brun

Binary Document Image Super Resolution for Improved Readability and OCR Performance

There is a need for information retrieval from large collections of low-resolution (LR) binary document images, which can be found in digital libraries across the world, where the high-resolution (HR) counterpart is not available. This…

Computer Vision and Pattern Recognition · Computer Science 2018-12-07 Ram Krishna Pandey , K Vignesh , A G Ramakrishnan , Chandrahasa B

Improving Document Image Understanding with Reinforcement Finetuning

Successful Artificial Intelligence systems often require numerous labeled data to extract information from document images. In this paper, we investigate the problem of improving the performance of Artificial Intelligence systems in…

Information Retrieval · Computer Science 2022-09-27 Bao-Sinh Nguyen , Dung Tien Le , Hieu M. Vu , Tuan Anh D. Nguyen , Minh-Tien Nguyen , Hung Le

Deep Retinal Image Understanding

This paper presents Deep Retinal Image Understanding (DRIU), a unified framework of retinal image analysis that provides both retinal vessel and optic disc segmentation. We make use of deep Convolutional Neural Networks (CNNs), which have…

Computer Vision and Pattern Recognition · Computer Science 2016-11-17 Kevis-Kokitsi Maninis , Jordi Pont-Tuset , Pablo Arbeláez , Luc Van Gool

Automatic Document Image Binarization using Bayesian Optimization

Document image binarization is often a challenging task due to various forms of degradation. Although there exist several binarization techniques in literature, the binarized image is typically sensitive to control parameter settings of the…

Information Retrieval · Computer Science 2018-02-22 Ekta Vats , Anders Hast , Prashant Singh

Deep Unrestricted Document Image Rectification

In recent years, tremendous efforts have been made on document image rectification, but existing advanced algorithms are limited to processing restricted document images, i.e., the input images must incorporate a complete document. Once the…

Computer Vision and Pattern Recognition · Computer Science 2023-12-19 Hao Feng , Shaokai Liu , Jiajun Deng , Wengang Zhou , Houqiang Li

OCR accuracy improvement on document images through a novel pre-processing approach

Digital camera and mobile document image acquisition are new trends arising in the world of Optical Character Recognition and text detection. In some cases, such process integrates many distortions and produces poorly scanned text or…

Computer Vision and Pattern Recognition · Computer Science 2015-09-14 Abdeslam El Harraj , Naoufal Raissouni

An Iterative Fingerprint Enhancement Algorithm Based on Accurate Determination of Orientation Flow

We describe an algorithm to enhance and binarize a fingerprint image. The algorithm is based on accurate determination of orientation flow of the ridges of the fingerprint image by computing variance of the neighborhood pixels around a…

Computer Vision and Pattern Recognition · Computer Science 2009-07-03 Simant Dube

Learned Neural Iterative Decoding for Lossy Image Compression Systems

For lossy image compression systems, we develop an algorithm, iterative refinement, to improve the decoder's reconstruction compared to standard decoding techniques. Specifically, we propose a recurrent neural network approach for…

Computer Vision and Pattern Recognition · Computer Science 2018-11-13 Alexander G. Ororbia , Ankur Mali , Jian Wu , Scott O'Connell , David Miller , C. Lee Giles