Related papers: A selectional auto-encoder approach for document i…

An Analytical Study of different Document Image Binarization Methods

Document image has been the area of research for a couple of decades because of its potential application in the area of text recognition, line recognition or any other shape recognition from the image. For most of these purposes…

Computer Vision and Pattern Recognition · Computer Science 2015-02-02 Mahua Nandy , Satadal Saha

Automatic Document Image Binarization using Bayesian Optimization

Document image binarization is often a challenging task due to various forms of degradation. Although there exist several binarization techniques in literature, the binarized image is typically sensitive to control parameter settings of the…

Information Retrieval · Computer Science 2018-02-22 Ekta Vats , Anders Hast , Prashant Singh

PDNet: Semantic Segmentation integrated with a Primal-Dual Network for Document binarization

Binarization of digital documents is the task of classifying each pixel in an image of the document as belonging to the background (parchment/paper) or foreground (text/ink). Historical documents are often subjected to degradations, that…

Machine Learning · Statistics 2018-05-18 Kalyan Ram Ayyalasomayajula , Filip Malmberg , Anders Brun

Learning Document Image Binarization from Data

In this paper we present a fully trainable binarization solution for degraded document images. Unlike previous attempts that often used simple features with a series of pre- and post-processing, our solution encodes all heuristics about…

Computer Vision and Pattern Recognition · Computer Science 2015-05-05 Yue Wu , Stephen Rawls , Wael AbdAlmageed , Premkumar Natarajan

Degraded Historical Documents Images Binarization Using a Combination of Enhanced Techniques

Document image binarization is the initial step and a crucial in many document analysis and recognition scheme. In fact, it is still a relevant research subject and a fundamental challenge due to its importance and influence. This paper…

Computer Vision and Pattern Recognition · Computer Science 2019-01-29 Omar Boudraa , Walid Khaled Hidouci , Dominique Michelucci

A Fair Evaluation of Various Deep Learning-Based Document Image Binarization Approaches

Binarization of document images is an important pre-processing step in the field of document analysis. Traditional image binarization techniques usually rely on histograms or local statistics to identify a valid threshold to differentiate…

Computer Vision and Pattern Recognition · Computer Science 2024-01-23 Richin Sukesh , Mathias Seuret , Anguelos Nicolaou , Martin Mayr , Vincent Christlein

Unsupervised Neural Domain Adaptation for Document Image Binarization

Binarization is a well-known image processing task, whose objective is to separate the foreground of an image from the background. One of the many tasks for which it is useful is that of preprocessing document images in order to identify…

Computer Vision and Pattern Recognition · Computer Science 2021-07-02 Francisco J. Castellanos , Antonio-Javier Gallego , Jorge Calvo-Zaragoza

Image Denoising Using Convolutional Autoencoder

With the inexorable digitalisation of the modern world, every subset in the field of technology goes through major advancements constantly. One such subset is digital images which are ever so popular. Images can not always be as visually…

Computer Vision and Pattern Recognition · Computer Science 2022-07-26 Prashanth Venkataraman

Hashing with binary autoencoders

An attractive approach for fast search in image databases is binary hashing, where each high-dimensional, real-valued image is mapped onto a low-dimensional, binary vector and the search is done in this binary space. Finding the optimal…

Machine Learning · Computer Science 2015-01-23 Miguel Á. Carreira-Perpiñán , Ramin Raziperchikolaei

Two-stage generative adversarial networks for document image binarization with color noise and background removal

Document image enhancement and binarization methods are often used to improve the accuracy and efficiency of document image analysis tasks such as text recognition. Traditional non-machine-learning methods are constructed on low-level…

Computer Vision and Pattern Recognition · Computer Science 2021-04-28 Sungho Suh , Jihun Kim , Paul Lukowicz , Yong Oh Lee

Document Image Binarization with Fully Convolutional Neural Networks

Binarization of degraded historical manuscript images is an important pre-processing step for many document processing tasks. We formulate binarization as a pixel classification learning task and apply a novel Fully Convolutional Network…

Computer Vision and Pattern Recognition · Computer Science 2017-08-11 Chris Tensmeyer , Tony Martinez

Improving Document Binarization via Adversarial Noise-Texture Augmentation

Binarization of degraded document images is an elementary step in most of the problems in document image analysis domain. The paper re-visits the binarization problem by introducing an adversarial learning approach. We construct a Texture…

Computer Vision and Pattern Recognition · Computer Science 2019-05-02 Ankan Kumar Bhunia , Ayan Kumar Bhunia , Aneeshan Sain , Partha Pratim Roy

Variational Augmentation for Enhancing Historical Document Image Binarization

Historical Document Image Binarization is a well-known segmentation problem in image processing. Despite ubiquity, traditional thresholding algorithms achieved limited success on severely degraded document images. With the advent of deep…

Computer Vision and Pattern Recognition · Computer Science 2022-11-15 Avirup Dey , Nibaran Das , Mita Nasipuri

Document Image Binarization in JPEG Compressed Domain using Dual Discriminator Generative Adversarial Networks

Image binarization techniques are being popularly used in enhancement of noisy and/or degraded images catering different Document Image Anlaysis (DIA) applications like word spotting, document retrieval, and OCR. Most of the existing…

Computer Vision and Pattern Recognition · Computer Science 2022-09-14 Bulla Rajesh , Manav Kamlesh Agrawal , Milan Bhuva , Kisalaya Kishore , Mohammed Javed

DocBinFormer: A Two-Level Transformer Network for Effective Document Image Binarization

In real life, various degradation scenarios exist that might damage document images, making it harder to recognize and analyze them, thus binarization is a fundamental and crucial step for achieving the most optimal performance in any…

Computer Vision and Pattern Recognition · Computer Science 2023-12-07 Risab Biswas , Swalpa Kumar Roy , Ning Wang , Umapada Pal , Guang-Bin Huang

DeepOtsu: Document Enhancement and Binarization using Iterative Deep Learning

This paper presents a novel iterative deep learning framework and apply it for document enhancement and binarization. Unlike the traditional methods which predict the binary label of each pixel on the input image, we train the neural…

Computer Vision and Pattern Recognition · Computer Science 2019-01-21 Sheng He , Lambert Schomaker

Binarizing Documents by Leveraging both Space and Frequency

Document Image Binarization is a well-known problem in Document Analysis and Computer Vision, although it is far from being solved. One of the main challenges of this task is that documents generally exhibit degradations and acquisition…

Computer Vision and Pattern Recognition · Computer Science 2024-04-29 Fabio Quattrini , Vittorio Pippi , Silvia Cascianelli , Rita Cucchiara

Modelling, Visualising and Summarising Documents with a Single Convolutional Neural Network

Capturing the compositional process which maps the meaning of words to that of documents is a central challenge for researchers in Natural Language Processing and Information Retrieval. We introduce a model that is able to represent the…

Computation and Language · Computer Science 2014-06-17 Misha Denil , Alban Demiraj , Nal Kalchbrenner , Phil Blunsom , Nando de Freitas

BiNet: Degraded-Manuscript Binarization in Diverse Document Textures and Layouts using Deep Encoder-Decoder Networks

Handwritten document-image binarization is a semantic segmentation process to differentiate ink pixels from background pixels. It is one of the essential steps towards character recognition, writer identification, and script-style evolution…

Computer Vision and Pattern Recognition · Computer Science 2026-01-06 Maruf A. Dhali , Jan Willem de Wit , Lambert Schomaker

Learning to Extract Semantic Structure from Documents Using Multimodal Fully Convolutional Neural Network

We present an end-to-end, multimodal, fully convolutional network for extracting semantic structures from document images. We consider document semantic structure extraction as a pixel-wise segmentation task, and propose a unified model…

Computer Vision and Pattern Recognition · Computer Science 2017-06-09 Xiao Yang , Ersin Yumer , Paul Asente , Mike Kraley , Daniel Kifer , C. Lee Giles