Related papers: Deep Unrestricted Document Image Rectification

DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction

In this work, we propose a new framework, called Document Image Transformer (DocTr), to address the issue of geometry and illumination distortion of the document images. Specifically, DocTr consists of a geometric unwarping transformer and…

Computer Vision and Pattern Recognition · Computer Science 2022-10-11 Hao Feng , Yuechen Wang , Wengang Zhou , Jiajun Deng , Houqiang Li

DocScanner: Robust Document Image Rectification with Progressive Learning

Compared with flatbed scanners, portable smartphones provide more convenience for physical document digitization. However, such digitized documents are often distorted due to uncontrolled physical deformations, camera positions, and…

Computer Vision and Pattern Recognition · Computer Science 2022-12-27 Hao Feng , Wengang Zhou , Jiajun Deng , Qi Tian , Houqiang Li

DocEnTr: An End-to-End Document Image Enhancement Transformer

Document images can be affected by many degradation scenarios, which cause recognition and processing difficulties. In this age of digitization, it is important to denoise them for proper usage. To address this challenge, we present a new…

Computer Vision and Pattern Recognition · Computer Science 2022-01-26 Mohamed Ali Souibgui , Sanket Biswas , Sana Khamekhem Jemni , Yousri Kessentini , Alicia Fornés , Josep Lladós , Umapada Pal

DocMAE: Document Image Rectification via Self-supervised Representation Learning

Tremendous efforts have been made on document image rectification, but how to learn effective representation of such distorted images is still under-explored. In this paper, we present DocMAE, a novel self-supervised framework for document…

Computer Vision and Pattern Recognition · Computer Science 2023-04-21 Shaokai Liu , Hao Feng , Wengang Zhou , Houqiang Li , Cong Liu , Feng Wu

OCR accuracy improvement on document images through a novel pre-processing approach

Digital camera and mobile document image acquisition are new trends arising in the world of Optical Character Recognition and text detection. In some cases, such process integrates many distortions and produces poorly scanned text or…

Computer Vision and Pattern Recognition · Computer Science 2015-09-14 Abdeslam El Harraj , Naoufal Raissouni

Geometric Rectification of Creased Document Images based on Isometric Mapping

Geometric rectification of images of distorted documents finds wide applications in document digitization and Optical Character Recognition (OCR). Although smoothly curved deformations have been widely investigated by many works, the most…

Computer Vision and Pattern Recognition · Computer Science 2022-12-19 Dong Luo , Pengbo Bo

Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network

As camera-based documents are increasingly used, the rectification of distorted document images becomes a need to improve the recognition performance. In this paper, we propose a novel framework for both rectifying distorted document image…

Computer Vision and Pattern Recognition · Computer Science 2021-04-15 Guo-Wang Xie , Fei Yin , Xu-Yao Zhang , Cheng-Lin Liu

DocDiff: Document Enhancement via Residual Diffusion Models

Removing degradation from document images not only improves their visual quality and readability, but also enhances the performance of numerous automated document analysis and recognition tasks. However, existing regression-based methods…

Computer Vision and Pattern Recognition · Computer Science 2023-08-10 Zongyuan Yang , Baolin Liu , Yongping Xiong , Lan Yi , Guibin Wu , Xiaojun Tang , Ziqi Liu , Junjie Zhou , Xing Zhang

Deep Learning-based Forgery Attack on Document Images

With the ongoing popularization of online services, the digital document images have been used in various applications. Meanwhile, there have emerged some deep learning-based text editing algorithms which alter the textual information of an…

Multimedia · Computer Science 2021-09-13 Lin Zhao , Changsheng Chen , Jiwu Huang

Image Correction via Deep Reciprocating HDR Transformation

Image correction aims to adjust an input image into a visually pleasing one. Existing approaches are proposed mainly from the perspective of image pixel manipulation. They are not effective to recover the details in the under/over exposed…

Computer Vision and Pattern Recognition · Computer Science 2018-04-13 Xin Yang , Ke Xu , Yibing Song , Qiang Zhang , Xiaopeng Wei , Rynson Lau

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Document image restoration is a crucial aspect of Document AI systems, as the quality of document images significantly influences the overall performance. Prevailing methods address distinct restoration tasks independently, leading to…

Computer Vision and Pattern Recognition · Computer Science 2024-05-08 Jiaxin Zhang , Dezhi Peng , Chongyu Liu , Peirong Zhang , Lianwen Jin

Geometric Representation Learning for Document Image Rectification

In document image rectification, there exist rich geometric constraints between the distorted image and the ground truth one. However, such geometric constraints are largely ignored in existing advanced solutions, which limits the…

Computer Vision and Pattern Recognition · Computer Science 2022-10-18 Hao Feng , Wengang Zhou , Jiajun Deng , Yuechen Wang , Houqiang Li

TextIR: A Simple Framework for Text-based Editable Image Restoration

Most existing image restoration methods use neural networks to learn strong image-level priors from huge data to estimate the lost information. However, these works still struggle in cases when images have severe information deficits.…

Computer Vision and Pattern Recognition · Computer Science 2023-03-01 Yunpeng Bai , Cairong Wang , Shuzhao Xie , Chao Dong , Chun Yuan , Zhi Wang

DeepOtsu: Document Enhancement and Binarization using Iterative Deep Learning

This paper presents a novel iterative deep learning framework and apply it for document enhancement and binarization. Unlike the traditional methods which predict the binary label of each pixel on the input image, we train the neural…

Computer Vision and Pattern Recognition · Computer Science 2019-01-21 Sheng He , Lambert Schomaker

A Deep Ordinal Distortion Estimation Approach for Distortion Rectification

Distortion is widely existed in the images captured by popular wide-angle cameras and fisheye cameras. Despite the long history of distortion rectification, accurately estimating the distortion parameters from a single distorted image is…

Computer Vision and Pattern Recognition · Computer Science 2024-04-30 Kang Liao , Chunyu Lin , Yao Zhao

Cascaded Robust Rectification for Arbitrary Document Images

Document rectification in real-world scenarios poses significant challenges due to extreme variations in camera perspectives and physical distortions. Driven by the insight that complex transformations can be decomposed and resolved…

Computer Vision and Pattern Recognition · Computer Science 2025-12-01 Chaoyun Wang , Quanxin Huang , I-Chao Shen , Takeo Igarashi , Nanning Zheng , Caigui Jiang

Divide and Restore: A Modular Task-Decoupled Framework for Universal Image Restoration

Restoring images affected by various types of degradation, such as noise, blur, or improper exposure, remains a significant challenge in computer vision. While recent trends favor complex monolithic all-in-one architectures, these models…

Computer Vision and Pattern Recognition · Computer Science 2026-03-31 Joanna Wiekiera , Martyna Zur

Document Image Rectification Bases on Self-Adaptive Multitask Fusion

Deformed document image rectification is essential for real-world document understanding tasks, such as layout analysis and text recognition. However, current multi-task methods -- such as background removal, 3D coordinate prediction, and…

Computer Vision and Pattern Recognition · Computer Science 2025-05-12 Heng Li , Xiangping Wu , Qingcai Chen

Efficient Document Image Dewarping via Hybrid Deep Learning and Cubic Polynomial Geometry Restoration

Camera-captured document images often suffer from geometric distortions caused by paper deformation, perspective distortion, and lens aberrations, significantly reducing OCR accuracy. This study develops an efficient automated method for…

Computer Vision and Pattern Recognition · Computer Science 2025-11-20 Valery Istomin , Oleg Pereziabov , Ilya Afanasyev

Accurate, Data-Efficient, Unconstrained Text Recognition with Convolutional Neural Networks

Unconstrained text recognition is an important computer vision task, featuring a wide variety of different sub-tasks, each with its own set of challenges. One of the biggest promises of deep neural networks has been the convergence and…

Computer Vision and Pattern Recognition · Computer Science 2019-01-01 Mohamed Yousef , Khaled F. Hussain , Usama S. Mohammed