Related papers: DocScanner: Robust Document Image Rectification wi…

Deep Unrestricted Document Image Rectification

In recent years, tremendous efforts have been made on document image rectification, but existing advanced algorithms are limited to processing restricted document images, i.e., the input images must incorporate a complete document. Once the…

Computer Vision and Pattern Recognition · Computer Science · 2023-12-19 · Hao Feng, Shaokai Liu, Jiajun Deng, Wengang Zhou, Houqiang Li

Geometric Representation Learning for Document Image Rectification

In document image rectification, there exist rich geometric constraints between the distorted image and the ground truth one. However, such geometric constraints are largely ignored in existing advanced solutions, which limits the…

Computer Vision and Pattern Recognition · Computer Science · 2022-10-18 · Hao Feng, Wengang Zhou, Jiajun Deng, Yuechen Wang, Houqiang Li

DocTr: Document Image Transformer for Geometric Unwarping and Illumination Correction

In this work, we propose a new framework, called Document Image Transformer (DocTr), to address the issue of geometry and illumination distortion of the document images. Specifically, DocTr consists of a geometric unwarping transformer and…

Computer Vision and Pattern Recognition · Computer Science · 2022-10-11 · Hao Feng, Yuechen Wang, Wengang Zhou, Jiajun Deng, Houqiang Li

OCR accuracy improvement on document images through a novel pre-processing approach

Digital camera and mobile document image acquisition are new trends arising in the world of Optical Character Recognition and text detection. In some cases, such process integrates many distortions and produces poorly scanned text or…

Computer Vision and Pattern Recognition · Computer Science · 2015-09-14 · Abdeslam El Harraj, Naoufal Raissouni

DocAligner: Annotating Real-world Photographic Document Images by Simply Taking Pictures

Recently, there has been a growing interest in research concerning document image analysis and recognition in photographic scenarios. However, the lack of labeled datasets for this emerging challenge poses a significant obstacle, as manual…

Computer Vision and Pattern Recognition · Computer Science · 2023-06-13 · Jiaxin Zhang, Bangdong Chen, Hiuyi Cheng, Fengjun Guo, Kai Ding, Lianwen Jin

Dewarping Document Image By Displacement Flow Estimation with Fully Convolutional Network

As camera-based documents are increasingly used, the rectification of distorted document images becomes a need to improve the recognition performance. In this paper, we propose a novel framework for both rectifying distorted document image…

Computer Vision and Pattern Recognition · Computer Science · 2021-04-15 · Guo-Wang Xie, Fei Yin, Xu-Yao Zhang, Cheng-Lin Liu

TADoc: Robust Time-Aware Document Image Dewarping

Flattening curved, wrinkled, and rotated document images captured by portable photographing devices, termed document image dewarping, has become an increasingly important task with the rise of digital economy and online working. Although…

Computer Vision and Pattern Recognition · Computer Science · 2025-08-12 · Fangmin Zhao, Weichao Zeng, Zhenhang Li, Dongbao Yang, Yu Zhou

DocMAE: Document Image Rectification via Self-supervised Representation Learning

Tremendous efforts have been made on document image rectification, but how to learn effective representation of such distorted images is still under-explored. In this paper, we present DocMAE, a novel self-supervised framework for document…

Computer Vision and Pattern Recognition · Computer Science · 2023-04-21 · Shaokai Liu, Hao Feng, Wengang Zhou, Houqiang Li, Cong Liu, Feng Wu

Geometric Rectification of Creased Document Images based on Isometric Mapping

Geometric rectification of images of distorted documents finds wide applications in document digitization and Optical Character Recognition (OCR). Although smoothly curved deformations have been widely investigated by many works, the most…

Computer Vision and Pattern Recognition · Computer Science · 2022-12-19 · Dong Luo, Pengbo Bo

Efficient Document Image Dewarping via Hybrid Deep Learning and Cubic Polynomial Geometry Restoration

Camera-captured document images often suffer from geometric distortions caused by paper deformation, perspective distortion, and lens aberrations, significantly reducing OCR accuracy. This study develops an efficient automated method for…

Computer Vision and Pattern Recognition · Computer Science · 2025-11-20 · Valery Istomin, Oleg Pereziabov, Ilya Afanasyev

Document Dewarping with Control Points

Document images are now widely captured by handheld devices such as mobile phones. The OCR performance on these images are largely affected due to geometric distortion of the document paper, diverse camera positions and complex backgrounds.…

Computer Vision and Pattern Recognition · Computer Science · 2022-03-22 · Guo-Wang Xie, Fei Yin, Xu-Yao Zhang, Cheng-Lin Liu

Deep Photo Scan: Semi-Supervised Learning for dealing with the real-world degradation in Smartphone Photo Scanning

Physical photographs now can be conveniently scanned by smartphones and stored forever as a digital version, yet the scanned photos are not restored well. One solution is to train a supervised deep neural network on many digital photos and…

Computer Vision and Pattern Recognition · Computer Science · 2021-08-19 · Man M. Ho, Jinjia Zhou

DocStormer: Revitalizing Multi-Degraded Colored Document Images to Pristine PDF

For capturing colored document images, e.g. posters and magazines, it is common that multiple degradations such as shadows, wrinkles, etc., are simultaneously introduced due to external factors. Restoring multi-degraded colored document…

Computer Vision and Pattern Recognition · Computer Science · 2023-10-30 · Chaowei Liu, Jichun Li, Yihua Teng, Chaoqun Wang, Nuo Xu, Jihao Wu, Dandan Tu

Cascaded Robust Rectification for Arbitrary Document Images

Document rectification in real-world scenarios poses significant challenges due to extreme variations in camera perspectives and physical distortions. Driven by the insight that complex transformations can be decomposed and resolved…

Computer Vision and Pattern Recognition · Computer Science · 2025-12-01 · Chaoyun Wang, Quanxin Huang, I-Chao Shen, Takeo Igarashi, Nanning Zheng, Caigui Jiang

Handheld Video Document Scanning: A Robust On-Device Model for Multi-Page Document Scanning

Document capture applications on smartphones have emerged as popular tools for digitizing documents. For many individuals, capturing documents with their smartphones is more convenient than using dedicated photocopiers or scanners, even if…

Computer Vision and Pattern Recognition · Computer Science · 2024-11-04 · Curtis Wigington

DocDiff: Document Enhancement via Residual Diffusion Models

Removing degradation from document images not only improves their visual quality and readability, but also enhances the performance of numerous automated document analysis and recognition tasks. However, existing regression-based methods…

Computer Vision and Pattern Recognition · Computer Science · 2023-08-10 · Zongyuan Yang, Baolin Liu, Yongping Xiong, Lan Yi, Guibin Wu, Xiaojun Tang, Ziqi Liu, Junjie Zhou, Xing Zhang

DocSynth: A Layout Guided Approach for Controllable Document Image Synthesis

Despite significant progress on current state-of-the-art image generation models, synthesis of document images containing multiple and complex object layouts is a challenging task. This paper presents a novel approach, called DocSynth, to…

Computer Vision and Pattern Recognition · Computer Science · 2021-07-07 · Sanket Biswas, Pau Riba, Josep Lladós, Umapada Pal

ForCenNet: Foreground-Centric Network for Document Image Rectification

Document image rectification aims to eliminate geometric deformation in photographed documents to facilitate text recognition. However, existing methods often neglect the significance of foreground elements, which provide essential…

Computer Vision and Pattern Recognition · Computer Science · 2025-07-29 · Peng Cai, Qiang Li, Kaicheng Yang, Dong Guo, Jia Li, Nan Zhou, Xiang An, Ninghua Yang, Jiankang Deng

D2Dewarp: Dual Dimensions Geometric Representation Learning Based Document Image Dewarping

Document image dewarping remains a challenging task in the deep learning era. While existing methods have improved by leveraging text line awareness, they typically focus only on a single horizontal dimension. In this paper, we propose a…

Computer Vision and Pattern Recognition · Computer Science · 2026-03-05 · Heng Li, Xiangping Wu, Qingcai Chen

DocRes: A Generalist Model Toward Unifying Document Image Restoration Tasks

Document image restoration is a crucial aspect of Document AI systems, as the quality of document images significantly influences the overall performance. Prevailing methods address distinct restoration tasks independently, leading to…

Computer Vision and Pattern Recognition · Computer Science · 2024-05-08 · Jiaxin Zhang, Dezhi Peng, Chongyu Liu, Peirong Zhang, Lianwen Jin