Related papers: PICK: Processing Key Information Extraction from D…

One-shot Key Information Extraction from Document with Deep Partial Graph Matching

Automating the Key Information Extraction (KIE) from documents improves efficiency, productivity, and security in many industrial scenarios such as rapid indexing and archiving. Many existing supervised learning methods for the KIE task…

Computer Vision and Pattern Recognition · Computer Science 2021-09-30 Minghong Yao , Zhiguang Liu , Liangwei Wang , Houqiang Li , Liansheng Zhuang

Information Extraction from Unstructured data using Augmented-AI and Computer Vision

Information extraction (IE) from unstructured documents remains a critical challenge in data processing pipelines. Traditional optical character recognition (OCR) methods and conventional parsing engines demonstrate limited effectiveness…

Computer Vision and Pattern Recognition · Computer Science 2025-07-28 Aditya Parikh

GraphRevisedIE: Multimodal Information Extraction with Graph-Revised Network

Key information extraction (KIE) from visually rich documents (VRD) has been a challenging task in document intelligence because of not only the complicated and diverse layouts of VRD that make the model hard to generalize but also the lack…

Information Retrieval · Computer Science 2024-10-03 Panfeng Cao , Jian Wu

Key Information Extraction From Documents: Evaluation And Generator

Extracting information from documents usually relies on natural language processing methods working on one-dimensional sequences of text. In some cases, for example, for the extraction of key information from semi-structured documents, such…

Computation and Language · Computer Science 2021-06-29 Oliver Bensch , Mirela Popa , Constantin Spille

\'Eclair -- Extracting Content and Layout with Integrated Reading Order for Documents

Optical Character Recognition (OCR) technology is widely used to extract text from images of documents, facilitating efficient digitization and data retrieval. However, merely extracting text is insufficient when dealing with complex…

Computer Vision and Pattern Recognition · Computer Science 2025-02-07 Ilia Karmanov , Amala Sanjay Deshmukh , Lukas Voegtle , Philipp Fischer , Kateryna Chumachenko , Timo Roman , Jarno Seppänen , Jupinder Parmar , Joseph Jennings , Andrew Tao , Karan Sapra

RDU: A Region-based Approach to Form-style Document Understanding

Key Information Extraction (KIE) is aimed at extracting structured information (e.g. key-value pairs) from form-style documents (e.g. invoices), which makes an important step towards intelligent document understanding. Previous approaches…

Artificial Intelligence · Computer Science 2022-06-15 Fengbin Zhu , Chao Wang , Wenqiang Lei , Ziyang Liu , Tat Seng Chua

Digitization of Document and Information Extraction using OCR

Retrieving accurate details from documents is a crucial task, especially when handling a combination of scanned images and native digital formats. This document presents a combined framework for text extraction that merges Optical Character…

Computer Vision and Pattern Recognition · Computer Science 2025-06-16 Rasha Sinha , Rekha B S

CUTIE: Learning to Understand Documents with Convolutional Universal Text Information Extractor

Extracting key information from documents, such as receipts or invoices, and preserving the interested texts to structured data is crucial in the document-intensive streamline processes of office automation in areas that includes but not…

Computer Vision and Pattern Recognition · Computer Science 2019-06-21 Xiaohui Zhao , Endi Niu , Zhuo Wu , Xiaoguang Wang

Key Information Extraction in Purchase Documents using Deep Learning and Rule-based Corrections

Deep Learning (DL) is dominating the fields of Natural Language Processing (NLP) and Computer Vision (CV) in the recent times. However, DL commonly relies on the availability of large data annotations, so other alternative or complementary…

Computer Vision and Pattern Recognition · Computer Science 2022-10-10 Roberto Arroyo , Javier Yebes , Elena Martínez , Héctor Corrales , Javier Lorenzo

BROS: A Pre-trained Language Model Focusing on Text and Layout for Better Key Information Extraction from Documents

Key information extraction (KIE) from document images requires understanding the contextual and spatial semantics of texts in two-dimensional (2D) space. Many recent studies try to solve the task by developing pre-trained language models…

Computation and Language · Computer Science 2022-04-06 Teakgyu Hong , Donghyun Kim , Mingi Ji , Wonseok Hwang , Daehyun Nam , Sungrae Park

Deep Reader: Information extraction from Document images via relation extraction and Natural Language

Recent advancements in the area of Computer Vision with state-of-art Neural Networks has given a boost to Optical Character Recognition (OCR) accuracies. However, extracting characters/text alone is often insufficient for relevant…

Computer Vision and Pattern Recognition · Computer Science 2018-12-17 Vishwanath D , Rohit Rahul , Gunjan Sehgal , Swati , Arindam Chowdhury , Monika Sharma , Lovekesh Vig , Gautam Shroff , Ashwin Srinivasan

DocTr: Document Transformer for Structured Information Extraction in Documents

We present a new formulation for structured information extraction (SIE) from visually rich documents. It aims to address the limitations of existing IOB tagging or graph-based formulations, which are either overly reliant on the correct…

Computer Vision and Pattern Recognition · Computer Science 2023-07-18 Haofu Liao , Aruni RoyChowdhury , Weijian Li , Ankan Bansal , Yuting Zhang , Zhuowen Tu , Ravi Kumar Satzoda , R. Manmatha , Vijay Mahadevan

PPN: Parallel Pointer-based Network for Key Information Extraction with Complex Layouts

Key Information Extraction (KIE) is a challenging multimodal task that aims to extract structured value semantic entities from visually rich documents. Although significant progress has been made, there are still two major challenges that…

Artificial Intelligence · Computer Science 2023-07-21 Kaiwen Wei , Jie Yao , Jingyuan Zhang , Yangyang Kang , Fubang Zhao , Yating Zhang , Changlong Sun , Xin Jin , Xin Zhang

End-to-End Document Classification and Key Information Extraction using Assignment Optimization

We propose end-to-end document classification and key information extraction (KIE) for automating document processing in forms. Through accurate document classification we harness known information from templates to enhance KIE from forms.…

Information Retrieval · Computer Science 2023-06-02 Ciaran Cooney , Joana Cavadas , Liam Madigan , Bradley Savage , Rachel Heyburn , Mairead O'Cuinn

KIEval: Evaluation Metric for Document Key Information Extraction

Document Key Information Extraction (KIE) is a technology that transforms valuable information in document images into structured data, and it has become an essential function in industrial settings. However, current evaluation metrics of…

Computation and Language · Computer Science 2025-03-27 Minsoo Khang , Sang Chul Jung , Sungrae Park , Teakgyu Hong

ChartEye: A Deep Learning Framework for Chart Information Extraction

The widespread use of charts and infographics as a means of data visualization in various domains has inspired recent research in automated chart understanding. However, information extraction from chart images is a complex multitasked…

Computer Vision and Pattern Recognition · Computer Science 2024-08-30 Osama Mustafa , Muhammad Khizer Ali , Momina Moetesum , Imran Siddiqi

Information Extraction from Visually Rich Documents with Font Style Embeddings

Information extraction (IE) from documents is an intensive area of research with a large set of industrial applications. Current state-of-the-art methods focus on scanned documents with approaches combining computer vision, natural language…

Computation and Language · Computer Science 2022-08-16 Ismail Oussaid , William Vanhuffel , Pirashanth Ratnamogan , Mhamed Hajaiej , Alexis Mathey , Thomas Gilles

Visual Information Extraction in the Wild: Practical Dataset and End-to-end Solution

Visual information extraction (VIE), which aims to simultaneously perform OCR and information extraction in a unified framework, has drawn increasing attention due to its essential role in various applications like understanding receipts,…

Computer Vision and Pattern Recognition · Computer Science 2023-06-16 Jianfeng Kuang , Wei Hua , Dingkang Liang , Mingkun Yang , Deqiang Jiang , Bo Ren , Xiang Bai

Graph Convolution for Multimodal Information Extraction from Visually Rich Documents

Visually rich documents (VRDs) are ubiquitous in daily business and life. Examples are purchase receipts, insurance policy documents, custom declaration forms and so on. In VRDs, visual and layout information is critical for document…

Information Retrieval · Computer Science 2019-03-28 Xiaojing Liu , Feiyu Gao , Qiong Zhang , Huasha Zhao

MatchVIE: Exploiting Match Relevancy between Entities for Visual Information Extraction

Visual Information Extraction (VIE) task aims to extract key information from multifarious document images (e.g., invoices and purchase receipts). Most previous methods treat the VIE task simply as a sequence labeling problem or…

Computer Vision and Pattern Recognition · Computer Science 2021-06-25 Guozhi Tang , Lele Xie , Lianwen Jin , Jiapeng Wang , Jingdong Chen , Zhen Xu , Qianying Wang , Yaqiang Wu , Hui Li