Related papers: ExStrucTiny: A Benchmark for Schema-Variable Struc…

Modeling Entities as Semantic Points for Visual Information Extraction in the Wild

Recently, Visual Information Extraction (VIE) has been becoming increasingly important in both the academia and industry, due to the wide range of real-world applications. Previously, numerous works have been proposed to tackle this…

Computer Vision and Pattern Recognition · Computer Science 2023-03-30 Zhibo Yang , Rujiao Long , Pengfei Wang , Sibo Song , Humen Zhong , Wenqing Cheng , Xiang Bai , Cong Yao

From Recognition to Reasoning: Benchmarking and Enhancing MLLMs on Real-World Receipt Document Understanding

Extracting structured information from visual documents (Visual Information Extraction, VIE) is a cornerstone of business automation. While recent Multimodal Large Language Models (MLLMs) have shown promising capabilities, existing…

Computer Vision and Pattern Recognition · Computer Science 2026-05-22 Yandi Wang , Libin Zhan , Ziwei Huang , Tiancheng Luo , Yuxuan Jiang , Wang Dong , Leilei Gan , Jun Chen

Learning to Extract Structured Entities Using Language Models

Recent advances in machine learning have significantly impacted the field of information extraction, with Language Models (LMs) playing a pivotal role in extracting structured information from unstructured text. Prior works typically…

Computation and Language · Computer Science 2024-10-03 Haolun Wu , Ye Yuan , Liana Mikaelyan , Alexander Meulemans , Xue Liu , James Hensman , Bhaskar Mitra

TabLeX: A Benchmark Dataset for Structure and Content Information Extraction from Scientific Tables

Information Extraction (IE) from the tables present in scientific articles is challenging due to complicated tabular representations and complex embedded text. This paper presents TabLeX, a large-scale benchmark dataset comprising table…

Information Retrieval · Computer Science 2021-09-07 Harsh Desai , Pratik Kayal , Mayank Singh

Business Document Information Extraction: Towards Practical Benchmarks

Information extraction from semi-structured documents is crucial for frictionless business-to-business (B2B) communication. While machine learning problems related to Document Information Extraction (IE) have been studied for decades, many…

Information Retrieval · Computer Science 2022-06-23 Matyáš Skalický , Štěpán Šimsa , Michal Uřičář , Milan Šulc

UNIKIE-BENCH: Benchmarking Large Multimodal Models for Key Information Extraction in Visual Documents

Key Information Extraction (KIE) from real-world documents remains challenging due to substantial variations in layout structures, visual quality, and task-specific information requirements. Recent Large Multimodal Models (LMMs) have shown…

Computer Vision and Pattern Recognition · Computer Science 2026-04-27 Yifan Ji , Zhipeng Xu , Zhenghao Liu , Zulong Chen , Qian Zhang , Zhibo Yang , Junyang Lin , Yu Gu , Ge Yu , Maosong Sun

Image2Struct: Benchmarking Structure Extraction for Vision-Language Models

We introduce Image2Struct, a benchmark to evaluate vision-language models (VLMs) on extracting structure from images. Our benchmark 1) captures real-world use cases, 2) is fully automatic and does not require human judgment, and 3) is based…

Computer Vision and Pattern Recognition · Computer Science 2024-10-31 Josselin Somerville Roberts , Tony Lee , Chi Heem Wong , Michihiro Yasunaga , Yifan Mai , Percy Liang

VDInstruct: Zero-Shot Key Information Extraction via Content-Aware Vision Tokenization

Key Information Extraction (KIE) underpins the understanding of visual documents (e.g., receipts and contracts) by extracting precise semantic content and accurately capturing spatial structure. Yet existing multimodal large language models…

Computer Vision and Pattern Recognition · Computer Science 2025-07-15 Son Nguyen , Giang Nguyen , Hung Dao , Thao Do , Daeyoung Kim

Document-level Entity-based Extraction as Template Generation

Document-level entity-based extraction (EE), aiming at extracting entity-centric information such as entity roles and entity relations, is key to automatic knowledge acquisition from text corpora for various domains. Most document-level EE…

Computation and Language · Computer Science 2021-09-13 Kung-Hsiang Huang , Sam Tang , Nanyun Peng

Information Extraction in Low-Resource Scenarios: Survey and Perspective

Information Extraction (IE) seeks to derive structured information from unstructured texts, often facing challenges in low-resource scenarios due to data scarcity and unseen classes. This paper presents a review of neural approaches to…

Computation and Language · Computer Science 2024-10-29 Shumin Deng , Yubo Ma , Ningyu Zhang , Yixin Cao , Bryan Hooi

Beyond Isolated Dots: Benchmarking Structured Table Construction as Deep Knowledge Extraction

With the emergence of large language models (LLMs), there is an expectation that LLMs can effectively extract explicit information from complex real-world documents (e.g., papers, reports). However, most LLMs generate paragraph-style…

Computation and Language · Computer Science 2025-10-31 Tianyun Zhong , Guozhao Mo , Yanjiang Liu , Yihan Chen , Lingdi Kong , Xuanang Chen , Yaojie Lu , Hongyu Lin , Shiwei Ye , Xianpei Han , Ben He , Le Sun

Relation Extraction with Fine-Tuned Large Language Models in Retrieval Augmented Generation Frameworks

Information Extraction (IE) is crucial for converting unstructured data into structured formats like Knowledge Graphs (KGs). A key task within IE is Relation Extraction (RE), which identifies relationships between entities in text. Various…

Computation and Language · Computer Science 2024-06-25 Sefika Efeoglu , Adrian Paschke

Code-MIE: A Code-style Model for Multimodal Information Extraction with Scene Graph and Entity Attribute Knowledge Enhancement

With the rapid development of large language models (LLMs), more and more researchers have paid attention to information extraction based on LLMs. However, there are still some spaces to improve in the existing related methods. First,…

Computation and Language · Computer Science 2026-03-24 Jiang Liu , Ge Qiu , Hao Fei , Dongdong Xie , Jinbo Li , Fei Li , Chong Teng , Donghong Ji

VRDU: A Benchmark for Visually-rich Document Understanding

Understanding visually-rich business documents to extract structured data and automate business workflows has been receiving attention both in academia and industry. Although recent multi-modal language models have achieved impressive…

Computation and Language · Computer Science 2023-09-19 Zilong Wang , Yichao Zhou , Wei Wei , Chen-Yu Lee , Sandeep Tata

REXEL: An End-to-end Model for Document-Level Relation Extraction and Entity Linking

Extracting structured information from unstructured text is critical for many downstream NLP applications and is traditionally achieved by closed information extraction (cIE). However, existing approaches for cIE suffer from two…

Computation and Language · Computer Science 2024-04-22 Nacime Bouziani , Shubhi Tyagi , Joseph Fisher , Jens Lehmann , Andrea Pierleoni

Cost-effective End-to-end Information Extraction for Semi-structured Document Images

A real-world information extraction (IE) system for semi-structured document images often involves a long pipeline of multiple modules, whose complexity dramatically increases its development and maintenance cost. One can instead consider…

Computation and Language · Computer Science 2021-08-31 Wonseok Hwang , Hyunji Lee , Jinyeong Yim , Geewook Kim , Minjoon Seo

Integrating Local Context and Global Cohesiveness for Open Information Extraction

Extracting entities and their relations from text is an important task for understanding massive text corpora. Open information extraction (IE) systems mine relation tuples (i.e., entity arguments and a predicate string to describe their…

Computation and Language · Computer Science 2018-12-04 Qi Zhu , Xiang Ren , Jingbo Shang , Yu Zhang , Ahmed El-Kishky , Jiawei Han

KIEval: Evaluation Metric for Document Key Information Extraction

Document Key Information Extraction (KIE) is a technology that transforms valuable information in document images into structured data, and it has become an essential function in industrial settings. However, current evaluation metrics of…

Computation and Language · Computer Science 2025-03-27 Minsoo Khang , Sang Chul Jung , Sungrae Park , Teakgyu Hong

Benchmarking Large Language Models with Augmented Instructions for Fine-grained Information Extraction

Information Extraction (IE) is an essential task in Natural Language Processing. Traditional methods have relied on coarse-grained extraction with simple instructions. However, with the emergence of Large Language Models (LLMs), there is a…

Computation and Language · Computer Science 2023-10-10 Jun Gao , Huan Zhao , Yice Zhang , Wei Wang , Changlong Yu , Ruifeng Xu

Joint Event Extraction via Structural Semantic Matching

Event Extraction (EE) is one of the essential tasks in information extraction, which aims to detect event mentions from text and find the corresponding argument roles. The EE task can be abstracted as a process of matching the semantic…

Computation and Language · Computer Science 2023-06-07 Haochen Li , Tianhao Gao , Jingkun Wang , Weiping Li