Related papers: Image-based table recognition: data, model, and ev…

TableFormer: Table Structure Understanding with Transformers

Tables organize valuable content in a concise and compact representation. This content is extremely valuable for systems such as search engines, Knowledge Graph's, etc, since they enhance their predictive capabilities. Unfortunately, tables…

Computer Vision and Pattern Recognition · Computer Science 2022-03-14 Ahmed Nassar , Nikolaos Livathinos , Maksym Lysak , Peter Staar

TableNet: Deep Learning model for end-to-end Table detection and Tabular data extraction from Scanned Document Images

With the widespread use of mobile phones and scanners to photograph and upload documents, the need for extracting the information trapped in unstructured document images such as retail receipts, insurance claim forms and financial invoices…

Computer Vision and Pattern Recognition · Computer Science 2020-01-07 Shubham Paliwal , Vishwanath D , Rohit Rahul , Monika Sharma , Lovekesh Vig

UniTabNet: Bridging Vision and Language Models for Enhanced Table Structure Recognition

In the digital era, table structure recognition technology is a critical tool for processing and analyzing large volumes of tabular data. Previous methods primarily focus on visual aspects of table structure recovery but often fail to…

Computer Vision and Pattern Recognition · Computer Science 2024-09-23 Zhenrong Zhang , Shuhang Liu , Pengfei Hu , Jiefeng Ma , Jun Du , Jianshu Zhang , Yu Hu

TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content

The automatic recognition of tabular data in document images presents a significant challenge due to the diverse range of table styles and complex structures. Tables offer valuable content representation, enhancing the predictive…

Computer Vision and Pattern Recognition · Computer Science 2024-04-22 Avinash Anand , Raj Jaiswal , Pijush Bhuyan , Mohit Gupta , Siddhesh Bangar , Md. Modassir Imam , Rajiv Ratn Shah , Shin'ichi Satoh

An End-to-End Multi-Task Learning Model for Image-based Table Recognition

Image-based table recognition is a challenging task due to the diversity of table styles and the complexity of table structures. Most of the previous methods focus on a non-end-to-end approach which divides the problem into two separate…

Computer Vision and Pattern Recognition · Computer Science 2023-03-30 Nam Tuan Ly , Atsuhiro Takasu

PubTables-v2: A new large-scale dataset for full-page and multi-page table extraction

Table extraction (TE) is a key challenge in visual document understanding. Traditional approaches detect tables first, then recognize their structure. Recently, interest has surged in developing methods, such as vision-language models…

Computer Vision and Pattern Recognition · Computer Science 2026-03-19 Brandon Smock , Valerie Faucon-Morin , Max Sokolov , Libin Liang , Tayyibah Khanam , Amrit Ramesh , Maury Courtland

Table Structure Recognition with Conditional Attention

Tabular data in digital documents is widely used to express compact and important information for readers. However, it is challenging to parse tables from unstructured digital documents, such as PDFs and images, into machine-readable format…

Computer Vision and Pattern Recognition · Computer Science 2022-03-09 Bin Xiao , Murat Simsek , Burak Kantarci , Ala Abu Alkheir

Complicated Table Structure Recognition

The task of table structure recognition aims to recognize the internal structure of a table, which is a key step to make machines understand tables. Currently, there are lots of studies on this task for different file formats such as ASCII…

Information Retrieval · Computer Science 2019-08-29 Zewen Chi , Heyan Huang , Heng-Da Xu , Houjin Yu , Wanxuan Yin , Xian-Ling Mao

Current Status and Performance Analysis of Table Recognition in Document Images with Deep Neural Networks

The first phase of table recognition is to detect the tabular area in a document. Subsequently, the tabular structures are recognized in the second phase in order to extract information from the respective cells. Table detection and…

Computer Vision and Pattern Recognition · Computer Science 2021-05-11 Khurram Azeem Hashmi , Marcus Liwicki , Didier Stricker , Muhammad Adnan Afzal , Muhammad Ahtsham Afzal , Muhammad Zeshan Afzal

Visual Understanding of Complex Table Structures from Document Images

Table structure recognition is necessary for a comprehensive understanding of documents. Tables in unstructured business documents are tough to parse due to the high diversity of layouts, varying alignments of contents, and the presence of…

Computer Vision and Pattern Recognition · Computer Science 2021-11-16 Sachin Raja , Ajoy Mondal , C V Jawahar

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

An automatic table recognition method for interpretation of tabular data in document images majorly involves solving two problems of table detection and table structure recognition. The prior work involved solving both problems…

Computer Vision and Pattern Recognition · Computer Science 2020-05-29 Devashish Prasad , Ayan Gadpal , Kshitij Kapadni , Manish Visave , Kavita Sultanpure

ClusterTabNet: Supervised clustering method for table detection and table structure recognition

We present a novel deep-learning-based method to cluster words in documents which we apply to detect and recognize tables given the OCR output. We interpret table structure bottom-up as a graph of relations between pairs of words (belonging…

Machine Learning · Computer Science 2024-05-24 Marek Polewczyk , Marco Spinaci

Rethinking Image-based Table Recognition Using Weakly Supervised Methods

Most of the previous methods for table recognition rely on training datasets containing many richly annotated table images. Detailed table image annotation, e.g., cell or text bounding box annotation, however, is costly and often…

Computer Vision and Pattern Recognition · Computer Science 2023-03-16 Nam Tuan Ly , Atsuhiro Takasu , Phuc Nguyen , Hideaki Takeda

PubLayNet: largest dataset ever for document layout analysis

Recognizing the layout of unstructured digital documents is an important step when parsing the documents into structured machine-readable format for downstream applications. Deep neural networks that are developed for computer vision have…

Computation and Language · Computer Science 2019-08-22 Xu Zhong , Jianbin Tang , Antonio Jimeno Yepes

A Review On Table Recognition Based On Deep Learning

Table recognition is using the computer to automatically understand the table, to detect the position of the table from the document or picture, and to correctly extract and identify the internal structure and content of the table. After…

Computer Vision and Pattern Recognition · Computer Science 2023-12-11 Shi Jiyuan , Shi chunqi

Table Structure Recognition using Top-Down and Bottom-Up Cues

Tables are information-rich structured objects in document images. While significant work has been done in localizing tables as graphic objects in document images, only limited attempts exist on table structure recognition. Most existing…

Computer Vision and Pattern Recognition · Computer Science 2020-10-12 Sachin Raja , Ajoy Mondal , C. V. Jawahar

Global Table Extractor (GTE): A Framework for Joint Table Identification and Cell Structure Recognition Using Visual Context

Documents are often used for knowledge sharing and preservation in business and science, within which are tables that capture most of the critical data. Unfortunately, most documents are stored and distributed as PDF or scanned images,…

Computer Vision and Pattern Recognition · Computer Science 2020-12-03 Xinyi Zheng , Doug Burdick , Lucian Popa , Xu Zhong , Nancy Xin Ru Wang

TabAug: Data Driven Augmentation for Enhanced Table Structure Recognition

Table Structure Recognition is an essential part of end-to-end tabular data extraction in document images. The recent success of deep learning model architectures in computer vision remains to be non-reflective in table structure…

Computer Vision and Pattern Recognition · Computer Science 2021-05-18 Umar Khan , Sohaib Zahid , Muhammad Asad Ali , Adnan ul Hassan , Faisal Shafait

Table Structure Extraction with Bi-directional Gated Recurrent Unit Networks

Tables present summarized and structured information to the reader, which makes table structure extraction an important part of document understanding applications. However, table structure identification is a hard problem not only because…

Computer Vision and Pattern Recognition · Computer Science 2020-02-07 Saqib Ali Khan , Syed Muhammad Daniyal Khalid , Muhammad Ali Shahzad , Faisal Shafait

Multi-Type-TD-TSR -- Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: from OCR to Structured Table Representations

As global trends are shifting towards data-driven industries, the demand for automated algorithms that can convert digital images of scanned documents into machine readable information is rapidly growing. Besides the opportunity of data…

Computer Vision and Pattern Recognition · Computer Science 2021-05-25 Pascal Fischer , Alen Smajic , Alexander Mehler , Giuseppe Abrami