Related papers: Improving Table Structure Recognition with Visual-…

SEMv2: Table Separation Line Detection Based on Instance Segmentation

Table structure recognition is an indispensable element for enabling machines to comprehend tables. Its primary purpose is to identify the internal structure of a table. Nevertheless, due to the complexity and diversity of their structure…

Computer Vision and Pattern Recognition · Computer Science 2024-01-15 Zhenrong Zhang , Pengfei Hu , Jiefeng Ma , Jun Du , Jianshu Zhang , Huihui Zhu , Baocai Yin , Bing Yin , Cong Liu

TRUST: An Accurate and End-to-End Table structure Recognizer Using Splitting-based Transformers

Table structure recognition is a crucial part of document image analysis domain. Its difficulty lies in the need to parse the physical coordinates and logical indices of each cell at the same time. However, the existing methods are…

Computer Vision and Pattern Recognition · Computer Science 2022-09-01 Zengyuan Guo , Yuechen Yu , Pengyuan Lv , Chengquan Zhang , Haojie Li , Zhihui Wang , Kun Yao , Jingtuo Liu , Jingdong Wang

Table Structure Recognition with Conditional Attention

Tabular data in digital documents is widely used to express compact and important information for readers. However, it is challenging to parse tables from unstructured digital documents, such as PDFs and images, into machine-readable format…

Computer Vision and Pattern Recognition · Computer Science 2022-03-09 Bin Xiao , Murat Simsek , Burak Kantarci , Ala Abu Alkheir

CascadeTabNet: An approach for end to end table detection and structure recognition from image-based documents

An automatic table recognition method for interpretation of tabular data in document images majorly involves solving two problems of table detection and table structure recognition. The prior work involved solving both problems…

Computer Vision and Pattern Recognition · Computer Science 2020-05-29 Devashish Prasad , Ayan Gadpal , Kshitij Kapadni , Manish Visave , Kavita Sultanpure

Post-Processing Mask-Based Table Segmentation for Structural Coordinate Extraction

Structured data extraction from tables plays a crucial role in document image analysis for scanned documents and digital archives. Although many methods have been proposed to detect table structures and extract cell contents, accurately…

Computer Vision and Pattern Recognition · Computer Science 2025-12-25 Suren Bandara

Table Structure Recognition using Top-Down and Bottom-Up Cues

Tables are information-rich structured objects in document images. While significant work has been done in localizing tables as graphic objects in document images, only limited attempts exist on table structure recognition. Most existing…

Computer Vision and Pattern Recognition · Computer Science 2020-10-12 Sachin Raja , Ajoy Mondal , C. V. Jawahar

Guided Table Structure Recognition through Anchor Optimization

This paper presents the novel approach towards table structure recognition by leveraging the guided anchors. The concept differs from current state-of-the-art approaches for table structure recognition that naively apply object detection…

Computer Vision and Pattern Recognition · Computer Science 2021-04-22 Khurram Azeem Hashmi , Didier Stricker , Marcus Liwicki , Muhammad Noman Afzal , Muhammad Zeshan Afzal

UniTabNet: Bridging Vision and Language Models for Enhanced Table Structure Recognition

In the digital era, table structure recognition technology is a critical tool for processing and analyzing large volumes of tabular data. Previous methods primarily focus on visual aspects of table structure recovery but often fail to…

Computer Vision and Pattern Recognition · Computer Science 2024-09-23 Zhenrong Zhang , Shuhang Liu , Pengfei Hu , Jiefeng Ma , Jun Du , Jianshu Zhang , Yu Hu

TableCenterNet: A one-stage network for table structure recognition

Table structure recognition aims to parse tables in unstructured data into machine-understandable formats. Recent methods address this problem through a two-stage process or optimized one-stage approaches. However, these methods either…

Computer Vision and Pattern Recognition · Computer Science 2025-05-13 Anyi Xiao , Cihui Yang

TDATR: Improving End-to-End Table Recognition via Table Detail-Aware Learning and Cell-Level Visual Alignment

Tables are pervasive in diverse documents, making table recognition (TR) a fundamental task in document analysis. Existing modular TR pipelines separately model table structure and content, leading to suboptimal integration and complex…

Computer Vision and Pattern Recognition · Computer Science 2026-03-25 Chunxia Qin , Chenyu Liu , Pengcheng Xia , Jun Du , Baocai Yin , Bing Yin , Cong Liu

TableFormer: Table Structure Understanding with Transformers

Tables organize valuable content in a concise and compact representation. This content is extremely valuable for systems such as search engines, Knowledge Graph's, etc, since they enhance their predictive capabilities. Unfortunately, tables…

Computer Vision and Pattern Recognition · Computer Science 2022-03-14 Ahmed Nassar , Nikolaos Livathinos , Maksym Lysak , Peter Staar

Visual Understanding of Complex Table Structures from Document Images

Table structure recognition is necessary for a comprehensive understanding of documents. Tables in unstructured business documents are tough to parse due to the high diversity of layouts, varying alignments of contents, and the presence of…

Computer Vision and Pattern Recognition · Computer Science 2021-11-16 Sachin Raja , Ajoy Mondal , C V Jawahar

LGPMA: Complicated Table Structure Recognition with Local and Global Pyramid Mask Alignment

Table structure recognition is a challenging task due to the various structures and complicated cell spanning relations. Previous methods handled the problem starting from elements in different granularities (rows/columns, text regions),…

Computer Vision and Pattern Recognition · Computer Science 2022-04-21 Liang Qiao , Zaisheng Li , Zhanzhan Cheng , Peng Zhang , Shiliang Pu , Yi Niu , Wenqi Ren , Wenming Tan , Fei Wu

Beyond Static Visual Tokens: Structured Sequential Visual Chain-of-Thought Reasoning

Current multimodal LLMs encode images as static visual prefixes and rely on text-based reasoning, lacking goal-driven and adaptive visual access. Inspired by human visual perception-where attention is selectively and sequentially shifted…

Computer Vision and Pattern Recognition · Computer Science 2026-03-31 Guangfu Guo , Xiaoqian Lu , Yue Feng , Mingming Sun

LORE: Logical Location Regression Network for Table Structure Recognition

Table structure recognition (TSR) aims at extracting tables in images into machine-understandable formats. Recent methods solve this problem by predicting the adjacency relations of detected cell boxes, or learning to generate the…

Computer Vision and Pattern Recognition · Computer Science 2023-03-08 Hangdi Xing , Feiyu Gao , Rujiao Long , Jiajun Bu , Qi Zheng , Liangcheng Li , Cong Yao , Zhi Yu

Beyond Embeddings: The Promise of Visual Table in Visual Reasoning

Visual representation learning has been a cornerstone in computer vision, involving typical forms such as visual embeddings, structural symbols, and text-based representations. Despite the success of CLIP-type visual embeddings, they often…

Computer Vision and Pattern Recognition · Computer Science 2024-06-18 Yiwu Zhong , Zi-Yuan Hu , Michael R. Lyu , Liwei Wang

Split, embed and merge: An accurate table structure recognizer

Table structure recognition is an essential part for making machines understand tables. Its main task is to recognize the internal structure of a table. However, due to the complexity and diversity in their structure and style, it is very…

Computer Vision and Pattern Recognition · Computer Science 2022-02-01 Zhenrong Zhang , Jianshu Zhang , Jun Du

Multi-Cell Decoder and Mutual Learning for Table Structure and Character Recognition

Extracting table contents from documents such as scientific papers and financial reports and converting them into a format that can be processed by large language models is an important task in knowledge information processing. End-to-end…

Computer Vision and Pattern Recognition · Computer Science 2024-05-14 Takaya Kawakatsu

Table Structure Extraction with Bi-directional Gated Recurrent Unit Networks

Tables present summarized and structured information to the reader, which makes table structure extraction an important part of document understanding applications. However, table structure identification is a hard problem not only because…

Computer Vision and Pattern Recognition · Computer Science 2020-02-07 Saqib Ali Khan , Syed Muhammad Daniyal Khalid , Muhammad Ali Shahzad , Faisal Shafait

An End-to-End Multi-Task Learning Model for Image-based Table Recognition

Image-based table recognition is a challenging task due to the diversity of table styles and the complexity of table structures. Most of the previous methods focus on a non-end-to-end approach which divides the problem into two separate…

Computer Vision and Pattern Recognition · Computer Science 2023-03-30 Nam Tuan Ly , Atsuhiro Takasu