Related papers: Optimized Table Tokenization for Table Structure R…

Innovative tokenisation of structured data for LLM training

Data representation remains a fundamental challenge in machine learning, particularly when adapting sequence-based architectures like Transformers and Large Language Models (LLMs) for structured tabular data. Existing methods often fail to…

Machine Learning · Computer Science 2025-08-05 Kayvan Karim , Hani Ragab Hassen. Hadj Batatia

TableFormer: Table Structure Understanding with Transformers

Tables organize valuable content in a concise and compact representation. This content is extremely valuable for systems such as search engines, Knowledge Graph's, etc, since they enhance their predictive capabilities. Unfortunately, tables…

Computer Vision and Pattern Recognition · Computer Science 2022-03-14 Ahmed Nassar , Nikolaos Livathinos , Maksym Lysak , Peter Staar

TC-OCR: TableCraft OCR for Efficient Detection & Recognition of Table Structure & Content

The automatic recognition of tabular data in document images presents a significant challenge due to the diverse range of table styles and complex structures. Tables offer valuable content representation, enhancing the predictive…

Computer Vision and Pattern Recognition · Computer Science 2024-04-22 Avinash Anand , Raj Jaiswal , Pijush Bhuyan , Mohit Gupta , Siddhesh Bangar , Md. Modassir Imam , Rajiv Ratn Shah , Shin'ichi Satoh

Form2Seq : A Framework for Higher-Order Form Structure Extraction

Document structure extraction has been a widely researched area for decades with recent works performing it as a semantic segmentation task over document images using fully-convolution networks. Such methods are limited by image resolution…

Machine Learning · Computer Science 2021-07-12 Milan Aggarwal , Hiresh Gupta , Mausoom Sarkar , Balaji Krishnamurthy

TabEmb: Joint Semantic-Structure Embedding for Table Annotation

Table annotation is crucial for making web and enterprise tables usable in downstream NLP applications. Unlike textual data where learning semantically rich token or sentence embeddings often suffice, tables are structured combinations of…

Machine Learning · Computer Science 2026-04-22 Ehsan Hoseinzade , Ke Wang , Anandharaju Durai Raju

InstructTable: Improving Table Structure Recognition Through Instructions

Table structure recognition (TSR) holds widespread practical importance by parsing tabular images into structured representations, yet encounters significant challenges when processing complex layouts involving merged or empty cells.…

Computer Vision and Pattern Recognition · Computer Science 2026-04-20 Boming Chen , Zining Wang , Zhentao Guo , Jianqiang Liu , Chen Duan , Yu Gu , Kai zhou , Pengfei Yan

Table Structure Recognition using Top-Down and Bottom-Up Cues

Tables are information-rich structured objects in document images. While significant work has been done in localizing tables as graphic objects in document images, only limited attempts exist on table structure recognition. Most existing…

Computer Vision and Pattern Recognition · Computer Science 2020-10-12 Sachin Raja , Ajoy Mondal , C. V. Jawahar

Tables to LaTeX: structure and content extraction from scientific tables

Scientific documents contain tables that list important information in a concise fashion. Structure and content extraction from tables embedded within PDF research documents is a very challenging task due to the existence of visual features…

Information Retrieval · Computer Science 2022-11-01 Pratik Kayal , Mrinal Anand , Harsh Desai , Mayank Singh

TableSeq: Unified Generation of Structure, Content, and Layout

We present TableSeq, an image-only, end-to-end framework for joint table structure recognition, content recognition, and cell localization. The model formulates these tasks as a single sequence-generation problem: one decoder produces an…

Computer Vision and Pattern Recognition · Computer Science 2026-04-20 Laziz Hamdi , Amine Tamasna , Pascal Boisson , Thierry Paquet

UniTabNet: Bridging Vision and Language Models for Enhanced Table Structure Recognition

In the digital era, table structure recognition technology is a critical tool for processing and analyzing large volumes of tabular data. Previous methods primarily focus on visual aspects of table structure recovery but often fail to…

Computer Vision and Pattern Recognition · Computer Science 2024-09-23 Zhenrong Zhang , Shuhang Liu , Pengfei Hu , Jiefeng Ma , Jun Du , Jianshu Zhang , Yu Hu

Table Structure Recognition with Conditional Attention

Tabular data in digital documents is widely used to express compact and important information for readers. However, it is challenging to parse tables from unstructured digital documents, such as PDFs and images, into machine-readable format…

Computer Vision and Pattern Recognition · Computer Science 2022-03-09 Bin Xiao , Murat Simsek , Burak Kantarci , Ala Abu Alkheir

Visual Understanding of Complex Table Structures from Document Images

Table structure recognition is necessary for a comprehensive understanding of documents. Tables in unstructured business documents are tough to parse due to the high diversity of layouts, varying alignments of contents, and the presence of…

Computer Vision and Pattern Recognition · Computer Science 2021-11-16 Sachin Raja , Ajoy Mondal , C V Jawahar

High-Performance Transformers for Table Structure Recognition Need Early Convolutions

Table structure recognition (TSR) aims to convert tabular images into a machine-readable format, where a visual encoder extracts image features and a textual decoder generates table-representing tokens. Existing approaches use classic…

Computer Vision and Pattern Recognition · Computer Science 2023-11-10 ShengYun Peng , Seongmin Lee , Xiaojing Wang , Rajarajeswari Balasubramaniyan , Duen Horng Chau

Aligning benchmark datasets for table structure recognition

Benchmark datasets for table structure recognition (TSR) must be carefully processed to ensure they are annotated consistently. However, even if a dataset's annotations are self-consistent, there may be significant inconsistency across…

Computer Vision and Pattern Recognition · Computer Science 2023-05-25 Brandon Smock , Rohith Pesala , Robin Abraham

OTSeq2Set: An Optimal Transport Enhanced Sequence-to-Set Model for Extreme Multi-label Text Classification

Extreme multi-label text classification (XMTC) is the task of finding the most relevant subset labels from an extremely large-scale label collection. Recently, some deep learning models have achieved state-of-the-art results in XMTC tasks.…

Computation and Language · Computer Science 2022-11-29 Jie Cao , Yin Zhang

Table-to-Text Generation with Effective Hierarchical Encoder on Three Dimensions (Row, Column and Time)

Although Seq2Seq models for table-to-text generation have achieved remarkable progress, modeling table representation in one dimension is inadequate. This is because (1) the table consists of multiple rows and columns, which means that…

Computation and Language · Computer Science 2019-09-06 Heng Gong , Xiaocheng Feng , Bing Qin , Ting Liu

TableCenterNet: A one-stage network for table structure recognition

Table structure recognition aims to parse tables in unstructured data into machine-understandable formats. Recent methods address this problem through a two-stage process or optimized one-stage approaches. However, these methods either…

Computer Vision and Pattern Recognition · Computer Science 2025-05-13 Anyi Xiao , Cihui Yang

Evaluating Structured Decoding for Text-to-Table Generation: Evidence from Three Datasets

We present a comprehensive evaluation of structured decoding for text-to-table generation with large language models (LLMs). While previous work has primarily focused on unconstrained generation of tables, the impact of enforcing structural…

Computation and Language · Computer Science 2025-08-25 Julian Oestreich , Lydia Müller

Token Optimization Strategies for LLM-Based Oracle-to-PostgreSQL Migration

LLMs are increasingly used for software modernization, code translation, and database migration. However, LLM-based Oracle2PostgreSQL migration remains constrained by high token consumption, long-context degradation, dialect-specific…

Logic in Computer Science · Computer Science 2026-05-28 Oleg Grynets , Dmytro Babarytskyi , Vasyl Lyashkevych

Multi-Type-TD-TSR -- Extracting Tables from Document Images using a Multi-stage Pipeline for Table Detection and Table Structure Recognition: from OCR to Structured Table Representations

As global trends are shifting towards data-driven industries, the demand for automated algorithms that can convert digital images of scanned documents into machine readable information is rapidly growing. Besides the opportunity of data…

Computer Vision and Pattern Recognition · Computer Science 2021-05-25 Pascal Fischer , Alen Smajic , Alexander Mehler , Giuseppe Abrami