Related papers: Optimized Table Tokenization for Table Structure R…

200 papers

Data representation remains a fundamental challenge in machine learning, particularly when adapting sequence-based architectures like Transformers and Large Language Models (LLMs) for structured tabular data. Existing methods often fail to…

Machine Learning · Computer Science 2025-08-05 Kayvan Karim , Hani Ragab Hassen , Hadj Batatia

Tables organize valuable content in a concise and compact representation. This content is extremely valuable for systems such as search engines, knowledge graphs, etc., since they enhance their predictive capabilities. Unfortunately, tables…

Computer Vision and Pattern Recognition · Computer Science 2022-03-14 Ahmed Nassar , Nikolaos Livathinos , Maksym Lysak , Peter Staar

The automatic recognition of tabular data in document images presents a significant challenge due to the diverse range of table styles and complex structures. Tables offer valuable content representation, enhancing the predictive…

Computer Vision and Pattern Recognition · Computer Science 2024-04-22 Avinash Anand , Raj Jaiswal , Pijush Bhuyan , Mohit Gupta , Siddhesh Bangar , Md. Modassir Imam , Rajiv Ratn Shah , Shin'ichi Satoh

Document structure extraction has been a widely researched area for decades with recent works performing it as a semantic segmentation task over document images using fully-convolution networks. Such methods are limited by image resolution…

Machine Learning · Computer Science 2021-07-12 Milan Aggarwal , Hiresh Gupta , Mausoom Sarkar , Balaji Krishnamurthy

Table annotation is crucial for making web and enterprise tables usable in downstream NLP applications. Unlike textual data where learning semantically rich token or sentence embeddings often suffice, tables are structured combinations of…

Machine Learning · Computer Science 2026-04-22 Ehsan Hoseinzade , Ke Wang , Anandharaju Durai Raju

Table structure recognition (TSR) holds widespread practical importance by parsing tabular images into structured representations, yet encounters significant challenges when processing complex layouts involving merged or empty cells.…

Computer Vision and Pattern Recognition · Computer Science 2026-04-20 Boming Chen , Zining Wang , Zhentao Guo , Jianqiang Liu , Chen Duan , Yu Gu , Kai Zhou , Pengfei Yan

Tables are information-rich structured objects in document images. While significant work has been done in localizing tables as graphic objects in document images, only limited attempts exist on table structure recognition. Most existing…

Computer Vision and Pattern Recognition · Computer Science 2020-10-12 Sachin Raja , Ajoy Mondal , C. V. Jawahar

Scientific documents contain tables that list important information in a concise fashion. Structure and content extraction from tables embedded within PDF research documents is a very challenging task due to the existence of visual features…

Information Retrieval · Computer Science 2022-11-01 Pratik Kayal , Mrinal Anand , Harsh Desai , Mayank Singh

We present TableSeq, an image-only, end-to-end framework for joint table structure recognition, content recognition, and cell localization. The model formulates these tasks as a single sequence-generation problem: one decoder produces an…

Computer Vision and Pattern Recognition · Computer Science 2026-04-20 Laziz Hamdi , Amine Tamasna , Pascal Boisson , Thierry Paquet

In the digital era, table structure recognition technology is a critical tool for processing and analyzing large volumes of tabular data. Previous methods primarily focus on visual aspects of table structure recovery but often fail to…

Computer Vision and Pattern Recognition · Computer Science 2024-09-23 Zhenrong Zhang , Shuhang Liu , Pengfei Hu , Jiefeng Ma , Jun Du , Jianshu Zhang , Yu Hu

Tabular data in digital documents is widely used to express compact and important information for readers. However, it is challenging to parse tables from unstructured digital documents, such as PDFs and images, into machine-readable format…

Computer Vision and Pattern Recognition · Computer Science 2022-03-09 Bin Xiao , Murat Simsek , Burak Kantarci , Ala Abu Alkheir

Table structure recognition is necessary for a comprehensive understanding of documents. Tables in unstructured business documents are tough to parse due to the high diversity of layouts, varying alignments of contents, and the presence of…

Computer Vision and Pattern Recognition · Computer Science 2021-11-16 Sachin Raja , Ajoy Mondal , C V Jawahar

Table structure recognition (TSR) aims to convert tabular images into a machine-readable format, where a visual encoder extracts image features and a textual decoder generates table-representing tokens. Existing approaches use classic…

Computer Vision and Pattern Recognition · Computer Science 2023-11-10 ShengYun Peng , Seongmin Lee , Xiaojing Wang , Rajarajeswari Balasubramaniyan , Duen Horng Chau

Benchmark datasets for table structure recognition (TSR) must be carefully processed to ensure they are annotated consistently. However, even if a dataset's annotations are self-consistent, there may be significant inconsistency across…

Computer Vision and Pattern Recognition · Computer Science 2023-05-25 Brandon Smock , Rohith Pesala , Robin Abraham

Extreme multi-label text classification (XMTC) is the task of finding the most relevant subset labels from an extremely large-scale label collection. Recently, some deep learning models have achieved state-of-the-art results in XMTC tasks.…

Computation and Language · Computer Science 2022-11-29 Jie Cao , Yin Zhang

Although Seq2Seq models for table-to-text generation have achieved remarkable progress, modeling table representation in one dimension is inadequate. This is because (1) the table consists of multiple rows and columns, which means that…

Computation and Language · Computer Science 2019-09-06 Heng Gong , Xiaocheng Feng , Bing Qin , Ting Liu

Table structure recognition aims to parse tables in unstructured data into machine-understandable formats. Recent methods address this problem through a two-stage process or optimized one-stage approaches. However, these methods either…

Computer Vision and Pattern Recognition · Computer Science 2025-05-13 Anyi Xiao , Cihui Yang

We present a comprehensive evaluation of structured decoding for text-to-table generation with large language models (LLMs). While previous work has primarily focused on unconstrained generation of tables, the impact of enforcing structural…

Computation and Language · Computer Science 2025-08-25 Julian Oestreich , Lydia Müller

LLMs are increasingly used for software modernization, code translation, and database migration. However, LLM-based Oracle2PostgreSQL migration remains constrained by high token consumption, long-context degradation, dialect-specific…

Logic in Computer Science · Computer Science 2026-05-28 Oleg Grynets , Dmytro Babarytskyi , Vasyl Lyashkevych

As global trends are shifting towards data-driven industries, the demand for automated algorithms that can convert digital images of scanned documents into machine readable information is rapidly growing. Besides the opportunity of data…

Computer Vision and Pattern Recognition · Computer Science 2021-05-25 Pascal Fischer , Alen Smajic , Alexander Mehler , Giuseppe Abrami