English
Related papers

Related papers: Making Table Understanding Work in Practice

200 papers

Understanding the semantics of relational tables is instrumental for automation in data exploration and preparation systems. A key source for understanding a table is the semantics of its columns. With the rise of deep learning, learned…

Databases · Computer Science 2023-11-27 Madelon Hulsebos , Paul Groth , Çağatay Demiralp

Tables have gained significant attention in large language models (LLMs) and multimodal large language models (MLLMs) due to their complex and flexible structure. Unlike linear text inputs, tables are two-dimensional, encompassing formats…

Computation and Language · Computer Science 2025-08-04 Xiaofeng Wu , Alan Ritter , Wei Xu

Our work addresses the challenges of understanding tables. Existing methods often struggle with the unpredictable nature of table content, leading to a reliance on preprocessing and keyword matching. They also face limitations due to the…

Computation and Language · Computer Science 2025-08-26 Thi-Nhung Nguyen , Hoang Ngo , Dinh Phung , Thuy-Trang Vu , Dat Quoc Nguyen

Tables serve as a fundamental format for representing structured relational data. While current language models (LMs) excel at many text-based tasks, they still face challenges in table understanding due to the complex characteristics of…

Computation and Language · Computer Science 2026-04-16 Lang Cao , Hanbing Liu

Detecting the semantic types of data columns in relational tables is important for various data preparation and information retrieval tasks such as data cleaning, schema matching, data discovery, and semantic search. However, existing…

Databases · Computer Science 2020-06-04 Dan Zhang , Yoshihiko Suhara , Jinfeng Li , Madelon Hulsebos , Çağatay Demiralp , Wang-Chiew Tan

Conventional text-to-SQL parsers are not good at synthesizing complex SQL queries that involve multiple tables or columns, due to the challenges inherent in identifying the correct schema items and performing accurate alignment between…

Computation and Language · Computer Science 2024-03-18 Yangjun Wu , Han Wang

This study addresses the challenge of detecting semantic column types in relational tables, a key task in many real-world applications. While language models like BERT have improved prediction accuracy, their token input constraints limit…

Machine Learning · Computer Science 2024-05-02 Ehsan Hoseinzade , Ke Wang

The usefulness of tabular data such as web tables critically depends on understanding their semantics. This study focuses on column type prediction for tables without any meta data. Unlike traditional lexical matching-based methods, we…

Databases · Computer Science 2019-06-04 Jiaoyan Chen , Ernesto Jimenez-Ruiz , Ian Horrocks , Charles Sutton

Many organizations rely on data from government and third-party sources, and those sources rarely follow the same data formatting. This introduces challenges in integrating data from multiple sources or aligning external sources with…

Databases · Computer Science 2023-12-27 Arash Dargahi Nobari , Davood Rafiei

Tables on the Web contain a vast amount of knowledge in a structured form. To tap into this valuable resource, we address the problem of table retrieval: answering an information need with a ranked list of tables. We investigate this…

Information Retrieval · Computer Science 2021-05-14 Shuo Zhang , Krisztian Balog

Table extraction is an important but still unsolved problem. In this paper, we introduce a flexible and modular table extraction system. We develop two rule-based algorithms that perform the complete table recognition process, including…

Computer Vision and Pattern Recognition · Computer Science 2021-12-03 Marcin Namysl , Alexander M. Esser , Sven Behnke , Joachim Köhler

Although great progress has been made by previous table understanding methods including recent approaches based on large language models (LLMs), they rely heavily on the premise that given tables must be converted into a certain text…

Computation and Language · Computer Science 2024-06-13 Mingyu Zheng , Xinwei Feng , Qingyi Si , Qiaoqiao She , Zheng Lin , Wenbin Jiang , Weiping Wang

Documents are core carriers of information and knowl-edge, with broad applications in finance, healthcare, and scientific research. Tables, as the main medium for structured data, encapsulate key information and are among the most critical…

Computation and Language · Computer Science 2025-08-15 Xuan Li , Jialiang Dong , Raymond Wong

The first phase of table recognition is to detect the tabular area in a document. Subsequently, the tabular structures are recognized in the second phase in order to extract information from the respective cells. Table detection and…

Computer Vision and Pattern Recognition · Computer Science 2021-05-11 Khurram Azeem Hashmi , Marcus Liwicki , Didier Stricker , Muhammad Adnan Afzal , Muhammad Ahtsham Afzal , Muhammad Zeshan Afzal

While Large Language Models (LLMs) are increasingly deployed for table-related tasks, the internal mechanisms enabling them to process linearized two-dimensional structured tables remain opaque. In this work, we investigate the process of…

Computation and Language · Computer Science 2026-02-10 Xuanliang Zhang , Dingzirui Wang , Keyan Xu , Qingfu Zhu , Wanxiang Che

Language Models (LLMs) are increasingly explored in the telecom industry to support engineering tasks, accelerate troubleshooting, and assist in interpreting complex technical documents. However, recent studies show that LLMs perform poorly…

Computation and Language · Computer Science 2026-01-09 Anas Ezzakri , Nicola Piovesan , Mohamed Sana , Antonio De Domenico , Fadhel Ayed , Haozhe Zhang

Over the past few years, table interpretation tasks have made significant progress due to their importance and the introduction of new technologies and benchmarks in the field. This work experiments with a hybrid approach for detecting…

Computation and Language · Computer Science 2025-08-18 Panagiotis Koletsis , Christos Panagiotopoulos , Georgios Th. Papadopoulos , Vasilis Efthymiou

Since a vast number of tables can be easily collected from web pages, spreadsheets, PDFs, and various other document types, a flurry of table pre-training frameworks have been proposed following the success of text and images, and they have…

Computation and Language · Computer Science 2022-05-02 Haoyu Dong , Zhoujun Cheng , Xinyi He , Mengyu Zhou , Anda Zhou , Fan Zhou , Ao Liu , Shi Han , Dongmei Zhang

Table structure recognition is necessary for a comprehensive understanding of documents. Tables in unstructured business documents are tough to parse due to the high diversity of layouts, varying alignments of contents, and the presence of…

Computer Vision and Pattern Recognition · Computer Science 2021-11-16 Sachin Raja , Ajoy Mondal , C V Jawahar

Table recognition is using the computer to automatically understand the table, to detect the position of the table from the document or picture, and to correctly extract and identify the internal structure and content of the table. After…

Computer Vision and Pattern Recognition · Computer Science 2023-12-11 Shi Jiyuan , Shi chunqi
‹ Prev 1 2 3 10 Next ›