Related papers: TableFormer: Robust Transformer Modeling for Table…

TableFormer: Table Structure Understanding with Transformers

Tables organize valuable content in a concise and compact representation. This content is extremely valuable for systems such as search engines, Knowledge Graphs, etc., since they enhance their predictive capabilities. Unfortunately, tables…

Computer Vision and Pattern Recognition · Computer Science 2022-03-14 Ahmed Nassar, Nikolaos Livathinos, Maksym Lysak, Peter Staar

Structural Deep Encoding for Table Question Answering

Although Transformers-based architectures excel at processing textual information, their naive adaptation for tabular data often involves flattening the table structure. This simplification can lead to the loss of essential…

Computation and Language · Computer Science 2025-03-04 Raphaël Mouravieff, Benjamin Piwowarski, Sylvain Lamprier

TabTreeFormer: Tabular Data Generation Using Hybrid Tree-Transformer

Transformers have shown impressive results in tabular data generation. However, they lack domain-specific inductive biases which are critical for preserving the intrinsic characteristics of tabular data. They also suffer from poor…

Machine Learning · Computer Science 2025-05-19 Jiayu Li, Bingyin Zhao, Zilong Zhao, Uzair Javaid, Kevin Yee, Biplab Sikdar

Robust (Controlled) Table-to-Text Generation with Structure-Aware Equivariance Learning

Controlled table-to-text generation seeks to generate natural language descriptions for highlighted subparts of a table. Previous SOTA systems still employ a sequence-to-sequence generation method, which merely captures the table as a…

Computation and Language · Computer Science 2022-05-10 Fei Wang, Zhewei Xu, Pedro Szekely, Muhao Chen

Table-To-Text generation and pre-training with TabT5

Encoder-only transformer models have been successfully applied to different table understanding tasks, as in TAPAS (Herzig et al., 2020). A major limitation of these architectures is that they are constrained to classification-like tasks…

Computation and Language · Computer Science 2022-10-18 Ewa Andrejczuk, Julian Martin Eisenschlos, Francesco Piccinno, Syrine Krichene, Yasemin Altun

TabTransformer: Tabular Data Modeling Using Contextual Embeddings

We propose TabTransformer, a novel deep tabular data modeling architecture for supervised and semi-supervised learning. The TabTransformer is built upon self-attention based Transformers. The Transformer layers transform the embeddings of…

Machine Learning · Computer Science 2020-12-15 Xin Huang, Ashish Khetan, Milan Cvitkovic, Zohar Karnin

Learning Causal Orderings for In-Context Tabular Prediction

In-context learning for tabular data sets strong predictive standards in observational settings; it however primarily relies on correlational structure, which becomes unreliable under distribution shift or intervention. While established…

Machine Learning · Computer Science 2026-05-22 Sascha Xu, Sarah Mameche, Jilles Vreeken

Tabular Learning: Encoding for Entity and Context Embeddings

Examining the effect of different encoding techniques on entity and context embeddings, the goal of this work is to challenge commonly used Ordinal encoding for tabular learning. Applying different preprocessing methods and network…

Machine Learning · Computer Science 2024-03-29 Fredy Reusser

STable: Table Generation Framework for Encoder-Decoder Models

The output structure of database-like tables, consisting of values structured in horizontal rows and vertical columns identifiable by name, can cover a wide range of NLP tasks. Following this constatation, we propose a framework for…

Computation and Language · Computer Science 2022-10-13 Michał Pietruszka, Michał Turski, Łukasz Borchmann, Tomasz Dwojak, Gabriela Pałka, Karolina Szyndler, Dawid Jurkiewicz, Łukasz Garncarek

TableMaster: A Recipe to Advance Table Understanding with Language Models

Tables serve as a fundamental format for representing structured relational data. While current language models (LMs) excel at many text-based tasks, they still face challenges in table understanding due to the complex characteristics of…

Computation and Language · Computer Science 2026-04-16 Lang Cao, Hanbing Liu

SQLformer: Deep Auto-Regressive Query Graph Generation for Text-to-SQL Translation

In recent years, the task of text-to-SQL translation, which converts natural language questions into executable SQL queries, has gained significant attention for its potential to democratize data access. Despite its promise, challenges such…

Computation and Language · Computer Science 2024-05-28 Adrián Bazaga, Pietro Liò, Gos Micklem

TUTA: Tree-based Transformers for Generally Structured Table Pre-training

Tables are widely used with various structures to organize and present data. Recent attempts on table understanding mainly focus on relational tables, yet overlook other common table structures. In this paper, we propose TUTA, a unified…

Information Retrieval · Computer Science 2021-07-21 Zhiruo Wang, Haoyu Dong, Ran Jia, Jia Li, Zhiyi Fu, Shi Han, Dongmei Zhang

TempoFormer: A Transformer for Temporally-aware Representations in Change Detection

Dynamic representation learning plays a pivotal role in understanding the evolution of linguistic content over time. On this front both context and time dynamics as well as their interplay are of prime importance. Current approaches model…

Computation and Language · Computer Science 2024-10-23 Talia Tseriotou, Adam Tsakalidis, Maria Liakata

TABLET: Table Structure Recognition using Encoder-only Transformers

To address the challenges of table structure recognition, we propose a novel Split-Merge-based top-down model optimized for large, densely populated tables. Our approach formulates row and column splitting as sequence labeling tasks,…

Computer Vision and Pattern Recognition · Computer Science 2025-10-20 Qiyu Hou, Jun Wang

TURL: Table Understanding through Representation Learning

Relational tables on the Web store a vast amount of knowledge. Owing to the wealth of such tables, there has been tremendous progress on a variety of tasks in the area of table understanding. However, existing work generally relies on…

Information Retrieval · Computer Science 2020-12-04 Xiang Deng, Huan Sun, Alyssa Lees, You Wu, Cong Yu

Rethinking Tabular Data Understanding with Large Language Models

Large Language Models (LLMs) have shown to be capable of various tasks, yet their capability in interpreting and reasoning over tabular data remains an underexplored area. In this context, this study investigates from three core…

Computation and Language · Computer Science 2023-12-29 Tianyang Liu, Fei Wang, Muhao Chen

High-Performance Transformers for Table Structure Recognition Need Early Convolutions

Table structure recognition (TSR) aims to convert tabular images into a machine-readable format, where a visual encoder extracts image features and a textual decoder generates table-representing tokens. Existing approaches use classic…

Computer Vision and Pattern Recognition · Computer Science 2023-11-10 ShengYun Peng, Seongmin Lee, Xiaojing Wang, Rajarajeswari Balasubramaniyan, Duen Horng Chau

TabGLM: Tabular Graph Language Model for Learning Transferable Representations Through Multi-Modal Consistency Minimization

Handling heterogeneous data in tabular datasets poses a significant challenge for deep learning models. While attention-based architectures and self-supervised learning have achieved notable success, their application to tabular data…

Machine Learning · Computer Science 2025-02-27 Anay Majee, Maria Xenochristou, Wei-Peng Chen

NLAFormer: Transformers Learn Numerical Linear Algebra Operations

Transformers are effective and efficient at modeling complex relationships and learning patterns from structured data in many applications. The main aim of this paper is to propose and design NLAFormer, which is a transformer-based…

Numerical Analysis · Mathematics 2025-08-28 Zhantao Ma, Yihang Gao, Michael K. Ng

TransTab: Learning Transferable Tabular Transformers Across Tables

Tabular data (or tables) are the most widely used data format in machine learning (ML). However, ML models often assume the table structure keeps fixed in training and testing. Before ML modeling, heavy data cleaning is required to merge…

Machine Learning · Computer Science 2022-09-19 Zifeng Wang, Jimeng Sun