English
Related papers

Related papers: TabAug: Data Driven Augmentation for Enhanced Tabl…

200 papers

Incorporating external knowledge bases in traditional retrieval-augmented generation (RAG) relies on parsing the document, followed by querying a language model with the parsed information via in-context learning. While effective for…

Computation and Language · Computer Science 2026-02-03 Jacob Si , Mike Qu , Michelle Lee , Marek Rei , Yingzhen Li

Data augmentations are effective in improving the invariance of learning machines. We argue that the core challenge of data augmentations lies in designing data transformations that preserve labels. This is relatively straightforward for…

Machine Learning · Computer Science 2023-03-01 Youzhi Luo , Michael McThrow , Wing Yee Au , Tao Komikado , Kanji Uchino , Koji Maruhashi , Shuiwang Ji

Tables present summarized and structured information to the reader, which makes table structure extraction an important part of document understanding applications. However, table structure identification is a hard problem not only because…

Computer Vision and Pattern Recognition · Computer Science 2020-02-07 Saqib Ali Khan , Syed Muhammad Daniyal Khalid , Muhammad Ali Shahzad , Faisal Shafait

Considerable research attention has been paid to table detection by developing not only rule-based approaches reliant on hand-crafted heuristics but also deep learning approaches. Although recent studies successfully perform table detection…

Machine Learning · Computer Science 2022-11-15 Hyebin Kwon , Joungbin An , Dongwoo Lee , Won-Yong Shin

In recent years, graph representation learning has achieved remarkable success while suffering from low-quality data problems. As a mature technology to improve data quality in computer vision, data augmentation has also attracted…

Machine Learning · Computer Science 2024-06-24 Jiajun Zhou , Chenxuan Xie , Shengbo Gong , Zhenyu Wen , Xiangyu Zhao , Qi Xuan , Xiaoniu Yang

In recent years, one of the most popular techniques in the computer vision community has been the deep learning technique. As a data-driven technique, deep model requires enormous amounts of accurately labelled training data, which is often…

Computer Vision and Pattern Recognition · Computer Science 2022-10-10 Zihan Yang , Richard O. Sinnott , James Bailey , Qiuhong Ke

Recent work has shown that data augmentation has the potential to significantly improve the generalization of deep learning models. Recently, automated augmentation strategies have led to state-of-the-art results in image classification and…

Computer Vision and Pattern Recognition · Computer Science 2019-11-15 Ekin D. Cubuk , Barret Zoph , Jonathon Shlens , Quoc V. Le

With the widespread use of mobile phones and scanners to photograph and upload documents, the need for extracting the information trapped in unstructured document images such as retail receipts, insurance claim forms and financial invoices…

Computer Vision and Pattern Recognition · Computer Science 2020-01-07 Shubham Paliwal , Vishwanath D , Rohit Rahul , Monika Sharma , Lovekesh Vig

Optimization of image transformation functions for the purpose of data augmentation has been intensively studied. In particular, adversarial data augmentation strategies, which search augmentation maximizing task loss, show significant…

Computer Vision and Pattern Recognition · Computer Science 2022-03-30 Teppei Suzuki

This paper presents the novel approach towards table structure recognition by leveraging the guided anchors. The concept differs from current state-of-the-art approaches for table structure recognition that naively apply object detection…

Computer Vision and Pattern Recognition · Computer Science 2021-04-22 Khurram Azeem Hashmi , Didier Stricker , Marcus Liwicki , Muhammad Noman Afzal , Muhammad Zeshan Afzal

Recent advancements in language models (LMs) have notably enhanced their ability to reason with tabular data, primarily through program-aided mechanisms that manipulate and analyze tables. However, these methods often require the entire…

Data augmentation methods have played an important role in the recent advance of deep learning models, and have become an indispensable component of state-of-the-art models in semi-supervised, self-supervised, and supervised training for…

Computer Vision and Pattern Recognition · Computer Science 2023-05-24 Emirhan Kurtulus , Zichao Li , Yann Dauphin , Ekin Dogus Cubuk

Tabular data is a fundamental form of data structure. The evolution of table analysis tools reflects humanity's continuous progress in data acquisition, management, and processing. The dynamic changes in table columns arise from…

Artificial Intelligence · Computer Science 2026-01-28 Xinda Chen , Zhen Xing , Hanyu Zhang , Weimin Tan , Bo Yan

Deep neural networks have emerged as very successful tools for image restoration and reconstruction tasks. These networks are often trained end-to-end to directly reconstruct an image from a noisy or corrupted measurement of that image. To…

Image and Video Processing · Electrical Eng. & Systems 2021-06-30 Zalan Fabian , Reinhard Heckel , Mahdi Soltanolkotabi

Data preparation, also called data wrangling, is considered one of the most expensive and time-consuming steps when performing analytics or building machine learning models. Preparing data typically involves collecting and merging data from…

Computation and Language · Computer Science 2023-06-22 Michael Glass , Xueqing Wu , Ankita Rajaram Naik , Gaetano Rossiello , Alfio Gliozzo

In this paper, we explore and compare multiple solutions to the problem of data augmentation in image classification. Previous work has demonstrated the effectiveness of data augmentation through simple techniques, such as cropping,…

Computer Vision and Pattern Recognition · Computer Science 2017-12-14 Luis Perez , Jason Wang

Tables are widely used in documents because of their compact and structured representation of information. In particular, in scientific papers, tables can sum up novel discoveries and summarize experimental results, making the research…

Computer Vision and Pattern Recognition · Computer Science 2023-02-21 Davide del Bimbo , Andrea Gemelli , Simone Marinai

The first phase of table recognition is to detect the tabular area in a document. Subsequently, the tabular structures are recognized in the second phase in order to extract information from the respective cells. Table detection and…

Computer Vision and Pattern Recognition · Computer Science 2021-05-11 Khurram Azeem Hashmi , Marcus Liwicki , Didier Stricker , Muhammad Adnan Afzal , Muhammad Ahtsham Afzal , Muhammad Zeshan Afzal

In the digital era, table structure recognition technology is a critical tool for processing and analyzing large volumes of tabular data. Previous methods primarily focus on visual aspects of table structure recovery but often fail to…

Computer Vision and Pattern Recognition · Computer Science 2024-09-23 Zhenrong Zhang , Shuhang Liu , Pengfei Hu , Jiefeng Ma , Jun Du , Jianshu Zhang , Yu Hu

In recent years, deep learning has achieved remarkable achievements in many fields, including computer vision, natural language processing, speech recognition and others. Adequate training data is the key to ensure the effectiveness of the…

Machine Learning · Computer Science 2019-05-24 Chunxu Zhang , Jiaxu Cui , Bo Yang
‹ Prev 1 2 3 10 Next ›