Related papers: Transfer Learning with Deep Tabular Models

Deep Learning with Tabular Data: A Self-supervised Approach

We have described a novel approach for training tabular data using the TabTransformer model with self-supervised learning. Traditional machine learning models for tabular data, such as GBDT are being widely used though our paper examines…

Machine Learning · Computer Science 2024-01-30 Tirth Kiranbhai Vyas

Unlocking the Transferability of Tokens in Deep Models for Tabular Data

Fine-tuning a pre-trained deep neural network has become a successful paradigm in various machine learning tasks. However, such a paradigm becomes particularly challenging with tabular data when there are discrepancies between the feature…

Machine Learning · Computer Science 2023-10-24 Qi-Le Zhou , Han-Jia Ye , Le-Ye Wang , De-Chuan Zhan

Revisiting Pretraining Objectives for Tabular Deep Learning

Recent deep learning models for tabular data currently compete with the traditional ML models based on decision trees (GBDT). Unlike GBDT, deep models can additionally benefit from pretraining, which is a workhorse of DL for vision and NLP.…

Machine Learning · Computer Science 2022-07-13 Ivan Rubachev , Artem Alekberov , Yury Gorishniy , Artem Babenko

Representation Learning for Tabular Data: A Comprehensive Survey

Tabular data, structured as rows and columns, is among the most prevalent data types in machine learning classification and regression applications. Models for learning from tabular data have continuously evolved, with Deep Neural Networks…

Machine Learning · Computer Science 2025-04-24 Jun-Peng Jiang , Si-Yang Liu , Hao-Run Cai , Qile Zhou , Han-Jia Ye

Transformers with Stochastic Competition for Tabular Data Modelling

Despite the prevalence and significance of tabular data across numerous industries and fields, it has been relatively underexplored in the realm of deep learning. Even today, neural networks are often overshadowed by techniques such as…

Machine Learning · Computer Science 2024-07-19 Andreas Voskou , Charalambos Christoforou , Sotirios Chatzis

Fine-Tuning the Retrieval Mechanism for Tabular Deep Learning

While interests in tabular deep learning has significantly grown, conventional tree-based models still outperform deep learning methods. To narrow this performance gap, we explore the innovative retrieval mechanism, a methodology that…

Machine Learning · Computer Science 2023-11-14 Felix den Breejen , Sangmin Bae , Stephen Cha , Tae-Young Kim , Seoung Hyun Koh , Se-Young Yun

Simple Modifications to Improve Tabular Neural Networks

There is growing interest in neural network architectures for tabular data. Many general-purpose tabular deep learning models have been introduced recently, with performance sometimes rivaling gradient boosted decision trees (GBDTs). These…

Machine Learning · Computer Science 2021-08-10 James Fiedler

Understanding and Improving Transfer Learning of Deep Models via Neural Collapse

With the ever-increasing complexity of large-scale pre-trained models coupled with a shortage of labeled data for downstream training, transfer learning has become the primary approach in many fields, including natural language processing,…

Machine Learning · Computer Science 2024-07-22 Xiao Li , Sheng Liu , Jinxin Zhou , Xinyu Lu , Carlos Fernandez-Granda , Zhihui Zhu , Qing Qu

When Do Neural Nets Outperform Boosted Trees on Tabular Data?

Tabular data is one of the most commonly used types of data in machine learning. Despite recent advances in neural nets (NNs) for tabular data, there is still an active discussion on whether or not NNs generally outperform gradient-boosted…

Machine Learning · Computer Science 2024-07-17 Duncan McElfresh , Sujay Khandagale , Jonathan Valverde , Vishak Prasad C , Benjamin Feuer , Chinmay Hegde , Ganesh Ramakrishnan , Micah Goldblum , Colin White

Modular Deep Learning

Transfer learning has recently become the dominant paradigm of machine learning. Pre-trained models fine-tuned for downstream tasks achieve better performance with fewer labelled examples. Nonetheless, it remains unclear how to develop…

Machine Learning · Computer Science 2024-01-30 Jonas Pfeiffer , Sebastian Ruder , Ivan Vulić , Edoardo Maria Ponti

When & How to Transfer with Transfer Learning

In deep learning, transfer learning (TL) has become the de facto approach when dealing with image related tasks. Visual features learnt for one task have been shown to be reusable for other tasks, improving performance significantly. By…

Computer Vision and Pattern Recognition · Computer Science 2022-11-09 Adrian Tormos , Dario Garcia-Gasulla , Victor Gimenez-Abalos , Sergio Alvarez-Napagao

TabReD: Analyzing Pitfalls and Filling the Gaps in Tabular Deep Learning Benchmarks

Advances in machine learning research drive progress in real-world applications. To ensure this progress, it is important to understand the potential pitfalls on the way from a novel method's success on academic benchmarks to its practical…

Machine Learning · Computer Science 2024-10-25 Ivan Rubachev , Nikolay Kartashev , Yury Gorishniy , Artem Babenko

Unveiling the Role of Data Uncertainty in Tabular Deep Learning

Recent advancements in tabular deep learning have demonstrated exceptional practical performance, yet the field often lacks a clear understanding of why these techniques actually succeed. To address this gap, our paper highlights the…

Machine Learning · Computer Science 2025-09-05 Nikolay Kartashev , Ivan Rubachev , Artem Babenko

VisTabNet: Adapting Vision Transformers for Tabular Data

Although deep learning models have had great success in natural language processing and computer vision, we do not observe comparable improvements in the case of tabular data, which is still the most common data type used in biological,…

Machine Learning · Computer Science 2025-04-28 Witold Wydmański , Ulvi Movsum-zada , Jacek Tabor , Marek Śmieja

TabR: Tabular Deep Learning Meets Nearest Neighbors in 2023

Deep learning (DL) models for tabular data problems (e.g. classification, regression) are currently receiving increasingly more attention from researchers. However, despite the recent efforts, the non-DL algorithms based on gradient-boosted…

Machine Learning · Computer Science 2023-10-27 Yury Gorishniy , Ivan Rubachev , Nikolay Kartashev , Daniil Shlenskii , Akim Kotelnikov , Artem Babenko

Transfer Learning of Tabular Data by Finetuning Large Language Models

Despite the artificial intelligence (AI) revolution, deep learning has yet to achieve much success with tabular data due to heterogeneous feature space and limited sample sizes without viable transfer learning. The new era of generative AI,…

Machine Learning · Computer Science 2025-01-14 Shourav B. Rabbani , Ibna Kowsar , Manar D. Samad

Improving Transferability of Deep Neural Networks

Learning from small amounts of labeled data is a challenge in the area of deep learning. This is currently addressed by Transfer Learning where one learns the small data set as a transfer task from a larger source dataset. Transfer Learning…

Computer Vision and Pattern Recognition · Computer Science 2018-07-31 Parijat Dube , Bishwaranjan Bhattacharjee , Elisabeth Petit-Bois , Matthew Hill

A Survey on Deep Transfer Learning

As a new classification platform, deep learning has recently received increasing attention from researchers and has been successfully applied to many domains. In some domains, like bioinformatics and robotics, it is very difficult to…

Machine Learning · Computer Science 2018-08-13 Chuanqi Tan , Fuchun Sun , Tao Kong , Wenchang Zhang , Chao Yang , Chunfang Liu

A Performance-Driven Benchmark for Feature Selection in Tabular Deep Learning

Academic tabular benchmarks often contain small sets of curated features. In contrast, data scientists typically collect as many features as possible into their datasets, and even engineer new features from existing ones. To prevent…

Machine Learning · Computer Science 2023-11-13 Valeriia Cherepanova , Roman Levin , Gowthami Somepalli , Jonas Geiping , C. Bayan Bruss , Andrew Gordon Wilson , Tom Goldstein , Micah Goldblum

Deep Learning within Tabular Data: Foundations, Challenges, Advances and Future Directions

Tabular data remains one of the most prevalent data types across a wide range of real-world applications, yet effective representation learning for this domain poses unique challenges due to its irregular patterns, heterogeneous feature…

Machine Learning · Computer Science 2025-01-08 Weijieying Ren , Tianxiang Zhao , Yuqing Huang , Vasant Honavar