Related papers: HyperFast: Instant Classification for Tabular Data

MotherNet: Fast Training and Inference via Hyper-Network Transformers

Foundation models are transforming machine learning across many modalities, with in-context learning replacing classical model training. Recent work on tabular data hints at a similar opportunity to build foundation models for…

Machine Learning · Computer Science 2025-05-12 Andreas Müller , Carlo Curino , Raghu Ramakrishnan

HyperTab: Hypernetwork Approach for Deep Learning on Small Tabular Datasets

Deep learning has achieved impressive performance in many domains, such as computer vision and natural language processing, but its advantage over classical shallow methods on tabular datasets remains questionable. It is especially…

Machine Learning · Computer Science 2023-08-25 Witold Wydmański , Oleksii Bulenok , Marek Śmieja

Fast and Accurate Zero-Training Classification for Tabular Engineering Data

In engineering design, navigating complex decision-making landscapes demands a thorough exploration of the design, performance, and constraint spaces, often impeded by resource-intensive simulations. Data-driven methods can mitigate this…

Computational Engineering, Finance, and Science · Computer Science 2024-01-17 Cyril Picard , Faez Ahmed

TabPFN: A Transformer That Solves Small Tabular Classification Problems in a Second

We present TabPFN, a trained Transformer that can do supervised classification for small tabular datasets in less than a second, needs no hyperparameter tuning and is competitive with state-of-the-art classification methods. TabPFN performs…

Machine Learning · Computer Science 2023-09-19 Noah Hollmann , Samuel Müller , Katharina Eggensperger , Frank Hutter

Prior-Fitted Networks Scale to Larger Datasets When Treated as Weak Learners

Prior-Fitted Networks (PFNs) have recently been proposed to efficiently perform tabular classification tasks. Although they achieve good performance on small datasets, they encounter limitations with larger datasets. These limitations…

Machine Learning · Computer Science 2025-03-04 Yuxin Wang , Botian Jiang , Yiran Guo , Quan Gan , David Wipf , Xuanjing Huang , Xipeng Qiu

Transfer Learning with Deep Tabular Models

Recent work on deep learning for tabular data demonstrates the strong performance of deep tabular models, often bridging the gap between gradient boosted decision trees and neural networks. Accuracy aside, a major advantage of neural models…

Machine Learning · Computer Science 2023-08-08 Roman Levin , Valeriia Cherepanova , Avi Schwarzschild , Arpit Bansal , C. Bayan Bruss , Tom Goldstein , Andrew Gordon Wilson , Micah Goldblum

TabSTAR: A Tabular Foundation Model for Tabular Data with Text Fields

While deep learning has achieved remarkable success across many domains, it has historically underperformed on tabular learning tasks, which remain dominated by gradient boosting decision trees. However, recent advancements are paving the…

Machine Learning · Computer Science 2025-10-31 Alan Arazi , Eilam Shapira , Roi Reichart

Deep Neural Networks and Tabular Data: A Survey

Heterogeneous tabular data are the most commonly used form of data and are essential for numerous critical and computationally demanding applications. On homogeneous data sets, deep neural networks have repeatedly shown excellent…

Machine Learning · Computer Science 2023-01-24 Vadim Borisov , Tobias Leemann , Kathrin Seßler , Johannes Haug , Martin Pawelczyk , Gjergji Kasneci

HyperJump: Accelerating HyperBand via Risk Modelling

In the literature on hyper-parameter tuning, a number of recent solutions rely on low-fidelity observations (e.g., training with sub-sampled datasets) in order to efficiently identify promising configurations to be then tested via…

Machine Learning · Computer Science 2022-12-05 Pedro Mendes , Maria Casimiro , Paolo Romano , David Garlan

A Full-Stack Search Technique for Domain Optimized Deep Learning Accelerators

The rapidly-changing deep learning landscape presents a unique opportunity for building inference accelerators optimized for specific datacenter-scale workloads. We propose Full-stack Accelerator Search Technique (FAST), a hardware…

Machine Learning · Computer Science 2022-02-02 Dan Zhang , Safeen Huda , Ebrahim Songhori , Kartik Prabhu , Quoc Le , Anna Goldie , Azalia Mirhoseini

Superfast Selection for Decision Tree Algorithms

We present a novel and systematic method, called Superfast Selection, for selecting the "optimal split" for decision tree and feature selection algorithms over tabular data. The method speeds up split selection on a single feature by…

Machine Learning · Computer Science 2024-06-05 Huaduo Wang , Gopal Gupta

AUTOMATA: Gradient Based Data Subset Selection for Compute-Efficient Hyper-parameter Tuning

Deep neural networks have seen great success in recent years; however, training a deep model is often challenging as its performance heavily depends on the hyper-parameters used. In addition, finding the optimal hyper-parameter…

Machine Learning · Computer Science 2022-03-17 Krishnateja Killamsetty , Guttu Sai Abhishek , Aakriti , Alexandre V. Evfimievski , Lucian Popa , Ganesh Ramakrishnan , Rishabh Iyer

Faster Meta Update Strategy for Noise-Robust Deep Learning

It has been shown that deep neural networks are prone to overfitting on biased training data. Towards addressing this issue, meta-learning employs a meta model for correcting the training bias. Despite the promising performances, super slow…

Machine Learning · Computer Science 2021-05-03 Youjiang Xu , Linchao Zhu , Lu Jiang , Yi Yang

End-to-End Neural Network Training for Hyperbox-Based Classification

Hyperbox-based classification has been seen as a promising technique in which decisions on the data are represented as a series of orthogonal, multidimensional boxes (i.e., hyperboxes) that are often interpretable and human-readable.…

Machine Learning · Computer Science 2023-08-02 Denis Mayr Lima Martins , Christian Lülf , Fabian Gieseke

In-context Learning of Evolving Data Streams with Tabular Foundational Models

State-of-the-art data stream mining has long drawn from ensembles of the Very Fast Decision Tree, a seminal algorithm honored with the 2015 KDD Test-of-Time Award. However, the emergence of large tabular models, i.e., transformers designed…

Machine Learning · Computer Science 2025-12-16 Afonso Lourenço , João Gama , Eric P. Xing , Goreti Marreiros

Tab2Visual: Overcoming Limited Data in Tabular Data Classification Using Deep Learning with Visual Representations

This research addresses the challenge of limited data in tabular data classification, particularly prevalent in domains with constraints like healthcare. We propose Tab2Visual, a novel approach that transforms heterogeneous tabular data…

Machine Learning · Computer Science 2025-02-12 Ahmed Mamdouh , Moumen El-Melegy , Samia Ali , Ron Kikinis

Tokenize features, enhancing tables: the FT-TABPFN model for tabular classification

Traditional methods for tabular classification usually rely on supervised learning from scratch, which requires extensive training data to determine model parameters. However, a novel approach called Prior-Data Fitted Networks (TabPFN) has…

Machine Learning · Computer Science 2024-06-12 Quangao Liu , Wei Yang , Chen Liang , Longlong Pang , Zhuozhang Zou

AutoAssist: A Framework to Accelerate Training of Deep Neural Networks

Deep neural networks have yielded superior performance in many applications; however, the gradient computation in a deep model with millions of instances lead to a lengthy training process even with modern GPU/TPU hardware acceleration. In…

Machine Learning · Computer Science 2019-05-10 Jiong Zhang , Hsiang-fu Yu , Inderjit S. Dhillon

BUFF: Boosted Decision Tree based Ultra-Fast Flow matching

Tabular data stands out as one of the most frequently encountered types in high energy physics. Unlike commonly homogeneous data such as pixelated images, simulating high-dimensional tabular data and accurately capturing their correlations…

Instrumentation and Detectors · Physics 2024-04-30 Cheng Jiang , Sitian Qian , Huilin Qu

A Closer Look at Deep Learning Methods on Tabular Datasets

Tabular data is prevalent across diverse domains in machine learning. With the rapid progress of deep tabular prediction methods, especially pretrained (foundation) models, there is a growing need to evaluate these methods systematically…

Machine Learning · Computer Science 2025-11-10 Han-Jia Ye , Si-Yang Liu , Hao-Run Cai , Qi-Le Zhou , De-Chuan Zhan