Related papers: HyperFast: Instant Classification for Tabular Data

200 papers

Foundation models are transforming machine learning across many modalities, with in-context learning replacing classical model training. Recent work on tabular data hints at a similar opportunity to build foundation models for…

Machine Learning · Computer Science 2025-05-12 Andreas Müller, Carlo Curino, Raghu Ramakrishnan

Deep learning has achieved impressive performance in many domains, such as computer vision and natural language processing, but its advantage over classical shallow methods on tabular datasets remains questionable. It is especially…

Machine Learning · Computer Science 2023-08-25 Witold Wydmański, Oleksii Bulenok, Marek Śmieja

In engineering design, navigating complex decision-making landscapes demands a thorough exploration of the design, performance, and constraint spaces, often impeded by resource-intensive simulations. Data-driven methods can mitigate this…

Computational Engineering, Finance, and Science · Computer Science 2024-01-17 Cyril Picard, Faez Ahmed

We present TabPFN, a trained Transformer that can do supervised classification for small tabular datasets in less than a second, needs no hyperparameter tuning and is competitive with state-of-the-art classification methods. TabPFN performs…

Machine Learning · Computer Science 2023-09-19 Noah Hollmann, Samuel Müller, Katharina Eggensperger, Frank Hutter

Prior-Fitted Networks (PFNs) have recently been proposed to efficiently perform tabular classification tasks. Although they achieve good performance on small datasets, they encounter limitations with larger datasets. These limitations…

Machine Learning · Computer Science 2025-03-04 Yuxin Wang, Botian Jiang, Yiran Guo, Quan Gan, David Wipf, Xuanjing Huang, Xipeng Qiu

Recent work on deep learning for tabular data demonstrates the strong performance of deep tabular models, often bridging the gap between gradient boosted decision trees and neural networks. Accuracy aside, a major advantage of neural models…

While deep learning has achieved remarkable success across many domains, it has historically underperformed on tabular learning tasks, which remain dominated by gradient boosting decision trees. However, recent advancements are paving the…

Machine Learning · Computer Science 2025-10-31 Alan Arazi, Eilam Shapira, Roi Reichart

Heterogeneous tabular data are the most commonly used form of data and are essential for numerous critical and computationally demanding applications. On homogeneous data sets, deep neural networks have repeatedly shown excellent…

Machine Learning · Computer Science 2023-01-24 Vadim Borisov, Tobias Leemann, Kathrin Seßler, Johannes Haug, Martin Pawelczyk, Gjergji Kasneci

In the literature on hyper-parameter tuning, a number of recent solutions rely on low-fidelity observations (e.g., training with sub-sampled datasets) in order to efficiently identify promising configurations to be then tested via…

Machine Learning · Computer Science 2022-12-05 Pedro Mendes, Maria Casimiro, Paolo Romano, David Garlan

The rapidly-changing deep learning landscape presents a unique opportunity for building inference accelerators optimized for specific datacenter-scale workloads. We propose Full-stack Accelerator Search Technique (FAST), a hardware…

Machine Learning · Computer Science 2022-02-02 Dan Zhang, Safeen Huda, Ebrahim Songhori, Kartik Prabhu, Quoc Le, Anna Goldie, Azalia Mirhoseini

We present a novel and systematic method, called Superfast Selection, for selecting the "optimal split" for decision tree and feature selection algorithms over tabular data. The method speeds up split selection on a single feature by…

Machine Learning · Computer Science 2024-06-05 Huaduo Wang, Gopal Gupta

Deep neural networks have seen great success in recent years; however, training a deep model is often challenging as its performance heavily depends on the hyper-parameters used. In addition, finding the optimal hyper-parameter…

It has been shown that deep neural networks are prone to overfitting on biased training data. Towards addressing this issue, meta-learning employs a meta model for correcting the training bias. Despite the promising performances, super slow…

Machine Learning · Computer Science 2021-05-03 Youjiang Xu, Linchao Zhu, Lu Jiang, Yi Yang

Hyperbox-based classification has been seen as a promising technique in which decisions on the data are represented as a series of orthogonal, multidimensional boxes (i.e., hyperboxes) that are often interpretable and human-readable.…

Machine Learning · Computer Science 2023-08-02 Denis Mayr Lima Martins, Christian Lülf, Fabian Gieseke

State-of-the-art data stream mining has long drawn from ensembles of the Very Fast Decision Tree, a seminal algorithm honored with the 2015 KDD Test-of-Time Award. However, the emergence of large tabular models, i.e., transformers designed…

Machine Learning · Computer Science 2025-12-16 Afonso Lourenço, João Gama, Eric P. Xing, Goreti Marreiros

This research addresses the challenge of limited data in tabular data classification, particularly prevalent in domains with constraints like healthcare. We propose Tab2Visual, a novel approach that transforms heterogeneous tabular data…

Machine Learning · Computer Science 2025-02-12 Ahmed Mamdouh, Moumen El-Melegy, Samia Ali, Ron Kikinis

Traditional methods for tabular classification usually rely on supervised learning from scratch, which requires extensive training data to determine model parameters. However, a novel approach called Prior-Data Fitted Networks (TabPFN) has…

Machine Learning · Computer Science 2024-06-12 Quangao Liu, Wei Yang, Chen Liang, Longlong Pang, Zhuozhang Zou

Deep neural networks have yielded superior performance in many applications; however, the gradient computation in a deep model with millions of instances leads to a lengthy training process even with modern GPU/TPU hardware acceleration. In…

Machine Learning · Computer Science 2019-05-10 Jiong Zhang, Hsiang-fu Yu, Inderjit S. Dhillon

Tabular data stands out as one of the most frequently encountered types in high energy physics. Unlike commonly homogeneous data such as pixelated images, simulating high-dimensional tabular data and accurately capturing their correlations…

Instrumentation and Detectors · Physics 2024-04-30 Cheng Jiang, Sitian Qian, Huilin Qu

Tabular data is prevalent across diverse domains in machine learning. With the rapid progress of deep tabular prediction methods, especially pretrained (foundation) models, there is a growing need to evaluate these methods systematically…

Machine Learning · Computer Science 2025-11-10 Han-Jia Ye, Si-Yang Liu, Hao-Run Cai, Qi-Le Zhou, De-Chuan Zhan