Related papers: Robust Tabular Foundation Models

On the Robustness of Tabular Foundation Models: Test-Time Attacks and In-Context Defenses

Recent tabular Foundational Models (FM), such as TabPFN and TabICL, leverage in-context learning to achieve strong performance without gradient updates or fine-tuning. However, their robustness to adversarial manipulation remains largely…

Machine Learning · Computer Science 2026-04-10 Mohamed Djilani, Thibault Simonetto, Karim Tit, Florian Tambon, Salah Ghamizi, Maxime Cordy, Mike Papadakis

TabularFM: An Open Framework For Tabular Foundational Models

Foundational models (FMs), pretrained on extensive datasets using self-supervised techniques, are capable of learning generalized patterns from large amounts of data. This reduces the need for extensive labeled datasets for each new task,…

Machine Learning · Computer Science 2024-06-19 Quan M. Tran, Suong N. Hoang, Lam M. Nguyen, Dzung Phan, Hoang Thanh Lam

Mitra: Mixed Synthetic Priors for Enhancing Tabular Foundation Models

Since the seminal work of TabPFN, research on tabular foundation models (TFMs) based on in-context learning (ICL) has challenged long-standing paradigms in machine learning. Without seeing any real-world data, models pretrained on purely…

Machine Learning · Computer Science 2025-10-27 Xiyuan Zhang, Danielle C. Maddix, Junming Yin, Nick Erickson, Abdul Fatir Ansari, Boran Han, Shuai Zhang, Leman Akoglu, Christos Faloutsos, Michael W. Mahoney, Cuixiong Hu, Huzefa Rangwala, George Karypis, Bernie Wang

High Performance, Low Reliability: Uncertainty Benchmarking for Tabular Foundation Models

Recent Tabular Foundation Models (TFMs) have demonstrated state-of-the-art predictive performance, often surpassing Gradient-Boosted Decision Trees (GBDTs). However, the trustworthiness of these models, particularly their uncertainty…

Machine Learning · Computer Science 2026-05-28 José Lucas De Melo Costa, Fabrice Popineau, Arpad Rimmel, Bich-Liên Doan

Tabular foundation models for in-context prediction of molecular properties

Accurate molecular property prediction is central to drug discovery, catalysis, and process design, yet real-world applications are often limited by small datasets. Molecular foundation models provide a promising direction by learning…

Machine Learning · Computer Science 2026-04-21 Karim K. Ben Hicham, Jan G. Rittig, Martin Grohe, Alexander Mitsos

TabICLv2: A better, faster, scalable, and open tabular foundation model

Tabular foundation models, such as TabPFNv2 and TabICL, have recently dethroned gradient-boosted trees at the top of predictive benchmarks, demonstrating the value of in-context learning for tabular data. We introduce TabICLv2, a new…

Machine Learning · Computer Science 2026-02-12 Jingang Qu, David Holzmüller, Gaël Varoquaux, Marine Le Morvan

Real-TabPFN: Improving Tabular Foundation Models via Continued Pre-training With Real-World Data

Foundation models for tabular data, like TabPFN, achieve strong performance on small datasets when pre-trained solely on synthetic data. We show that this performance can be significantly boosted by a targeted continued pre-training phase.…

Machine Learning · Computer Science 2025-07-08 Anurag Garg, Muhammad Ali, Noah Hollmann, Lennart Purucker, Samuel Müller, Frank Hutter

Is TabPFN the Silver Bullet for Insurance Pricing?

Modelling claim frequency and severity for non-life insurance pricing predominantly relies on generalised linear models, with gradient-boosted machines as the leading machine learning alternative. Tabular foundation models (TFMs) present a…

Risk Management · Quantitative Finance 2026-05-26 Bruno Deprez, Wouter Verbeke, Tim Verdonck

Causal Pre-training Under the Fairness Lens: An Empirical Study of TabPFN

Foundation models for tabular data, such as the Tabular Prior-data Fitted Network (TabPFN), are pre-trained on a massive number of synthetic datasets generated by structural causal models (SCM). They leverage in-context learning to offer…

Machine Learning · Computer Science 2026-01-28 Qinyi Liu, Mohammad Khalil, Naman Goel

A Closer Look at TabPFN v2: Understanding Its Strengths and Extending Its Capabilities

Tabular datasets are inherently heterogeneous, presenting significant challenges for developing pre-trained foundation models. The recently introduced transformer-based Tabular Prior-data Fitted Network v2 (TabPFN v2) achieves unprecedented…

Machine Learning · Computer Science 2025-06-12 Han-Jia Ye, Si-Yang Liu, Wei-Lun Chao

Exploring Fine-Tuning for Tabular Foundation Models

Tabular Foundation Models (TFMs) have recently shown strong in-context learning capabilities on structured data, achieving zero-shot performance comparable to traditional machine learning methods. We find that zero-shot TFMs already achieve…

Machine Learning · Computer Science 2026-01-15 Aditya Tanna, Pratinav Seth, Mohamed Bouadi, Vinay Kumar Sankarapu

Are Time-Series Foundation Models Deployment-Ready? A Systematic Study of Adversarial Robustness Across Domains

Time-Series Foundation Models (TSFMs) are rapidly transitioning from research prototypes to core components of critical decision-making systems, driven by their impressive zero-shot forecasting capabilities. However, as their deployment…

Machine Learning · Computer Science 2025-12-09 Jiawen Zhang, Zhenwei Zhang, Shun Zheng, Xumeng Wen, Jia Li, Jiang Bian

Robustness Analysis on Foundational Segmentation Models

Due to the increase in computational resources and accessibility of data, an increase in large, deep learning models trained on copious amounts of multi-modal data using self-supervised or semi-supervised learning have emerged. These…

Computer Vision and Pattern Recognition · Computer Science 2024-04-30 Madeline Chantry Schiappa, Shehreen Azad, Sachidanand VS, Yunhao Ge, Ondrej Miksik, Yogesh S. Rawat, Vibhav Vineet

Tabular Foundation Models Can Learn Association Rules

Association Rule Mining (ARM) is a fundamental task for knowledge discovery in tabular data and is widely used in high-stakes decision-making. Classical ARM methods rely on frequent itemset mining, leading to rule explosion and poor…

Artificial Intelligence · Computer Science 2026-02-18 Erkan Karabulut, Daniel Daza, Paul Groth, Martijn C. Schut, Victoria Degeler

Live Knowledge Tracing: Real-Time Adaptation using Tabular Foundation Models

Deep knowledge tracing models have achieved significant breakthroughs in modeling student learning trajectories. However, these architectures require substantial training time and are prone to overfitting on datasets with short sequences.…

Machine Learning · Computer Science 2026-04-28 Mounir Lbath, Alexandre Parésy, Abdelkayoum Kaddouri, Abdelrahman Zighem, Jill-Jênn Vie

Prior-Aligned Data Cleaning for Tabular Foundation Models

Tabular Foundation Models (TFMs) achieve state-of-the-art zero-shot accuracy on small tabular datasets by meta-learning over synthetic data-generating processes -- making them highly attractive for practitioners who cannot afford large…

Machine Learning · Computer Science 2026-04-29 Laure Berti-Equille

xRFM: Accurate, scalable, and interpretable feature learning models for tabular data

Inference from tabular data, collections of continuous and categorical variables organized into matrices, is a foundation for modern technology and science. Yet, in contrast to the explosive changes in the rest of AI, the best practice for…

Machine Learning · Computer Science 2026-04-07 Daniel Beaglehole, David Holzmüller, Adityanarayanan Radhakrishnan, Mikhail Belkin

Bridging the Gap Between Foundation Models and Heterogeneous Federated Learning

Federated learning (FL) offers privacy-preserving decentralized machine learning, optimizing models at edge clients without sharing private data. Simultaneously, foundation models (FMs) have gained traction in the artificial intelligence…

Machine Learning · Computer Science 2023-10-06 Sixing Yu, J. Pablo Muñoz, Ali Jannesari

nanoTabPFN: A Lightweight and Educational Reimplementation of TabPFN

Tabular foundation models such as TabPFN have revolutionized predictive machine learning for tabular data. At the same time, the driving factors of this revolution are hard to understand. Existing open-source tabular foundation models are…

Machine Learning · Computer Science 2025-12-19 Alexander Pfefferle, Johannes Hog, Lennart Purucker, Frank Hutter

Pocket Foundation Models: Distilling TFMs into CPU-Ready Gradient-Boosted Trees

A fraud scorer needs to answer in under 2 ms. The best tabular foundation models (TFMs) take 151-1,275 ms on GPU. We close this gap by distilling the TFM offline into an XGBoost or CatBoost student that runs natively on CPU. The central…

Machine Learning · Computer Science 2026-05-19 Aditya Tanna, Nassim Bouarour, Mohamed Bouadi, Vinay Kumar Sankarapu, Pratinav Seth