Related papers: Interpretable Tabular Foundation Models via In-Con…

TabICL: A Tabular Foundation Model for In-Context Learning on Large Data

The long-standing dominance of gradient-boosted decision trees on tabular data is currently challenged by tabular foundation models using In-Context Learning (ICL): setting the training data as context for the test data and predicting in a…

Machine Learning · Computer Science 2025-05-27 Jingang Qu , David Holzmüller , Gaël Varoquaux , Marine Le Morvan

Towards Fair In-Context Learning with Tabular Foundation Models

Transformer-based tabular foundation models have recently demonstrated promising in-context learning (ICL) performance on structured data, emerging as competitive alternatives to gradient-boosted trees. However, the fairness implications of…

Machine Learning · Computer Science 2026-01-06 Patrik Kenfack , Samira Ebrahimi Kahou , Ulrich Aïvodji

TabICLv2: A better, faster, scalable, and open tabular foundation model

Tabular foundation models, such as TabPFNv2 and TabICL, have recently dethroned gradient-boosted trees at the top of predictive benchmarks, demonstrating the value of in-context learning for tabular data. We introduce TabICLv2, a new…

Machine Learning · Computer Science 2026-02-12 Jingang Qu , David Holzmüller , Gaël Varoquaux , Marine Le Morvan

TabPFN Through The Looking Glass: An interpretability study of TabPFN and its internal representations

Tabular foundational models are pre-trained models designed for a wide range of tabular data tasks. They have shown strong performance across domains, yet their internal representations and learned concepts remain poorly understood. This…

Machine Learning · Computer Science 2026-01-14 Aviral Gupta , Armaan Sethi , Dhruv Kumar

Distributional Regression with Tabular Foundation Models: Evaluating Probabilistic Predictions via Proper Scoring Rules

Tabular foundation models such as TabPFN and TabICL already produce full predictive distributions, yet the benchmarks used to evaluate them (TabArena, TALENT, and others) still rely almost exclusively on point-estimate metrics (RMSE,…

Machine Learning · Computer Science 2026-03-31 Jonas Landsgesell , Pascal Knoll

Implicit Kernel Learning

Kernels are powerful and versatile tools in machine learning and statistics. Although the notion of universal kernels and characteristic kernels has been studied, kernel selection still greatly influences the empirical performance. While…

Machine Learning · Statistics 2019-02-28 Chun-Liang Li , Wei-Cheng Chang , Youssef Mroueh , Yiming Yang , Barnabás Póczos

TabQL: In-Context Q-Learning with Tabular Foundation Models

We propose Tabular Q-Learning (TabQL), a reinforcement learning framework that replaces the conventional parametric Q-network in Deep Q-Learning (DQN) with a tabular foundation model endowed with in-context learning capabilities. The key…

Machine Learning · Computer Science 2026-05-20 Qisai Liu , Zhanhong Jiang , Timilehin Ayanlade , Ashutosh Kumar Nirala , Yang Li , Aditya Balu , Soumik Sarkar

Towards Understanding Layer Contributions in Tabular In-Context Learning Models

Despite the architectural similarities between tabular in-context learning (ICL) models and large language models (LLMs), little is known about how individual layers contribute to tabular prediction. In this paper, we investigate how the…

Machine Learning · Computer Science 2025-11-20 Amir Rezaei Balef , Mykhailo Koshil , Katharina Eggensperger

Low-dimensional Interpretable Kernels with Conic Discriminant Functions for Classification

Kernels are often developed and used as implicit mapping functions that show impressive predictive power due to their high-dimensional feature space representations. In this study, we gradually construct a series of simple feature maps that…

Machine Learning · Computer Science 2020-07-20 Gurhan Ceylan , S. Ilker Birbil

Fine-tuned In-Context Learning Transformers are Excellent Tabular Data Classifiers

The recently introduced TabPFN pretrains an In-Context Learning (ICL) transformer on synthetic data to perform tabular data classification. In this work, we extend TabPFN to the fine-tuning setting, resulting in a significant performance…

Machine Learning · Computer Science 2025-01-24 Felix den Breejen , Sangmin Bae , Stephen Cha , Se-Young Yun

Understanding In-Context Learning on Structured Manifolds: Bridging Attention to Kernel Methods

While in-context learning (ICL) has achieved remarkable success in natural language and vision domains, its theoretical understanding-particularly in the context of structured geometric data-remains unexplored. This paper initiates a…

Machine Learning · Computer Science 2026-05-19 Zhaiming Shen , Alexander Hsu , Rongjie Lai , Wenjing Liao

TabDPT: Scaling Tabular Foundation Models on Real Data

Tabular data is one of the most ubiquitous sources of information worldwide, spanning a wide variety of domains. This inherent heterogeneity has slowed the development of Tabular Foundation Models (TFMs) capable of fast generalization to…

Machine Learning · Computer Science 2026-01-21 Junwei Ma , Valentin Thomas , Rasa Hosseinzadeh , Alex Labach , Hamidreza Kamkari , Jesse C. Cresswell , Keyvan Golestan , Guangwei Yu , Anthony L. Caterini , Maksims Volkovs

Interpretable Kernels

The use of kernels for nonlinear prediction is widespread in machine learning. They have been popularized in support vector machines and used in kernel ridge regression, amongst others. Kernel methods share three aspects. First, instead of…

Machine Learning · Statistics 2025-08-25 Patrick J. F. Groenen , Michael Greenacre

RDBLearn: Simple In-Context Prediction Over Relational Databases

Recent advances in tabular in-context learning (ICL) show that a single pretrained model can adapt to new prediction tasks from a small set of labeled examples, avoiding per-task training and heavy tuning. However, many real-world tasks…

Databases · Computer Science 2026-02-24 Yanlin Zhang , Linjie Xu , Quan Gan , David Wipf , Minjie Wang

Fast Estimation of Information Theoretic Learning Descriptors using Explicit Inner Product Spaces

Kernel methods form a theoretically-grounded, powerful and versatile framework to solve nonlinear problems in signal processing and machine learning. The standard approach relies on the \emph{kernel trick} to perform pairwise evaluations of…

Machine Learning · Computer Science 2020-01-03 Kan Li , Jose C. Principe

Scalable In-Context Learning on Tabular Data via Retrieval-Augmented Large Language Models

Recent studies have shown that large language models (LLMs), when customized with post-training on tabular data, can acquire general tabular in-context learning (TabICL) capabilities. These models are able to transfer effectively across…

Computation and Language · Computer Science 2025-02-06 Xumeng Wen , Shun Zheng , Zhen Xu , Yiming Sun , Jiang Bian

Real-Time Explanations for Tabular Foundation Models

Interpretability is central for scientific machine learning, as understanding \emph{why} models make predictions enables hypothesis generation and validation. While tabular foundation models show strong performance, existing explanation…

Machine Learning · Computer Science 2026-04-01 Luan Borges Teodoro Reis Sena , Francisco Galuppo Azevedo

Retrieval & Fine-Tuning for In-Context Tabular Models

Tabular data is a pervasive modality spanning a wide range of domains, and the inherent diversity poses a considerable challenge for deep learning. Recent advancements using transformer-based in-context learning have shown promise on…

Machine Learning · Computer Science 2024-06-11 Valentin Thomas , Junwei Ma , Rasa Hosseinzadeh , Keyvan Golestan , Guangwei Yu , Maksims Volkovs , Anthony Caterini

Understanding Emergent In-Context Learning from a Kernel Regression Perspective

Large language models (LLMs) have initiated a paradigm shift in transfer learning. In contrast to the classic pretraining-then-finetuning procedure, in order to use LLMs for downstream prediction tasks, one only needs to provide a few…

Computation and Language · Computer Science 2025-09-16 Chi Han , Ziqi Wang , Han Zhao , Heng Ji

TabGen-ICL: Residual-Aware In-Context Example Selection for Tabular Data Generation

Large Language models (LLMs) have achieved encouraging results in tabular data generation. However, existing approaches require fine-tuning, which is computationally expensive. This paper explores an alternative: prompting a fixed LLM with…

Machine Learning · Computer Science 2025-02-25 Liancheng Fang , Aiwei Liu , Hengrui Zhang , Henry Peng Zou , Weizhi Zhang , Philip S. Yu