Machine Learning - Scifaro

Provably Reliable Classifier Guidance via Cross-Entropy Control

Classifier-guided diffusion models generate conditional samples by augmenting the reverse-time score with the gradient of the log-probability predicted by a probabilistic classifier. In practice, this classifier is usually obtained by…

Machine Learning · Statistics 2026-02-06 Sharan Sahu, Arisina Banerjee, Yuchen Wu

Neural Networks Learn Generic Multi-Index Models Near Information-Theoretic Limit

In deep learning, a central issue is to understand how neural networks efficiently learn high-dimensional features. To this end, we explore the gradient descent learning of a general Gaussian Multi-index model…

Machine Learning · Statistics 2026-02-06 Bohan Zhang, Zihao Wang, Hengyu Fu, Jason D. Lee

How Data Mixing Shapes In-Context Learning: Asymptotic Equivalence for Transformers with MLPs

Pretrained Transformers demonstrate remarkable in-context learning (ICL) capabilities, enabling them to adapt to new tasks from demonstrations without parameter updates. However, theoretical studies often rely on simplified architectures…

Machine Learning · Statistics 2026-02-06 Samet Demir, Zafer Dogan

A Representer Theorem for Hawkes Processes via Penalized Least Squares Minimization

The representer theorem is a cornerstone of kernel methods, which aim to estimate latent functions in reproducing kernel Hilbert spaces (RKHSs) in a nonparametric manner. Its significance lies in converting inherently infinite-dimensional…

Machine Learning · Statistics 2026-02-06 Hideaki Kim, Tomoharu Iwata

Conditional regression for the Nonlinear Single-Variable Model

Regressing a function $F$ on $\mathbb{R}^d$ without the statistical and computational curse of dimensionality requires special statistical models, for example that impose geometric assumptions on the distribution of the data (e.g., that its…

Machine Learning · Statistics 2026-02-06 Yantao Wu, Mauro Maggioni

Root Cause Analysis of Outliers with Missing Structural Knowledge

The goal of Root Cause Analysis (RCA) is to explain why an anomaly occurred by identifying where the fault originated. Several recent works model the anomalous event as resulting from a change in the causal mechanism at the root cause,…

Machine Learning · Statistics 2026-02-06 William Roy Orchard, Nastaran Okati, Sergio Hernan Garrido Mejia, Patrick Blöbaum, Dominik Janzing

CATS: Enhancing Multivariate Time Series Forecasting by Constructing Auxiliary Time Series as Exogenous Variables

For Multivariate Time Series Forecasting (MTSF), recent deep learning applications show that univariate models frequently outperform multivariate ones. To address the deficiency in multivariate models, we introduce a method to Construct…

Machine Learning · Statistics 2026-02-06 Jiecheng Lu, Xu Han, Yan Sun, Shihao Yang

ARM: Refining Multivariate Forecasting with Adaptive Temporal-Contextual Learning

Long-term time series forecasting (LTSF) is important for various domains but is confronted by challenges in handling complex temporal-contextual relationships. As multivariate input models underperform some recent univariate…

Machine Learning · Statistics 2026-02-06 Jiecheng Lu, Xu Han, Shihao Yang

Improved Generalization Bounds for Transductive Learning by Transductive Local Complexity and Its Applications

We introduce Transductive Local Complexity (TLC) to extend the classical Local Rademacher Complexity (LRC) to the transductive setting, incorporating substantial and novel components. Although LRC has been used to obtain sharp…

Machine Learning · Statistics 2026-02-06 Yingzhen Yang

Conditional Counterfactual Mean Embeddings: Doubly Robust Estimation and Learning Rates

A complete understanding of heterogeneous treatment effects involves characterizing the full conditional distribution of potential outcomes. To this end, we propose the Conditional Counterfactual Mean Embeddings (CCME), a framework that…

Machine Learning · Statistics 2026-02-05 Thatchanon Anancharoenkij, Donlapark Ponnoprat

Causal explanations of outliers in systems with lagged time-dependencies

Root-cause analysis in controlled time-dependent systems poses a major challenge in applications. Energy systems are especially difficult to handle, as they exhibit instantaneous as well as delayed effects and, if equipped with storage, do…

Machine Learning · Statistics 2026-02-05 Philipp Alexander Schwarz, Johannes Oberpriller, Sven Klaassen

A principled framework for uncertainty decomposition in TabPFN

TabPFN is a transformer that achieves state-of-the-art performance on supervised tabular tasks by amortizing Bayesian prediction into a single forward pass. However, there is currently no method for uncertainty decomposition in TabPFN.…

Machine Learning · Statistics 2026-02-05 Sandra Fortini, Kenyon Ng, Sonia Petrone, Judith Rousseau, Susan Wei

Bayesian PINNs for uncertainty-aware inverse problems (BPINN-IP)

The main contribution of this paper is to develop a hierarchical Bayesian formulation of PINNs for linear inverse problems, which is called BPINN-IP. The proposed methodology extends PINN to account for prior knowledge on the nature of the…

Machine Learning · Statistics 2026-02-05 Ali Mohammad-Djafari

Anytime-Valid Conformal Risk Control

Prediction sets provide a means of quantifying the uncertainty in predictive tasks. Using held out calibration data, conformal prediction and risk control can produce prediction sets that exhibit statistically valid error control in a…

Machine Learning · Statistics 2026-02-05 Bror Hultberg, Dave Zachariah, Antônio H. Ribeiro

Geometry-Aware Optimal Transport: Fast Intrinsic Dimension and Wasserstein Distance Estimation

Solving large scale Optimal Transport (OT) in machine learning typically relies on sampling measures to obtain a tractable discrete problem. While the discrete solver's accuracy is controllable, the rate of convergence of the discretization…

Machine Learning · Statistics 2026-02-05 Ferdinand Genans, Olivier Wintenberger

Provable Target Sample Complexity Improvements as Pre-Trained Models Scale

Pre-trained models have become indispensable for efficiently building models across a broad spectrum of downstream tasks. The advantages of pre-trained models have been highlighted by empirical studies on scaling laws, which demonstrate…

Machine Learning · Statistics 2026-02-05 Kazuto Fukuchi, Ryuichiro Hataya, Kota Matsui

Maximin Relative Improvement: Fair Learning as a Bargaining Problem

When deploying a single predictor across multiple subpopulations, we propose a fundamentally different approach: interpreting group fairness as a bargaining problem among subpopulations. This game-theoretic perspective reveals that existing…

Machine Learning · Statistics 2026-02-05 Jiwoo Han, Moulinath Banerjee, Yuekai Sun

Attack-Resistant Uniform Fairness for Linear and Smooth Contextual Bandits

Modern systems, such as digital platforms and service systems, increasingly rely on contextual bandits for online decision-making; however, their deployment can inadvertently create unfair exposure among arms, undermining long-term platform…

Machine Learning · Statistics 2026-02-05 Qingwen Zhang, Wenjia Wang

Efficient Subgroup Analysis via Optimal Trees with Global Parameter Fusion

Identifying and making statistical inferences on differential treatment effects (commonly known as subgroup analysis in clinical research) is central to precision health. Subgroup analysis allows practitioners to pinpoint populations for…

Machine Learning · Statistics 2026-02-05 Zhongming Xie, Joseph Giorgio, Jingshen Wang

Learning Multi-type heterogeneous interacting particle systems

We propose a framework for the joint inference of network topology, multi-type interaction kernels, and latent type assignments in heterogeneous interacting particle systems from multi-trajectory data. This learning task is a challenging…

Machine Learning · Statistics 2026-02-05 Quanjun Lang, Xiong Wang, Fei Lu, Mauro Maggioni