机器学习 — Scifaro

Beyond NNGP: Large Deviations and Feature Learning in Bayesian Neural Networks

We study wide Bayesian neural networks focusing on the rare but statistically dominant fluctuations that govern posterior concentration, beyond Gaussian-process limits. Large-deviation theory provides explicit variational objectives-rate…

机器学习 · 统计学 2026-02-27 Katerina Papagiannouli , Dario Trevisan , Giuseppe Pio Zitto

Unsupervised Continual Learning for Amortized Bayesian Inference

Amortized Bayesian Inference (ABI) enables efficient posterior estimation using generative neural networks trained on simulated data, but often suffers from performance degradation under model misspecification. While self-consistency (SC)…

机器学习 · 统计学 2026-02-27 Aayush Mishra , Šimon Kucharský , Paul-Christian Bürkner

From Shallow Bayesian Neural Networks to Gaussian Processes: General Convergence, Identifiability and Scalable Inference

In this work, we study scaling limits of shallow Bayesian neural networks (BNNs) via their connection to Gaussian processes (GPs), with an emphasis on statistical modeling, identifiability, and scalable inference. We first establish a…

机器学习 · 统计学 2026-02-27 Gracielle Antunes de Araújo , Flávio B. Gonçalves

LoBoost: Fast Model-Native Local Conformal Prediction for Gradient-Boosted Trees

Gradient-boosted decision trees are among the strongest off-the-shelf predictors for tabular regression, but point predictions alone do not quantify uncertainty. Conformal prediction provides distribution-free marginal coverage, yet split…

机器学习 · 统计学 2026-02-27 Vagner Santos , Victor Coscrato , Luben Cabezas , Rafael Izbicki , Thiago Ramos

Not Just How Much, But Where: Decomposing Epistemic Uncertainty into Per-Class Contributions

In safety-critical classification, the cost of failure is often asymmetric, yet Bayesian deep learning summarises epistemic uncertainty with a single scalar, mutual information (MI), that cannot distinguish whether a model's ignorance…

机器学习 · 统计学 2026-02-27 Mame Diarra Toure , David A. Stephens

Scaling Laws for Precision in High-Dimensional Linear Regression

Low-precision training is critical for optimizing the trade-off between model quality and training costs, necessitating the joint allocation of model size, dataset size, and numerical precision. While empirical scaling laws suggest that…

机器学习 · 统计学 2026-02-27 Dechen Zhang , Xuan Tang , Yingyu Liang , Difan Zou

One-Step Diffusion Samplers via Self-Distillation and Deterministic Flow

Sampling from unnormalized target distributions is a fundamental yet challenging task in machine learning and statistics. Existing sampling algorithms typically require many iterative steps to produce high-quality samples, leading to high…

机器学习 · 统计学 2026-02-27 Pascal Jutras-Dube , Jiaru Zhang , Ziran Wang , Ruqi Zhang

On the Interpolation Error of Nonlinear Attention versus Linear Regression

Attention has become the core building block of modern machine learning (ML) by efficiently capturing the long-range dependencies among input tokens. Its inherently parallelizable structure allows for efficient performance scaling with the…

机器学习 · 统计学 2026-02-27 Zhenyu Liao , Jiaqing Liu , TianQi Hou , Difan Zou , Zenan Ling

Approximation Error and Complexity Bounds for ReLU Networks on Low-Regular Function Spaces

In this work, we consider the approximation of a large class of bounded functions, with minimal regularity assumptions, by ReLU neural networks. We show that the approximation error can be bounded from above by a quantity proportional to…

机器学习 · 统计学 2026-02-27 Owen Davis , Gianluca Geraci , Mohammad Motamed

Probing the Geometry of Diffusion Models with the String Method

Understanding the geometry of learned distributions is fundamental to improving and interpreting diffusion models, yet systematic tools for exploring their landscape remain limited. Standard latent-space interpolations fail to respect the…

机器学习 · 统计学 2026-02-26 Elio Moreau , Florentin Coeurdoux , Grégoire Ferre , Eric Vanden-Eijnden

Scalable Kernel-Based Distances for Statistical Inference and Integration

Representing, comparing, and measuring the distance between probability distributions is a key task in computational statistics and machine learning. The choice of representation and the associated distance determine properties of the…

机器学习 · 统计学 2026-02-26 Masha Naslidnyk

Goodness-of-Fit Tests for Latent Class Models with Ordinal Categorical Data

Ordinal categorical data are widely collected in psychology, education, and other social sciences, appearing commonly in questionnaires, assessments, and surveys. Latent class models provide a flexible framework for uncovering unobserved…

机器学习 · 统计学 2026-02-26 Huan Qing

Fair Model-based Clustering

The goal of fair clustering is to find clusters such that the proportion of sensitive attributes (e.g., gender, race, etc.) in each cluster is similar to that of the entire dataset. Various fair clustering algorithms have been proposed that…

机器学习 · 统计学 2026-02-26 Jinwon Park , Kunwoong Kim , Jihu Lee , Yongdai Kim

Efficient Inference after Directionally Stable Adaptive Experiments

We study inference on scalar-valued pathwise differentiable targets after adaptive data collection, such as a bandit algorithm. We introduce a novel target-specific condition, directional stability, which is strictly weaker than previously…

机器学习 · 统计学 2026-02-26 Zikai Shen , Houssam Zenati , Nathan Kallus , Arthur Gretton , Koulik Khamaru , Aurélien Bibaut

ConformalHDC: Uncertainty-Aware Hyperdimensional Computing with Application to Neural Decoding

Hyperdimensional Computing (HDC) offers a computationally efficient paradigm for neuromorphic learning. Yet, it lacks rigorous uncertainty quantification, leading to open decision boundaries and, consequently, vulnerability to outliers,…

机器学习 · 统计学 2026-02-26 Ziyi Liang , Hamed Poursiami , Zhishun Yang , Keiland Cooper , Akhilesh Jaiswal , Maryam Parsa , Norbert Fortin , Babak Shahbaba

Efficient Uncoupled Learning Dynamics with $\tilde{O}\!\left(T^{-1/4}\right)$ Last-Iterate Convergence in Bilinear Saddle-Point Problems over Convex Sets under Bandit Feedback

In this paper, we study last-iterate convergence of learning algorithms in bilinear saddle-point problems, a preferable notion of convergence that captures the day-to-day behavior of learning dynamics. We focus on the challenging setting…

机器学习 · 统计学 2026-02-26 Arnab Maiti , Claire Jie Zhang , Kevin Jamieson , Jamie Heather Morgenstern , Ioannis Panageas , Lillian J. Ratliff

Conditional neural control variates for variance reduction in Bayesian inverse problems

Bayesian inference for inverse problems involves computing expectations under posterior distributions -- e.g., posterior means, variances, or predictive quantities -- typically via Monte Carlo (MC) estimation. When the quantity of interest…

机器学习 · 统计学 2026-02-26 Ali Siahkoohi , Hyunwoo Oh

Counterdiabatic Hamiltonian Monte Carlo

Hamiltonian Monte Carlo (HMC) is a state of the art method for sampling from distributions with differentiable densities, but can converge slowly when applied to challenging multimodal problems. Running HMC with a time varying Hamiltonian,…

机器学习 · 统计学 2026-02-26 Reuben Cohn-Gordon , Uroš Seljak , Dries Sels

A Proof of Learning Rate Transfer under $\mu$P

We provide the first proof of learning rate transfer with width in a linear multi-layer perceptron (MLP) parametrized with $\mu$P, a neural network parameterization designed to ``maximize'' feature learning in the infinite-width limit. We…

机器学习 · 统计学 2026-02-26 Soufiane Hayou

Multimodal Datasets with Controllable Mutual Information

We introduce a framework for generating highly multimodal datasets with explicitly calculable mutual information (MI) between modalities. This enables the construction of benchmark datasets that provide a novel testbed for systematic…

机器学习 · 统计学 2026-02-26 Raheem Karim Hashmani , Garrett W. Merz , Helen Qu , Mariel Pettee , Kyle Cranmer