Machine Learning - Scifaro

When Marginals Match but Structure Fails: Covariance Fidelity in Generative Models

Generative models are increasingly deployed as substitutes for real data in downstream scientific workflows, yet standard evaluation criteria remain focused on marginal distribution matching. We argue that this represents a fundamental gap:…

Machine Learning · Statistics 2026-05-19 Nazia Riasat

RIE-Greedy: Regularization-Induced Exploration for Contextual Bandits

Real-world contextual bandit problems with complex reward models are often tackled with iteratively trained models, such as boosting trees. However, it is difficult to directly apply simple and effective exploration strategies--such as…

Machine Learning · Statistics 2026-05-19 Tong Li, Thiago de Queiroz Casanova, Eric M. Schwartz, Victor Kostyuk, Dehan Kong, Joseph J. Williams

Masking Causality and Conditional Dependence

Many regulatory and analytic problems require that a prohibited variable influence a decision only through a designated allowable channel -- a conditional-independence requirement that arises in path-specific fairness, the handling of…

Machine Learning · Statistics 2026-05-19 Zou Yang, Sophia Xiao, Bijan Mazaheri

Fast Rates for Nonstationary Weighted Risk Minimization

Weighted empirical risk minimization is a common approach to prediction under distribution drift. This article studies its out-of-sample prediction error under nonstationarity. We provide a general decomposition of the excess risk into a…

Machine Learning · Statistics 2026-05-19 Tobias Brock, Thomas Nagler

Finite-Particle Rates for Regularized Stein Variational Gradient Descent

We derive finite-particle rates for the regularized Stein variational gradient descent (R-SVGD) algorithm introduced by He et al. (2024) that corrects the constant-order bias of the SVGD by applying a resolvent-type preconditioner to the…

Machine Learning · Statistics 2026-05-19 Ye He, Krishnakumar Balasubramanian, Sayan Banerjee, Promit Ghosal

Multi-layer Cross-attention is Provably Optimal for Multi-modal In-context Learning

Recent progress has rapidly advanced our understanding of the mechanisms underlying in-context learning in modern attention-based neural networks. However, existing results focus exclusively on unimodal data; in contrast, the theoretical…

Machine Learning · Statistics 2026-05-19 Nicholas Barnfield, Subhabrata Sen, Pragya Sur

ST-BCP: Tightening Coverage Bound for Backward Conformal Prediction via Non-Conformity Score Transformation

Conformal Prediction (CP) provides a statistical framework for uncertainty quantification that constructs prediction sets with coverage guarantees. While CP yields uncontrolled prediction set sizes, Backward Conformal Prediction (BCP)…

Machine Learning · Statistics 2026-05-19 Junxian Liu, Hao Zeng, Hongxin Wei

Latent-IMH: Efficient Bayesian Inference for Inverse Problems with Approximate Operators

We study sampling from posterior distributions in Bayesian linear inverse problems where $A$, the parameter-to-observable operator, is computationally expensive. In many applications, $A$ can be factored in a manner that facilitates the…

Machine Learning · Statistics 2026-05-19 Youguang Chen, George Biros

Detecting Stochasticity in Discrete Signals via Nonparametric Excursion Theorem

We develop a practical framework for distinguishing diffusive stochastic processes from deterministic signals using only a single discrete time series. Our approach is based on classical excursion and crossing theorems for continuous…

Machine Learning · Statistics 2026-05-19 Sunia Tanweer, Firas A. Khasawneh

Gradient Dynamics of Attention: How Cross-Entropy Sculpts Bayesian Manifolds

Transformers empirically perform precise probabilistic reasoning in carefully constructed "Bayesian wind tunnels" and in large-scale language models, yet the mechanisms by which gradient-based learning creates the required internal…

Machine Learning · Statistics 2026-05-19 Naman Agarwal, Siddhartha R. Dalal, Vishal Misra

TPV: Parameter Perturbations Through the Lens of Test Prediction Variance

We introduce test prediction variance (TPV)--the first-order sensitivity of a trained model's outputs to parameter perturbations--as a unifying framework for analyzing post-training robustness. TPV is a fully label-free object whose trace…

Machine Learning · Statistics 2026-05-19 Devansh Arpit

Sparse Deep Additive Model with Interactions: Enhancing Interpretability and Predictability

Recent advances in deep learning highlight the need for personalized models that can learn from small samples, handle high-dimensional features, and remain interpretable. To address this, we propose the Sparse Deep Additive Model with…

Machine Learning · Statistics 2026-05-19 Yi-Ting Hung, Li-Hsiang Lin, Vince D. Calhoun

Universal Inverse Distillation for Matching Models with Real-Data Supervision (No GANs)

While achieving exceptional generative quality, modern diffusion, flow, and other matching models suffer from slow inference, as they require many steps of iterative generation. Recent distillation methods address this problem by training…

Machine Learning · Statistics 2026-05-19 Nikita Kornilov, David Li, Tikhon Mavrin, Aleksei Leonov, Nikita Gushchin, Evgeny Burnaev, Iaroslav Koshelev, Alexander Korotin

Lipschitz-Guided Design of Interpolation Schedules in Generative Models

We study the design of interpolation schedules in flow and diffusion-based generative models from both statistical and numerical perspectives. Within the stochastic interpolants framework, we first show that scalar interpolation schedules…

Machine Learning · Statistics 2026-05-19 Yifan Chen, Eric Vanden-Eijnden, Jiawei Xu

Graph neural networks for residential location choice: connection to classical logit models

Researchers have adopted deep learning for classical discrete choice analysis as it can capture complex feature relationships and achieve higher predictive performance. However, the existing deep learning approaches cannot explicitly…

Machine Learning · Statistics 2026-05-19 Zhanhong Cheng, Lingqian Hu, Yuheng Bu, Yuqi Zhou, Shenhao Wang

A Randomized Algorithm for Sparse PCA based on the Basic SDP Relaxation

Sparse Principal Component Analysis (SPCA) is a fundamental technique for dimensionality reduction, and is NP-hard. In this paper, we introduce a randomized approximation algorithm for SPCA, which is based on the basic SDP relaxation. Our…

Machine Learning · Statistics 2026-05-19 Alberto Del Pia, Dekun Zhou

Nash: Neural Adaptive Shrinkage for Structured High-Dimensional Regression

Sparse linear regression is a fundamental tool in data analysis. However, traditional approaches often fall short when covariates exhibit structure or arise from heterogeneous sources. In biomedical applications, covariates may stem from…

Machine Learning · Statistics 2026-05-19 William R. P. Denault

Throughput-Optimal Scheduling Algorithms for LLM Inference and AI Agents

As demand for Large Language Models (LLMs) and AI agents grows rapidly, optimizing systems for efficient LLM inference becomes critical. While significant efforts have targeted system-level engineering, little has been explored from a…

Machine Learning · Statistics 2026-05-19 J. G. Dai, Tianze Deng, Yueying Li, Tianyi Peng

High-dimensional ridge regression with random features for non-identically distributed data with a variance profile

Random feature ridge regression is often analyzed in the high-dimensional regime under the homogeneous sampling model $x_i=\Sigma^{1/2}x_i'$, where the vectors $x_i'$ have iid entries and the same covariance matrix $\Sigma$ is shared by all…

Machine Learning · Statistics 2026-05-19 Issa-Mbenard Dabo, Jérémie Bigot

Distribution Transformers: Fast Approximate Bayesian Inference With On-The-Fly Prior Adaptation

While Bayesian inference provides a principled framework for reasoning under uncertainty, its widespread adoption is limited by the intractability of exact posterior computation, necessitating the use of approximate inference. However,…

Machine Learning · Statistics 2026-05-19 George Whittle, Juliusz Ziomek, Jacob Rawling, Maike A. Osborne