机器学习 — Scifaro

Conformal Prediction via Transported Beta Laws

Split conformal prediction provides finite-sample marginal coverage under exchangeability, but this guarantee averages over the random calibration sample. We study instead the law of the calibration-conditional coverage induced by a…

机器学习 · 统计学 2026-05-20 Thiago R. Ramos , Helton Graziadei , Luben M. C. Cabezas

Markov Chain Decoders Overcome the Heavy-Tail Limitations of Lipschitz Generative Models

Heavy-tailed distributions are prevalent in performance evaluation, network traffic, and risk modeling. This behavior poses a fundamental challenge for modern deep generative models. Standard Variational Autoencoders (VAEs) employ Gaussian…

机器学习 · 统计学 2026-05-20 Abdelhakim Ziani , Andras Horvath , Paolo Ballarini

Bayesian Latent Space Models for Graphs Are Misspecified: Toward Robust Inference via Generalized Posteriors

Bayesian latent space models offer a principled approach to network representation, but rely on correct specification of both geometry and link function. Real-world networks often violate these assumptions, exhibiting geometric mismatch and…

机器学习 · 统计学 2026-05-20 Aldric Labarthe

Neural Network Models for Contextual Regression

We propose a neural network model for contextual regression in which the regression model depends on contextual features that determine the active submodel and an algorithm to fit the model. The proposed simple contextual neural network…

机器学习 · 统计学 2026-05-20 Seksan Kiatsupaibul , Pakawan Chansiripas

Bayesian Symbolic Regression for Missing Physics

Model-based approaches for (bio)process systems often suffer from incomplete knowledge of the underlying physical, chemical, or biological laws. Universal differential equations, which embed neural networks within differential equations,…

机器学习 · 统计学 2026-05-20 Arno Strouwen

Stochastic Gradient Variational Inference with Price's Gradient Estimator from Bures-Wasserstein to Parameter Space

For approximating a target distribution given only its unnormalized log-density, stochastic gradient-based variational inference (VI) algorithms are a popular approach. For example, Wasserstein VI (WVI) and black-box VI (BBVI) perform…

机器学习 · 统计学 2026-05-20 Kyurae Kim , Qiang Fu , Yi-An Ma , Jacob R. Gardner , Trevor Campbell

Efficient and Minimax Optimal In-context Nonparametric Regression with Transformers

We study in-context learning for nonparametric regression with $\alpha$-H\"older smooth regression functions, for some $\alpha>0$. We prove that, with $n$ in-context examples and $d$-dimensional regression covariates, a pretrained…

机器学习 · 统计学 2026-05-20 Michelle Ching , Ioana Popescu , Nico Smith , Tianyi Ma , William G. Underwood , Richard J. Samworth

On the Provable Suboptimality of Momentum SGD in Nonstationary Stochastic Optimization

In this paper, we provide a comprehensive theoretical analysis of Stochastic Gradient Descent (SGD) and its momentum variants (Polyak Heavy-Ball and Nesterov) for tracking time-varying optima under strong convexity and smoothness. Our…

机器学习 · 统计学 2026-05-20 Sharan Sahu , Cameron J. Hogan , Martin T. Wells

A Derandomization Framework for Structure Discovery: Applications in Neural Networks and Beyond

Understanding the dynamics of feature learning in neural networks (NNs) remains a significant challenge. The work of (Mousavi-Hosseini et al., 2023) analyzes a multiple index teacher-student setting and shows that a two-layer student…

机器学习 · 统计学 2026-05-20 Nikos Tsikouras , Yorgos Pantis , Ioannis Mitliagkas , Christos Tzamos

Diffusion and Flow-based Copulas: Forgetting and Remembering Dependencies

Copulas are a fundamental tool for modelling multivariate dependencies in data, forming the method of choice in diverse fields and applications. However, the adoption of existing models for multimodal and high-dimensional dependencies is…

机器学习 · 统计学 2026-05-20 David Huk , Theodoros Damoulas

Recovering Wasserstein Distance Matrices from Few Measurements

This paper proposes two algorithms for estimating square Wasserstein distance matrices from a small number of entries. These matrices are used to compute manifold learning embeddings like multidimensional scaling (MDS) or Isomap, but…

机器学习 · 统计学 2026-05-20 Muhammad Rana , Abiy Tasissa , HanQin Cai , Yakov Gavriyelov , Keaton Hamm

Complexity Analysis of Normalizing Constant Estimation: from Jarzynski Equality to Annealed Importance Sampling and beyond

Given an unnormalized probability density $\pi\propto\mathrm{e}^{-V}$, estimating its normalizing constant $Z=\int_{\mathbb{R}^d}\mathrm{e}^{-V(x)}\mathrm{d}x$ or free energy $F=-\log Z$ is a crucial problem in Bayesian statistics,…

机器学习 · 统计学 2026-05-20 Wei Guo , Molei Tao , Yongxin Chen

Generalization Bounds of Surrogate Policies for Combinatorial Optimization Problems

Many real-world decision problems require solving, again and again, combinatorial optimization instances drawn from a common distribution. A recent line of structured learning methods exploits this regularity by learning policies that pair…

机器学习 · 统计学 2026-05-20 Pierre-Cyril Aubin-Frankowski , Yohann De Castro , Axel Parmentier , Alessandro Rudi

Statistical Limits and Efficient Algorithms for Differentially Private Federated Learning

Federated Learning is a leading framework for training ML and AI models collaboratively across numerous user devices or databases. We study the trade-offs among estimation accuracy, privacy constraints, and communication cost for…

机器学习 · 统计学 2026-05-19 Arnab Auddy , Xiangni Peng , Subhadeep Paul

Flowing with Confidence

Generative models can produce nonsensical text, unrealistic images, and unstable materials faster than simulation or human review can absorb; without per-sample confidence, trust erodes. Existing fixes run $k$ ensembles or stochastic…

机器学习 · 统计学 2026-05-19 Friso de Kruiff , Dario Coscia , Max Welling , Erik Bekkers

Generalized Functional ANOVA in Closed-Form: A Unified View of Additive Explanations

The functional ANOVA, or Hoeffding decomposition, provides a principled framework for interpretability by decomposing a model prediction into main effects and higher-order interactions. For independent inputs, this classical decomposition…

机器学习 · 统计学 2026-05-19 Baptiste Ferrere , Nicolas Bousquet , Fabrice Gamboa , Jean-Michel Loubes

Geometric Dictionary Learning of Dynamical Systems with Optimal Transport

Learning dynamical systems through operator-theoretic representations provides a powerful framework for analyzing complex dynamics, as spectral quantities such as eigenvalues and invariant structures encode characteristic time scales and…

机器学习 · 统计学 2026-05-19 Thibaut Germain , Sami Chemlal , Rémi Flamary , Vladimir R. Kostic , Karim Lounici

Forward-Learned Discrete Diffusion: Learning how to noise to denoise faster

Discrete diffusion models are a powerful class of generative models with strong performance across many domains. For efficiency, however, discrete diffusion typically parameterizes the generative (reverse) process with factorized…

机器学习 · 统计学 2026-05-19 Grigory Bartosh , Teodora Pandeva , Sushrut Karmalkar , Javier Zazo

Canonical Regularisation of Wide Feature-Learning Neural Networks

Wide neural networks in the feature-learning regime drive modern deep learning, and yet they remain far less studied than their kernel-regime counterparts. We consider a critical yet under-explored difference between these two regimes: the…

机器学习 · 统计学 2026-05-19 George Whittle , Pranav Vaidhyanathan , Juliusz Ziomek , Natalia Ares , Maike A. Osborne

Wasserstein bounds for denoising diffusion probabilistic models via the F\"ollmer process

This paper studies sampling error bounds for denoising diffusion probabilistic models (DDPMs) in the 2-Wasserstein distance. Our contributions are threefold. (i) Under general Lipschitz-type conditions on the score function and for a broad…

机器学习 · 统计学 2026-05-19 Yuta Koike