机器学习 — Scifaro

Posterior Bayesian Neural Networks with Dependent Weights

We consider fully connected and feedforward deep neural networks with dependent and possibly heavy-tailed weights, as introduced in [26], to address limitations of the standard Gaussian prior. It has been proved in [26] that, as the number…

机器学习 · 统计学 2026-05-14 Nicola Apollonio , Giovanni Franzina , Giovanni Luca Torrisi

High-Dimensional Analysis of Bootstrap Ensemble Classifiers

Bootstrap methods have long been the cornerstone of ensemble learning in machine learning. This paper presents a theoretical analysis of bootstrap techniques applied to the Least Square Support Vector Machine (LSSVM) ensemble in the context…

机器学习 · 统计学 2026-05-14 Malik Tiomoko , Hamza Cherkaoui , Mohamed El Amine Seddik , Cosme Louart , Ekkehard Schnoor , Balazs Kegl

Kernel Embeddings and the Separation of Measure Phenomenon

We prove that kernel covariance embeddings lead to information-theoretically perfect separation of distinct continuous probability distributions. In statistical terms, we establish that testing for the \emph{equality} of two non-atomic…

机器学习 · 统计学 2026-05-14 Leonardo V. Santoro , Kartik G. Waghmare , Victor M. Panaretos

Accelerating Particle-based Energetic Variational Inference

In this work, we propose a new particle-based variational inference (ParVI) method for accelerating the Energetic Variational Inference with Implicit scheme (EVI-Im) introduced in Ref. \cite{wang2021particle}. Inspired by energy…

机器学习 · 统计学 2026-05-14 Xuelian Bao , Lulu Kang , Chun Liu , Yiwei Wang

Distributional Autoencoders Know the Score

The Distributional Principal Autoencoder (DPA) combines distributionally correct reconstruction with principal-component-like interpretability of the encodings. In this work, we provide exact theoretical guarantees on both fronts. First, we…

机器学习 · 统计学 2026-05-14 Andrej Leban

Ensemble Transport Filter via Optimized Maximum Mean Discrepancy

In this paper, we present a new ensemble-based filter method by reconstructing the analysis step of the particle filter through a transport map, which directly transports prior particles to posterior particles. The transport map is…

机器学习 · 统计学 2026-05-14 Dengfei Zeng , Lijian Jiang

Generative Modeling by Minimizing the Wasserstein-2 Loss

This paper develops a generative model by minimizing the second-order Wasserstein loss (the $W_2$ loss) through a distribution-dependent ordinary differential equation (ODE), whose dynamics involves the Kantorovich potential associated with…

机器学习 · 统计学 2026-05-14 Yu-Jui Huang , Zachariah Malik

Small Area Estimation of Case Growths for Timely COVID-19 Outbreak Detection

The COVID-19 pandemic has exerted a profound impact on the global economy and continues to exact a significant toll on human lives. The COVID-19 case growth rate stands as a key epidemiological parameter to estimate and monitor for…

机器学习 · 统计学 2026-05-14 Zhaowei She , Zilong Wang , Jagpreet Chhatwal , Turgay Ayer

Training VAEs Under Structured Residuals

Variational auto-encoders (VAEs) are a popular and powerful deep generative model. Previous works on VAEs have assumed a factorized likelihood model, whereby the output uncertainty of each pixel is assumed to be independent. This…

机器学习 · 统计学 2026-05-14 Gara Dorta , Sara Vicente , Lourdes Agapito , Neill D. F. Campbell , Ivor Simpson

Structured Uncertainty Prediction Networks

This paper is the first work to propose a network to predict a structured uncertainty distribution for a synthesized image. Previous approaches have been mostly limited to predicting diagonal covariance matrices. Our novel model learns to…

机器学习 · 统计学 2026-05-14 Gara Dorta , Sara Vicente , Lourdes Agapito , Neill D. F. Campbell , Ivor Simpson

Model-based Bootstrap of Controlled Markov Chains

We propose and analyze a model-based bootstrap for transition kernels in finite controlled Markov chains (CMCs) with possibly nonstationary or history-dependent control policies, a setting that arises naturally in offline reinforcement…

机器学习 · 统计学 2026-05-13 Ziwei Su , Imon Banerjee , Diego Klabjan

Multi-Variable Conformal Prediction: Optimizing Prediction Sets without Data Splitting

Conformal prediction constructs prediction sets with finite-sample coverage guarantees, but its calibration stage is structurally constrained to a scalar score function and a single threshold variable - forcing shapes of prediction sets to…

机器学习 · 统计学 2026-05-13 Laura Lützow , Simone Garatti , Marco C. Campi , Lars Lindemann , Matthias Althoff

Optimal Policy Learning under Budget and Coverage Constraints

We study optimal policy learning under combined budget and minimum coverage constraints. We show that the problem admits a knapsack-type structure and that the optimal policy can be characterized by an affine threshold rule involving both…

机器学习 · 统计学 2026-05-13 Giovanni Cerulli

Information-Theoretic Generalization Bounds for Sequential Decision Making

Information-theoretic generalization bounds based on the supersample construction are a central tool for algorithm-dependent generalization analysis in the batch i.i.d.~setting. However, existing supersample conditional mutual information…

机器学习 · 统计学 2026-05-13 Futoshi Futami , Masahiro Fujisawa

Variance-aware Reward Modeling with Anchor Guidance

Standard Bradley--Terry (BT) reward models are limited when human preferences are pluralistic. Although soft preference labels preserve disagreement information, BT can only express it by shrinking reward margins. Gaussian reward models…

机器学习 · 统计学 2026-05-13 Shuxing Fang , Ruijian Han , Liangyu Zhang , Fan Zhou

Minimax Rates and Spectral Distillation for Tree Ensembles

Tree ensembles such as random forests (RFs) and gradient boosting machines (GBMs) are among the most widely used supervised learners, yet their theoretical properties remain incompletely understood. We adopt a spectral perspective on these…

机器学习 · 统计学 2026-05-13 Binh Duc Vu , David S. Watson

Posterior Contraction Rates for Sparse Kolmogorov-Arnold Networks in Anisotropic Besov Spaces

We study posterior contraction rates for sparse Bayesian Kolmogorov-Arnold networks (KANs) over anisotropic Besov spaces, providing a statistical foundation of KANs from a Bayesian point of view. We show that sparse Bayesian KANs equipped…

机器学习 · 统计学 2026-05-13 Jeunghun Oh , Kyeongwon Lee , Jaeyong Lee , Lizhen Lin

Learning U-Statistics with Active Inference

$U$-statistics play a central role in statistical inference. In many modern applications, however, acquiring the labels required for $U$-statistics is costly. Motivated by recent advances in active inference, we develop an active inference…

机器学习 · 统计学 2026-05-13 Xiaoning Wang , Yuyang Huo , Liuhua Peng , Changliang Zou

Exact Stiefel Optimization for Probabilistic PLS: Closed-Form Updates, Error Bounds, and Calibrated Uncertainty

Probabilistic partial least squares (PPLS) is a central likelihood-based model for two-view learning when one needs both interpretable latent factors and calibrated uncertainty. Building on the identifiable parameterization of Bouhaddani et…

机器学习 · 统计学 2026-05-13 Haoran Hu , Xingce Wang

Post-ADC Inference: Valid Inference After Active Data Collection

The validity of statistical inference depends critically on how data are collected. When data gathered through active data collection (ADC) are reused for a post-hoc inferential task, conventional inference can fail because the sampling is…

机器学习 · 统计学 2026-05-13 Shuichi Nishino , Tomohiro Shiraishi , Teruyuki Katsuoka , Ichiro Takeuchi