机器学习 — Scifaro

Approximating Simple ReLU Networks based on Spectral Decomposition of Fisher Information

Properties of Fisher information matrices of 2-layer neural ReLU networks with random hidden weights are studied. For these networks, it is known that the eigenvalue distribution highly concentrates on several eigenspaces approximately. In…

机器学习 · 统计学 2026-05-13 Ka Long Keith Ho , Yoshinari Takeishi , Junichi Takeuchi

Offline Constrained Reinforcement Learning under Partial Data Coverage

We study offline constrained reinforcement learning with general function approximation in discounted constrained Markov decision processes. Prior methods either require full data coverage for evaluating intermediate policies, lack oracle…

机器学习 · 统计学 2026-05-13 Seokmin Ko , Ambuj Tewari , Kihyuk Hong

Integral Imprecise Probability Metrics

Quantifying differences between probability distributions is fundamental to statistics and machine learning, primarily for comparing statistical uncertainty. In contrast, epistemic uncertainty -- due to incomplete knowledge -- requires…

机器学习 · 统计学 2026-05-13 Siu Lun Chau , Michele Caprio , Krikamol Muandet

Smoothed Analysis of Learning from Positive Samples

Binary classification from positive-only samples is a variant of PAC learning where the learner receives i.i.d. positive samples and aims to learn a classifier with low error. Previous work by Natarajan, Gereb-Graus, and Shvaytser…

机器学习 · 统计学 2026-05-13 Jane H. Lee , Anay Mehrotra , Manolis Zampetakis

Bayesian Surrogate Training on Multiple Data Sources: A Hybrid Modeling Strategy

Surrogate models are often used as computationally efficient approximations to complex simulation models, enabling tasks such as solving inverse problems, sensitivity analysis, and probabilistic forward predictions, which would otherwise be…

机器学习 · 统计学 2026-05-13 Philipp Reiser , Paul-Christian Bürkner , Anneli Guthke

Sparsity-Constraint Optimization via Splicing Iteration

Sparsity-constrained optimization underlies many problems in signal processing, statistics, and machine learning. State-of-the-art hard-thresholding (HT) algorithms rely on an appropriately selected continuous step-size parameter to ensure…

机器学习 · 统计学 2026-05-13 Jin Zhu , Junxian Zhu , Zezhi Wang , Borui Tang , Hongmei Lin , Xueqin Wang

Stochastic tensor space feature theory with applications to robust machine learning

In this paper we develop a Multilevel Orthogonal Subspace (MOS) Karhunen-Loeve feature theory based on stochastic tensor spaces, for the construction of robust machine learning features. Training data are treated as instances of a random…

机器学习 · 统计学 2026-05-13 Julio Enrique Castrillon-Candas , Kaili Shi , Dingning Liu , Sicheng Yang , Xiaoling Zhang , Mark Kon , the Alzheimer's Disease Neuroimaging Initiative

Hybrid safe-strong rules for efficient optimization in lasso-type problems

The lasso model has been widely used for model selection in data mining, machine learning, and high-dimensional statistical analysis. However, with the ultrahigh-dimensional, large-scale data sets now collected in many real-world…

机器学习 · 统计学 2026-05-13 Yaohui Zeng , Tianbao Yang , Patrick Breheny

Factual recall in linear associative memories: sharp asymptotics and mechanistic insights

Large language models demonstrate remarkable ability in factual recall, yet the fundamental limits of storing and retrieving input--output associations with neural networks remain unclear. We study these limits in a minimal setting: a…

机器学习 · 统计学 2026-05-12 Alessio Giorlandino , Sebastian Goldt , Antoine Maillard

Price of Quality: Sufficient Conditions for Sparse Recovery using Mixed-Quality Data

We study sparse recovery when observations come from mixed-quality sources: a small collection of high-quality measurements with small noise variance and a larger collection of lower-quality measurements with higher variance. For this…

机器学习 · 统计学 2026-05-12 Youssef Chaabouni , David Gamarnik

Amortizing Causal Sensitivity Analysis via Prior Data-Fitted Networks

Causal sensitivity analysis aims to provide bounds for causal effect estimates in the presence of unobserved confounding. However, existing methods for causal sensitivity analysis are per-instance procedures, meaning that changes to the…

机器学习 · 统计学 2026-05-12 Emil Javurek , Dennis Frauen , Marie Brockschmidt , Jonas Schweisthal , Stefan Feuerriegel

Affine Tracing: A New Paradigm for Probabilistic Linear Solvers

Probabilistic linear solvers (PLSs) return probability distributions that quantify uncertainty due to limited computation in the solution of linear systems. The literature has traditionally distinguished between Bayesian PLSs, which…

机器学习 · 统计学 2026-05-12 Disha Hegde , Marvin Pförtner , Jon Cockayne

Regret Analysis of Guided Diffusion for Black-Box Optimization over Structured Inputs

Guided-diffusion black-box optimization (BO) has shown strong empirical performance on structured design problems such as molecules and crystals, but its regret behavior remains poorly understood. Existing BO regret analyses typically rely…

机器学习 · 统计学 2026-05-12 Masaki Adachi , Anita Yang , Yakun Wang , Song Liu

Multifidelity Gaussian process regression for solving nonlinear partial differential equations

Solving nonlinear partial differential equations (PDEs) using kernel methods offers a compelling alternative to traditional numerical solvers. However, the performance of these methods strongly depends on the choice of kernel. In this work,…

机器学习 · 统计学 2026-05-12 Fatima-Zahrae El-Boukkouri , Josselin Garnier , Olivier Roustant

Uncertainty in Physics and AI: Taxonomy, Quantification, and Validation

Reliable uncertainty quantification is essential for the use of machine learning in physics, where scientific discoveries depend on validated probabilistic statements. We provide a structured overview of uncertainty quantification in ML for…

机器学习 · 统计学 2026-05-12 Manuel Haußmann , Ramon Winterhalder , Maria Ubiali

Fast Training of Mixture-of-Experts for Time Series Forecasting via Expert Loss Integration

We propose a novel adaptive Mixture-of-Experts (MoE) framework for time series forecasting that enhances expert specialization by incorporating expert-specific loss information directly into the training process. Notably, the overall…

机器学习 · 统计学 2026-05-12 Btissame El Mahtout , Florian Ziel

Characterizing the Generalization Error of Random Feature Regression with Arbitrary Data-Augmentation

This paper aims at analyzing the regularization effect that data augmentation induces on supervised regression methods in the proportional regime, where the number of covariates grows proportionally to the number of samples. We provide a…

机器学习 · 统计学 2026-05-12 Lucas Morisset , Alain Durmus , Adrien Hardy

Scalable Gaussian process inference via neural feature maps

We present a theoretically grounded Gaussian process framework that leverages neural feature maps to construct expressive kernels. We show that the learned feature map can be interpreted as an optimal low-rank approximation to a Gram matrix…

机器学习 · 统计学 2026-05-12 Anthony Stephenson

Coarsening Linear Non-Gaussian Causal Models with Cycles

Recent work on causal abstraction, in particular graphical approaches focusing on causal structure between clusters of variables, aims to summarize a high-dimensional causal structure in terms of a low-dimensional one. Existing methods for…

机器学习 · 统计学 2026-05-12 Francisco Madaleno , Francisco C Pereira , Alex Markham

PFN-TS: Thompson Sampling for Contextual Bandits via Prior-Data Fitted Networks

Thompson sampling is a widely used strategy for contextual bandits: at each round, it samples a reward function from a Bayesian posterior and acts greedily under that sample. Prior-data fitted networks (PFNs), such as TabPFN v2+ and TabICL…

机器学习 · 统计学 2026-05-12 Yan Shuo Tan , Kenyon Ng , Ruizhe Deng , Sumetha Loganathan , Qiong Zhang , Bibhas Chakraborty