机器学习 — Scifaro

Surprises in Proper Positive-Only Learning

Binary classification from positive-only samples is a variant of PAC learning in which the learner receives i.i.d. samples from the positive region of an unknown target concept, but is evaluated under the original distribution (which places…

机器学习 · 统计学 2026-06-26 Shai Ben-David , Farnam Mansouri , Anay Mehrotra , Manolis Zampetakis

Local Fokker--Planck Geometry for Score Estimation: Heat-Ball Mean-Value Representations and Exact High-Dimensional Sampling

Score-based generative models and Langevin samplers rely on estimating the score function $\nabla_x\log p_t(x)$ of a forward diffusion. Classically this is tractable when the drift is linear: the marginal density is Gaussian and the score…

机器学习 · 统计学 2026-06-26 Jiayao Bai , Lang Deng , Yi Du , Yifei Jia

Adversarial Contamination Meets Hard Thresholding: An Iterative Algorithm with Signal Adaptivity and Minimax Optimality

Pervasive data contamination -- stemming from measurement errors, outliers, or adversarial corruption -- has motivated the development of robust statistical methods. In this context, we propose a two-stage Adversarial…

机器学习 · 统计学 2026-06-26 Shixiang Liu , Hanming Yang

The Decision Geometry of Covariance Estimation for the Global Minimum-Variance Portfolio under Heavy Tails

The global minimum-variance portfolio (GMVP) is the canonical decision built from an estimated covariance matrix, yet covariance estimators are universally evaluated by matrix-norm loss, which is not the object the decision depends on. We…

机器学习 · 统计学 2026-06-25 Xavier Fonseca

Directed Graph Topology Inference via Graph Filter Identification

We address the problem of inferring a directed network from nodal measurements generated by linear diffusion dynamics on the sought graph. Observations are modeled as the outputs of a graph convolutional filter, i.e., a polynomial (with…

机器学习 · 统计学 2026-06-25 Rasoul Shafipour , Andrei Buciulea , Santiago Segarra , Antonio G. Marques , Gonzalo Mateos

When are likely answers right? On Sequence Probability and Correctness in LLMs

Many decoding methods for large language models can be understood as shifting probability mass toward outputs that are more likely under the model, either locally at the token level or globally at the sequence level. Therefore, their…

机器学习 · 统计学 2026-06-25 Johannes Zenn , Jonas Geiping

Ribbon: Scalable Approximation and Robust Uncertainty Quantification

Reliably quantifying predictive uncertainty is difficult for complex, high-dimensional, or misspecified models. Both fully Bayesian and bootstrap resampling methods provide principled uncertainty estimates but are often too expensive for…

机器学习 · 统计学 2026-06-25 Graham Gibson , John Tipton , Kellin Rumsey , Natalie Klein

Beyond Global Divergences: A Local-Mass Perspective on Bayesian Inference

Global objectives, such as KL divergence and ELBO, are widely used in Bayesian inference for measuring distributional discrepancy. This paper studies their local-mass behaviour that is not directly captured by such objectives. We introduce…

机器学习 · 统计学 2026-06-25 Hanli Xu , Fengxiang He , Sarat Moka

XMSE-Aware Adaptive Empirical Bayes Estimation

Empirical Bayes (EB) estimators can match the first-order asymptotic risk of maximum likelihood (ML) while behaving very differently at second order: recent excess mean squared error (XMSE) analysis shows that kernel-based EB estimation may…

机器学习 · 统计学 2026-06-25 Minghao Chen , Jiale Zheng

A probabilistic framework for online test-time adaptation

This paper presents a probabilistic framework for online test-time adaptation problems. In them, a model is trained on labeled data but must adapt to unlabeled data at test time under the assumption that training and test distributions…

机器学习 · 统计学 2026-06-24 Daniel Corrales , David Ríos Insua

The Role of Input Dimensionality in the Emergence and Targeted Control of Adversarial Examples

Several theoretical works have tried to explain the adversarial vulnerability of deep neural networks through properties of high-dimensional geometry. However, the assumptions underlying these works are rarely examined empirically, and…

机器学习 · 统计学 2026-06-24 Nasrin Malekzadeh Goradel , Niccolo Pancino , Yaser Gholizade Atani , Benedetta Tondi , Giovanni Bellettini , Mauro Barni

Improved Guarantees for Heterogeneous Treatment-Effect Estimation via Matrix Completion

A central goal of modern causal inference is estimating heterogeneous treatment effects to answer questions like "how does an intervention affect each unit," rather than only on average. We study this problem with panel-data where we…

机器学习 · 统计学 2026-05-29 Anay Mehrotra , Phuc Tran , Van H. Vu , Manolis Zampetakis

Leave a Window Out: Modifying the Jackknife for Predictive Inference in Time Series

Conformal prediction methods enjoy strong theoretical and empirical predictive inference performance, provided the data is exchangeable, and predictors are trained in a memoryless fashion. However, these assumptions and constraints are…

机器学习 · 统计学 2026-05-29 Hanyang Jiang , Rina Foygel Barber , Ashwin Pananjady , Yao Xie

Wasserstein Contraction of Coordinate Ascent Variational Inference

We study the contraction in Wasserstein distance of the coordinate ascent variational inference algorithm. This is shown to hold under a transport-information inequality at the fixed points and a functional smoothness condition. The results…

机器学习 · 统计学 2026-05-29 Rocco Caprio , Adrien Corenflos , Sam Power

Visual Spatial Learning: Single-Field Spatial Interpolation Using Convolutional Neural Networks

Predicting a complete spatially correlated field from sparse observations is a fundamental challenge in spatial statistics and environmental modelling. Classical interpolation methods such as Kriging rely on Gaussian process assumptions and…

机器学习 · 统计学 2026-05-29 Daniel Tinoco , Raquel Menezes , Carlos Baquero , Alexandra Silva

Diffusion Models Are Statistically Optimal for Learning Low-Dimensional Multi-Modal Distributions

Score-based diffusion models have demonstrated remarkable empirical success in learning high-dimensional distributions, particularly those exhibiting low-dimensional and multi-modal structures. However, theoretical understanding of their…

机器学习 · 统计学 2026-05-29 Jingda Wu , Changxiao Cai

Joint Model and Data Sparsification via the Marginal Likelihood

Sparse recovery in linear systems underpins applications from signal processing to high-dimensional regression. Sparse Bayesian Learning, grounded in the principle of automatic relevance determination (ARD), offers a practical Bayesian…

机器学习 · 统计学 2026-05-29 Alexander Timans , Thomas Möllenhoff , Christian A. Naesseth , Mohammad Emtiyaz Khan , Eric Nalisnick

Instance-dependent Stochastic Lipschitz bandit

We study the Lipschitz bandit problem, where a learner sequentially maximizes an unknown Lipschitz function $f$ over a domain $\mathcal{X} \subset [0,1]^d$ using noisy pointwise evaluations. Existing regret bounds are either worst-case,…

机器学习 · 统计学 2026-05-29 Marius Potfer , Vianney Perchet

Eigen-Spike Emergence and Quadratic Equivalents for Conjugate Kernels on Nonlinearly Separable Data

Recent work in random matrix theory (RMT) has developed the notion of deterministic equivalents: typically linear surrogate models that approximate the spectral behavior of large nonlinear random matrices, such as nonlinear feature maps in…

机器学习 · 统计学 2026-05-29 Collin Cranston , Zhichao Wang , Todd Kemp , Michael W. Mahoney

Matching Rates and Optimal Allocation for Federated Probe-Logit Distillation under Heterogeneous Bandwidth Budgets

In federated language modeling, $K$ nodes each hold $n$ samples but cannot pool data or exchange full-precision gradients or weights. We study the minimax rate at which a conditional distribution over $V$ tokens can be estimated when each…

机器学习 · 统计学 2026-05-29 Prasanjit Dubey , Xiaoming Huo