机器学习 — Scifaro

Learning manifold diffusion semigroups from graph transition matrices

We consider graph diffusion processes constructed from finite i.i.d. samples drawn from an unknown manifold embedded in ambient Euclidean space, where the graph affinity is defined by an ambient Gaussian kernel matrix. We show that the…

机器学习 · 统计学 2026-05-26 Xiuyuan Cheng , Nan Wu

Choosing Online Experiment Designs under Interference in Ads, Recommendations, and Member-Experience Systems

Online experiments in ads, recommendation, and member-experience systems are often planned before the dominant interference mechanism is known. A treatment may propagate through budgets, inventory, producer exposure, graph spillovers, or…

机器学习 · 统计学 2026-05-26 Prashant Shekhar , Caroline Howard

Nystr\"om Kernel Stein Discrepancy Tests

Kernel Stein discrepancy (KSD) is among the most popular goodness-of-fit (GoF) measures on general domains with a large number of successful deployments. One of the main applications of KSD is in constructing powerful GoF tests. However,…

机器学习 · 统计学 2026-05-26 Florian Kalinke , Zoltán Szabó , Bharath K. Sriperumbudur

Counterfactually Safe Reinforcement Learning

Reinforcement learning algorithms are generally designed to maximize the expected return across a population. However, a policy that is optimal on average may be suboptimal for certain individuals, leading to potential safety concerns. To…

机器学习 · 统计学 2026-05-26 Jingyi Li , Peng Wu , Chengchun Shi

Estimating Mixture Distributions via Stochastic Mirror Descent

We revisit the classical problem of estimating an unknown distribution from its samples by fitting a mixture model that minimizes cross-entropy loss. Framing the task as a stochastic convex optimization problem over the space of $ M…

机器学习 · 统计学 2026-05-26 Mohammadreza Ahmadypour , Tara Javidi , Farinaz Koushanfar

How Neural Reward Models Learn Features for Policy Optimization: A Single-Index Analysis

Reward modeling is not only a prediction problem: in KL-regularized policy optimization, the learned reward is exponentiated to define the deployed policy, so downstream value depends on errors in reward-tilted regions. We study this…

机器学习 · 统计学 2026-05-26 Rei Higuchi , Ryotaro Kawata , Akifumi Wachi , Shokichi Takakura , Kohei Miyaguchi , Taiji Suzuki

Affinity Graph Connectivity in Convex Clustering

We generalize finite-sample bounds for convex clustering to the setting where affinity weights appearing in the objective correspond to a general connected graph. These bounds and their analysis lead to a better understanding of clustering…

机器学习 · 统计学 2026-05-26 Sam Rosen , Jason Xu

Clustering based on Stochastic Dominance with application for risk averters and risk seekers

Stochastic Dominance (SD) theory provides a rigorous framework for selecting superior assets tailored to the asset allocation needs of investors with varying risk preferences (i.e., risk-averse, risk-seeking, and risk-neutral). However,…

机器学习 · 统计学 2026-05-26 Hua Li , Xue Jia , Yilin Kang , Wing-Keung Wong

Multicalibration Boosting: Theory, Convergence, and Transferability

Multicalibration extends classical calibration by requiring predictions to be unbiased over a rich collection of functions, encompassing both prediction slices and subpopulations. It has emerged as a powerful framework for fairness,…

机器学习 · 统计学 2026-05-26 Hanxuan Ye , Hongzhe Li

Detecting Metastable Basins in High Dimensions via Marginal Trajectory Distribution Discrimination

We study the problem of identifying dynamically distinct basins of attraction in high dimensional time-homogeneous Markov processes using only trajectory sampling. This problem is fundamental in the analysis of metastable dynamical systems,…

机器学习 · 统计学 2026-05-26 Taj Jones-McCormick

Causality as the Statistical Conscience of Artificial Intelligence: From Pearl's Ladder to Trustworthy Machines

Modern Artificial Intelligence achieves remarkable predictive power by optimizing statistical risk functionals over vast corpora. Yet a gap separates this from genuine intelligence: the inability to distinguish correlation from causation.…

机器学习 · 统计学 2026-05-26 Ernest Fokoué

Optimal Non-Asymptotic Edgeworth Expansions for Multivariate Neural Network Outputs

Finite-width fully connected neural networks with Gaussian-initialized weights deviate from their infinite-width Gaussian limit, exhibiting non-vanishing higher-order cumulants. We approximate these deviations, for a neural network…

机器学习 · 统计学 2026-05-26 Lucia Celli

Learning Kernel-Based MDPs from Episodic Preferential Feedback

Human feedback often arrives as preferences rather than calibrated numeric rewards, motivating reinforcement learning from preferential feedback, also referred to as reinforcement learning from human feedback (RLHF). We present a rigorous…

机器学习 · 统计学 2026-05-26 Nikola Pavlovic , Sattar Vakili , Qing Zhao

KAPLAN: Kolmogorov-Arnold Prognostic Learnable Activation Networks for Survival Analysis

Survival analysis aims to model how covariates and time jointly shape the time-to-event distribution under right censoring. Classical methods such as the Cox model and generalised additive models (GAMs) require interactions and time-varying…

机器学习 · 统计学 2026-05-26 Stelios Boulitsakis Logothetis , Angela Wood , Pietro Liò

Finite-Particle Convergence Rates for Conservative and Non-Conservative Drifting Models

We propose and analyze a conservative drifting method for one-step generative modeling. The method replaces the original displacement-based drifting velocity by a kernel density estimator (KDE)-gradient velocity, namely the difference of…

机器学习 · 统计学 2026-05-26 Krishnakumar Balasubramanian

Reducing Diffusion Model Memorization with Higher Order Langevin Dynamics

Diffusion/score-based models have emerged as powerful generative models, capable of generating high-quality samples that mimic the training data distribution. However, it has been observed that they are prone to reproducing training…

机器学习 · 统计学 2026-05-26 Benjamin Sterling , Mónica F. Bugallo , Tom Tirer

SURGE: Approximation and Training Free Particle Filter for Diffusion Surrogate

Data assimilation (DA) addresses the problem of sequentially estimating the state of a dynamical system from noisy and incomplete observations. In this work, we employ a diffusion model as a world model to simulate and predict the system's…

机器学习 · 统计学 2026-05-26 Lifu Wei , Yinuo Ren , Naichen Shi , Yiping Lu

Keeping Score: Efficiency Improvements in Neural Likelihood Surrogate Training via Score-Augmented Loss Functions

For stochastic process models, parameter inference is often severely bottlenecked by computationally expensive likelihood functions. Simulation-based inference (SBI) bypasses this restriction by constructing amortized surrogate likelihoods,…

机器学习 · 统计学 2026-05-26 Alexander Shen , Mikael Kuusela

Global Sequential Testing for Multi-Stream Auditing

Across many risk-sensitive areas, it is critical to continuously audit machine learning systems as we receive more data to quickly determine if they are performing as designed. This auditing task can be modeled as a sequential hypothesis…

机器学习 · 统计学 2026-05-26 Beepul Bharti , Ambar Pal , Jeremias Sulam

Why Agentic Theorem Prover Works: A Statistical Provability Theory of Mathematical Reasoning Models

Agentic theorem provers combine a reasoning model, retrieval, search, and a proof assistant verifier, yet it remains unclear which components actually improve finite-budget proof success and why they help on real mathematical workloads. We…

机器学习 · 统计学 2026-05-26 Sho Sonoda , Shunta Akiyama , Yuya Uezato