机器学习 — Scifaro

One-Shot Generative Flows: Existence and Obstructions

We study dynamic measure transport for generative modeling, focusing on transport maps that connect a source measure $P_0$ to a target measure $P_1$ by integrating a velocity field of the form $v_t(x) = \mathbb{E}[\dot X_t \mid X_t = x]$,…

机器学习 · 统计学 2026-05-11 Panos Tsimpos , Daniel Sharp , Youssef Marzouk

From Average Sensitivity to Small-Loss Regret Bounds under Random-Order Model

We study online learning in the random-order model, where the multiset of loss functions is chosen adversarially but revealed in a uniformly random order. By extending the batch-to-online transformation of Dong and Yoshida (2023), we show…

机器学习 · 统计学 2026-05-11 Shinsaku Sakaue , Yuichi Yoshida

Emergence of Distortions in High-Dimensional Guided Diffusion Models

Classifier-free guidance (CFG) is the de facto standard for conditional sampling in diffusion models, yet it often reduces sample diversity. Using tools from statistical physics, we analyze the emergence of generative distortions induced by…

机器学习 · 统计学 2026-05-11 Enrico Ventura , Beatrice Achilli , Luca Ambrogioni , Carlo Lucibello

Persistent-Transient Policy Evaluation for Markov Chains via Minimal Peripheral Quotients

We study fixed-policy evaluation for finite Markov chains that may be reducible and periodic. Classical evaluation methods with gain and bias decomposition are not always diagnostic: the gain records only invariant Ces\`aro averages, while…

机器学习 · 统计学 2026-05-11 Yang Xu , Vaneet Aggarwal

Diffusion Path Samplers via Sequential Monte Carlo

We develop diffusion-based samplers for target distributions known up to a normalising constant. To this end, we rely on the well-known diffusion path that smoothly interpolates between a simple base distribution and the target, popularised…

机器学习 · 统计学 2026-05-11 James Matthew Young , Paula Cordero-Encinar , Sebastian Reich , Andrew Duncan , O. Deniz Akyildiz

Multi-environment Invariance Learning with Missing Data

Learning models that can handle distribution shifts is a key challenge in domain generalization. Invariance learning, an approach that focuses on identifying features invariant across environments, improves model generalization by capturing…

机器学习 · 统计学 2026-05-11 Yiran Jia , Jelena Bradic

Stationary Reweighting Yields Local Convergence of Soft Fitted Q-Iteration

Fitted $Q$-iteration (FQI) and soft FQI are widely used value-based methods for offline reinforcement learning, but their standard stability guarantees often depend on Bellman completeness, a strong closure condition that can fail under…

机器学习 · 统计学 2026-05-11 Lars van der Laan , Nathan Kallus

Fitted $Q$ Evaluation Without Bellman Completeness via Stationary Weighting

Fitted $Q$-evaluation (FQE) is a standard regression-based tool for off-policy evaluation, but existing stability guarantees often rely on Bellman completeness, a strong closure condition that can fail under function approximation. We study…

机器学习 · 统计学 2026-05-11 Lars van der Laan , Nathan Kallus

Bellman Calibration for $V$-Learning in Offline Reinforcement Learning

Reliable long-horizon value prediction is difficult in offline reinforcement learning because fitted value methods combine bootstrapping, function approximation, and distribution shift, while standard guarantees often require Bellman…

机器学习 · 统计学 2026-05-11 Lars van der Laan , Nathan Kallus

Learning graphons from data: Random walks, transfer operators, and spectral clustering

Many signals evolve in time as a stochastic process, randomly switching between states over discretely sampled time points. Here we make an explicit link between the underlying stochastic process of a signal that can take on a bounded…

机器学习 · 统计学 2026-05-11 Stefan Klus , Jason J. Bramburger

DARTS: Targeting Prognostic Covariates in Budget-Constrained Sequential Experiments

Randomized controlled trials typically assume that prognostic covariates are known and available at no cost. In practice, obtaining high-dimensional pretreatment data is costly, forcing a trade-off between covariate-adaptive precision and a…

机器学习 · 统计学 2026-05-08 Kateryna Husar , Alexander Volfovsky

Dynamic Treatment on Networks

In networks, effective dynamic treatment allocation requires deciding both whom to treat and also when, so as to amplify policy impact through spillovers. An early intervention at a well-connected node can trigger cascades that change which…

机器学习 · 统计学 2026-05-08 Bengusu Nar , Jiguang Li , Veronika Ročková , Panos Toulis

Risk-Controlled Post-Processing of Decision Policies

Predictive models are often deployed through existing decision policies that stakeholders are reluctant to change unless a risk constraint requires intervention. We study risk-controlled post-processing: given a deterministic baseline…

机器学习 · 统计学 2026-05-08 Sunay Joshi , Tao Wang , Hamed Hassani , Edgar Dobriban

Neural-Actuarial Longevity Forecasting: Anchoring LSTMs for Explainable Risk Management

Traditional multi-population models, such as the Li-Lee framework, rely on the assumption of mean-reverting country-specific deviations. However, recent data from high-longevity clusters suggest a systemic break in this paradigm. We…

机器学习 · 统计学 2026-05-08 Davide Rindori

Decoupled PFNs: Identifiable Epistemic-Aleatoric Decomposition via Structured Synthetic Priors

Prior-Fitted Networks (PFNs) amortize Bayesian prediction by meta-learning over a synthetic task prior, but their standard output is a posterior predictive distribution over noisy observations. For sequential decision-making, such as active…

机器学习 · 统计学 2026-05-08 Richard Bergna , Stefan Depeweg , José Miguel Hernández-Lobato

Beyond the Independence Assumption: Finite-Sample Guarantees for Deep Q-Learning under $\tau$-Mixing

Finite-sample analyses of deep Q-learning typically treat replayed data as independent, even though it is sampled from temporally dependent state-action trajectories. We study the Deep Q-networks (DQN) algorithm under explicit dependence by…

机器学习 · 统计学 2026-05-08 Leon Halgryn , Sophie Langer , Janusz M. Meylahn , E. Moritz Hahn

The Interplay of Data Structure and Imbalance in the Learning Dynamics of Diffusion Models

Real-world datasets are inherently heterogeneous, yet how per-class structural differences and sampling imbalance shape the training dynamics of diffusion models-and potentially exacerbate disparities-remains poorly understood. While models…

机器学习 · 统计学 2026-05-08 Flavio Nicoletti , Chenxiao Ma , Enrico Ventura , Luca Saglietti , Stefano Sarao Mannelli

End-to-End Identifiable and Consistent Recurrent Switching Dynamical Systems

Learning identifiable representations in deep generative models remains a fundamental challenge, particularly for sequential data with regime-switching dynamics. Existing approaches establish identifiability under restrictive assumptions,…

机器学习 · 统计学 2026-05-08 Carles Balsells-Rodas , Zhengrui Xiang , Xavier Sumba , Yingzhen Li

Multimodal Deep Generative Model for Semi-Supervised Learning under Class Imbalance

When modeling class-imbalanced data, it is crucial to address the imbalance, as models trained on such data tend to be biased towards the majority classes. This problem is amplified under partial supervision, where pseudo-labels for…

机器学习 · 统计学 2026-05-08 Heegeon Yoon , Heeyoung Kim

ConquerNet: Convolution-Smoothed Quantile ReLU Neural Networks with Minimax Guarantees

Quantile regression is a fundamental tool for distributional learning but poses significant optimization challenges for deep models due to the non-smoothness of the pinball loss. We propose ConquerNet, a class of…

机器学习 · 统计学 2026-05-08 Tianpai Luo , Fangwei Wu , Weichi Wu