Machine Learning - Scifaro

Score-based diffusion models for diffuse optical tomography with uncertainty quantification

Score-based diffusion models are a recently developed framework for posterior sampling in Bayesian inverse problems that achieves state-of-the-art performance on severely ill-posed problems by leveraging a powerful prior distribution learned from…

Machine Learning · Statistics 2026-02-04 Fabian Schneider, Meghdoot Mozumder, Konstantin Tamarov, Leila Taghizadeh, Tanja Tarvainen, Tapio Helin, Duc-Lam Duong

Improving the Linearized Laplace Approximation via Quadratic Approximations

Deep neural networks (DNNs) often produce overconfident out-of-distribution predictions, motivating Bayesian uncertainty quantification. The Linearized Laplace Approximation (LLA) achieves this by linearizing the DNN and applying Laplace…

Machine Learning · Statistics 2026-02-04 Pedro Jiménez, Luis A. Ortega, Pablo Morales-Álvarez, Daniel Hernández-Lobato

Multiparameter Uncertainty Mapping in Quantitative Molecular MRI using a Physics-Structured Variational Autoencoder (PS-VAE)

Quantitative imaging methods, such as magnetic resonance fingerprinting (MRF), aim to extract interpretable pathology biomarkers by estimating biophysical tissue parameters from signal evolutions. However, the pattern-matching algorithms or…

Machine Learning · Statistics 2026-02-04 Alex Finkelstein, Ron Moneta, Or Zohar, Michal Rivlin, Moritz Zaiss, Dinora Friedmann Morvinski, Or Perlman

Latent Neural-ODE for Model-Informed Precision Dosing: Overcoming Structural Assumptions in Pharmacokinetics

Accurate estimation of tacrolimus exposure, quantified by the area under the concentration-time curve (AUC), is essential for precision dosing after renal transplantation. Current practice relies on population pharmacokinetic (PopPK) models…

Machine Learning · Statistics 2026-02-04 Benjamin Maurel, Agathe Guilloux, Sarah Zohar, Moreno Ursino, Jean-Baptiste Woillard

Online Conformal Prediction via Universal Portfolio Algorithms

Online conformal prediction (OCP) seeks prediction intervals that achieve long-run $1-\alpha$ coverage for arbitrary (possibly adversarial) data streams, while remaining as informative as possible. Existing OCP methods often require manual…

Machine Learning · Statistics 2026-02-04 Tuo Liu, Edgar Dobriban, Francesco Orabona

Unified Inference Framework for Single and Multi-Player Performative Prediction: Method and Asymptotic Optimality

Performative prediction characterizes environments where predictive models alter the very data distributions they aim to forecast, triggering complex feedback loops. While prior research treats single-agent and multi-agent performativity as…

Machine Learning · Statistics 2026-02-04 Zhixian Zhang, Xiaotian Hou, Linjun Zhang

Training-Free Self-Correction for Multimodal Masked Diffusion Models

Masked diffusion models have emerged as a powerful framework for text and multimodal generation. However, their sampling procedure updates multiple tokens simultaneously and treats generated tokens as immutable, which may lead to error…

Machine Learning · Statistics 2026-02-04 Yidong Ouyang, Panwen Hu, Zhengyan Wan, Zhe Wang, Liyan Xie, Dmitriy Bespalov, Ying Nian Wu, Guang Cheng, Hongyuan Zha, Qiang Sun

Rethinking Test-Time Training: Tilting The Latent Distribution For Few-Shot Source-Free Adaptation

Often, constraints arise in deployment settings where even lightweight parameter updates, e.g., parameter-efficient fine-tuning, could induce model shift or tuning instability. We study test-time adaptation of foundation models for few-shot…

Machine Learning · Statistics 2026-02-04 Tahir Qasim Syed, Behraj Khan

Admissibility of Stein Shrinkage for Batch Normalization in the Presence of Adversarial Attacks

Batch normalization (BN) is a ubiquitous operation in deep neural networks, primarily used to improve stability and regularization during training. BN centers and scales feature maps using sample means and variances, which are naturally…

Machine Learning · Statistics 2026-02-04 Sofia Ivolgina, P. Thomas Fletcher, Baba C. Vemuri

Learning non-equilibrium diffusions with Schrödinger bridges: from exactly solvable to simulation-free

We consider the Schrödinger bridge problem which, given ensemble measurements of the initial and final configurations of a stochastic dynamical system and some prior knowledge on the dynamics, aims to reconstruct the "most likely"…

Machine Learning · Statistics 2026-02-04 Stephen Y. Zhang, Michael P H Stumpf

Full-Batch Gradient Descent Outperforms One-Pass SGD: Sample Complexity Separation in Single-Index Learning

It is folklore that reusing training data more than once can improve the statistical efficiency of gradient-based learning. However, beyond linear regression, the theoretical advantage of full-batch gradient descent (GD, which always reuses…

Machine Learning · Statistics 2026-02-03 Filip Kovačević, Hong Chang Ji, Denny Wu, Mahdi Soltanolkotabi, Marco Mondelli

Transfer Learning Through Conditional Quantile Matching

We introduce a transfer learning framework for regression that leverages heterogeneous source domains to improve predictive performance in a data-scarce target domain. Our approach learns a conditional generative model separately for each…

Machine Learning · Statistics 2026-02-03 Yikun Zhang, Steven Wilkins-Reeves, Wesley Lee, Aude Hofleitner

PCA of probability measures: Sparse and Dense sampling regimes

A common approach to perform PCA on probability measures is to embed them into a Hilbert space where standard functional PCA techniques apply. While convergence rates for estimating the embedding of a single measure from $m$ samples are…

Machine Learning · Statistics 2026-02-03 Gachon Erell, Jérémie Bigot, Elsa Cazelles

Learning Beyond the Gaussian Data: Learning Dynamics of Neural Networks on an Expressive and Cumulant-Controllable Data Model

We study the effect of high-order statistics of data on the learning dynamics of neural networks (NNs) by using a moment-controllable non-Gaussian data model. Considering the expressivity of two-layer neural networks, we first construct the…

Machine Learning · Statistics 2026-02-03 Onat Ure, Samet Demir, Zafer Dogan

Training-free score-based diffusion for parameter-dependent stochastic dynamical systems

Simulating parameter-dependent stochastic differential equations (SDEs) presents significant computational challenges, as separate high-fidelity simulations are typically required for each parameter value of interest. Despite the success of…

Machine Learning · Statistics 2026-02-03 Minglei Yang, Sicheng He

Stochastic Interpolants in Hilbert Spaces

Although diffusion models have successfully extended to function-valued data, stochastic interpolants -- which offer a flexible way to bridge arbitrary distributions -- remain limited to finite-dimensional settings. This work bridges this…

Machine Learning · Statistics 2026-02-03 James Boran Yu, RuiKang OuYang, Julien Horwood, José Miguel Hernández-Lobato

Reliable Real-Time Value at Risk Estimation via Quantile Regression Forest with Conformal Calibration

Rapidly evolving market conditions call for real-time risk monitoring, but its online estimation remains challenging. In this paper, we study the online estimation of one of the most widely used risk measures, Value at Risk (VaR). Its…

Machine Learning · Statistics 2026-02-03 Du-Yi Wang, Guo Liang, Kun Zhang, Qianwen Zhu

Transformers as Measure-Theoretic Associative Memory: A Statistical Perspective and Minimax Optimality

Transformers excel through content-addressable retrieval and the ability to exploit contexts of, in principle, unbounded length. We recast associative memory at the level of probability measures, treating a context as a distribution over…

Machine Learning · Statistics 2026-02-03 Ryotaro Kawata, Taiji Suzuki

Inference-Aware Meta-Alignment of LLMs via Non-Linear GRPO

Aligning large language models (LLMs) to diverse human preferences is fundamentally challenging since criteria can often conflict with each other. Inference-time alignment methods have recently gained popularity as they allow LLMs to be…

Machine Learning · Statistics 2026-02-03 Shokichi Takakura, Akifumi Wachi, Rei Higuchi, Kohei Miyaguchi, Taiji Suzuki

Density-Informed Pseudo-Counts for Calibrated Evidential Deep Learning

Evidential Deep Learning (EDL) is a popular framework for uncertainty-aware classification that models predictive uncertainty via Dirichlet distributions parameterized by neural networks. Despite its popularity, its theoretical foundations…

Machine Learning · Statistics 2026-02-03 Pietro Carlotti, Nevena Gligić, Arya Farahi